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Introduction 


The  37th  Annual  Sanibel  Symposium,  orga¬ 
nized  by  the  faculty,  students,  and  staff  of 
the  Quantum  Theory  Project  of  the  University  of 
Florida,  was  held  on  March  1-7, 1997.  The  meeting 
was  again  held  at  the  Ponce  de  Leon  Conference 
Center  in  St.  Augustine,  Florida.  More  than  300 
participants  gathered  for  seven  days  of  lectures 
and  informal  discussions. 

The  symposium  followed  the  established  format 
with  plenary  and  poster  sessions.  This  year,  the 
schedule  was  shortened  somewhat  with  a  compact 
seven-day  integrated  program  of  quantum  biol- 
ogy,  quantum  chemistry,  and  condensed  matter 
physics.  The  topics  of  the  sessions  covered  by 
these  proceedings  include  Density  Functional  The¬ 
ory  (DFT)  and  Applications,  DFT  for  Intermolecu- 
lar  Forces,  Clusters  and  Nanostructures,  Theory  of 
Floppy  Molecules,  Exotic  Anions,  Theoretical  Inor¬ 
ganic  Chemistry,  Foundations  of  Quantum  Me¬ 
chanics,  Non-Linear  Optics,  Quantum-Classical 
Molecular  Dynamics,  and  Industrial  Application. 

The  articles  have  been  subjected  to  the  ordinary 
refereeing  procedures  of  the  International  Journal  of 
Quantum  Chemistry,  The  articles  presented  in  the 
sessions  on  quantum  biology  and  associated  poster 
sessions  are  published  in  a  separate  issue  of  the 
International  Journal  of  Quantum  Chemistry. 
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ABSTRACT:  From  1927  to  1943,  the  mathematical  and  conceptual  foundations  of 
quantum  mechanics,  and  their  prolongations  into  the  formulation  of  statistical  mechanics, 
were  either  the  direct  subject  of  or  one  of  the  primary  motivations  for  several  publications 
by  von  Neumann.  We  contrast  the  reception  received  by  the  work  done  during  the  first 
5  years  (1927-1932)  and  that  received  by  the  monumental  mathematical  constructs 
wrought  in  the  second  period  (1934-1943);  we  describe  how  the  latter  were  to  lead  to  the 
elaboration  of  a  formalism  generalizing  the  fundamental  structures  of  quantum  mechanics 
so  that  they  can  encompass  and  deal  consistently  with  situations  typical  of  the  quantum 
many-body  problems  of  statistical  mechanics.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant 
Chem  65:  379-387,  1997 


Introduction 

Von  Neumann  died  almost  exactly  40  years 
ago  (February  8,  1957).  A  mathematician  had 
passed  away,  who — like  Poincare  and  Gauss  be¬ 
fore  him — had  contributed  to  such  a  vast  array  of 
mathematical  endeavors  that  it  is  a  trite  question 
to  ask  whether  he  was  of  a  pure  or  applied  persua¬ 
sion.  Every  participant  in  this  symposium  is  aware 
of  the  impact  von  Neumann  has  left  upon  the 
methods  that  we  rely  on  in  our  everyday  scientific 
activities;  to  wit:  his  1932  book  is  still  recognized 
today  as  one  of  the  landmarks  one  can  turn  to  for  a 
concise  formulation  of  the  foundations  of  quantum 
mechanics;  and  the  computer,  which  he  created,  is 
an  evermore  powerful  investigative  tool  that  is 
pervasive  in  the  present  company.  It  seems  there- 
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fore  proper  to  pay  him  homage  here,  not  only  for 
his  having  enabled  us  to  cultivate  our  field  so 
intensively,  but  chiefly  for  having  opened  the 
fences  of  the  quantum  pastures  toward  the  greater 
outdoors,  as  yet  not  fully  surveyed. 


1.  The  Familiar  Landscape 
(1927-1932) 

The  first  article  [1]  witnessing  the  interest  von 
Neumann  was  to  show  in  the  foundations  of  quan¬ 
tum  mechanics  originated  in  a  course  given  by 
Hilbert  in  Gottingen  during  the  winter  semester 
1926/27;  the  notes  were  prepared  by  Nordheim, 
and  we  are  told  that  von  Neumann  contributed 
'"important  parts"  of  the  mathematical  develop¬ 
ments.  One  of  the  overt  aims  was  to  digest,  mathe¬ 
matically  and  conceptually,  the  ideas  of  Jordan 
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and  of  Dirac.  The  stimulating  and  very  direct  in¬ 
fluence  of  the  pioneering  work  of  Heisenberg,  Born, 
and  Jordan  is  explicitly  acknowledged  from  the 
very  beginning  of  the  article,  while  the  last  section 
is  devoted  to  the  Schrodinger  equation.  The  syn¬ 
thesis  is  clearly  on  the  making,  fusing  into  the 
theory  of  operators  acting  on  a  Hilbert  space  the  two 
forms  of  quantum  mechanics  which  we  now  rou¬ 
tinely  recognize  to  be  equivalent — as  Schrddinger 
himself  had  pointed  out  only  shortly  before:  the 
algebraic  formulation^  then  called  the  '"matrix-mech¬ 
anics,"  of  quantum  theory  (Heisenberg),  and  the 
analytic  formulation,  via  partial-differential  equa¬ 
tions,  the  so-called  "wave-mechanics"  (Schrodin- 
ger). 

The  same  year,  von  Neumann  published  three 
other  articles  on  the  foundations  of  quantum  me¬ 
chanics,  its  probabilistic  background,  and  the  ther¬ 
modynamics  of  quantum  mechanical  ensembles 
[2“4].  His  interest  was  also  directed  toward  more 
specific  problems,  such  as  the  quantum  spectral 
properties  of  the  electron  [5-7];  a  discussion  of  the 
extension  of  ergodic  theory  and  the  H-theorem  to 
the  "new  mechanics"  [8];  and  several  other  articles 
as  well,  with  ostensibly  mathematical  titles  (e.g., 
[9]),  witnessing  to  the  fact  that  von  Neumann  was 
profoundly  involved  in  trying  to  attack  the  mathe¬ 
matical  (or  conceptual)  foundations  of  the  theory 
from  as  many  angles  as  possible. 

1931  is  the  year  of  the  theorem  most  universally 
attached  to  von  Neumann's  name  in  the  physics 
community;  it  is  the  statement  and  the  proof  of  the 
uniqueness  of  the  Schrodinger  representation  [10]. 
Specifically,  this  theorem  establishes  the  remark¬ 
able  fact  that  if  denotes  the  group  gener¬ 

ated  by  exponentiating  the  position  and  momen¬ 
tum  to  obtain  the  Weyl  form  of  the  CCR  (canonical 
commutation  relations)  then  has  exactly 

one  irreducible  representation,  the  one  found  by 
Schrodinger.  In  the  spirit  of  the  early  1930s,  this 
result  is  worth  comparing  with  the  richness  of  the 
inequivalent  irreducible  representations  that  were 
known  to  exist  for  most  of  the  other  groups 
used  in  physics;  in  particular,  spectroscopists  knew 
how  to  exploit  the  fact  that  the  rotation  group  in 
three  dimensions  has  as  many  irreducible  repre¬ 
sentations  as  there  are  complex,  finite-dimensional 
vector  spaces.  By  contrast,  von  Neumann's 
uniqueness  theorem  is  why  we  can  teach  quantum 
mechanics  and  start  by  telling  our  students  that 
the  momentum  and  position  operators  are  can¬ 
onically  realized  as  differentiation  (Pfl^Xx)  = 


-iid^^Xx),  and  as  multiplication  (Qj^^)(x)  = 
xfi^i  x),  with  k  ^  1, 2, . . . ,  n,  where  are  elements 
of  the  Hilbert  space  C),  i.e.,  are  complex¬ 

valued  functions,  defined  on  the  configuration 
space  R",  and  assumed  to  be  square-integrable 
with  respect  to  the  Lebesgue  measure. 

Finally,  the  culminating  achievement  arrived 
with  the  epoch-marking  Mathematische  Grundlagen 
der  Quantenmechanik  [11].  I  often  wondered  why 
this  jewel  saw  its  first  English  translation  only  in 
1955;  was  this  simply  due  to  the  towering  preemi¬ 
nence  of  Dirac's  book,  or  could  it  be  a  reflection 
from  the  forced  emigration  to  the  United  States  of 
so  many  of  the  scientists  who  were  conversant 
with  the  subject  matter  in  the  original  language?  In 
the  meantime,  translation  or  not,  von  Neumann's 
book  certainly  became  for  a  large  part  of  the 
physics  community  the  standard  reference  on  the 
conceptual,  as  well  as  the  mathematical,  founda¬ 
tions  of  quantum  mechanics  (QM).  In  particular, 
its  introduction  of  the  density  matrix,  rather  than 
the  wave  function,  as  the  fundamental  concept  for 
a  quantum  mechanical  state  is  quite  remarkable; 
the  discussion  of  the  "measuring  process"  in¬ 
scribed  there  is  both  compelling  and  so  profoundly 
unsatisfactory  that  it  created  a  semantic  contro¬ 
versy  that  has  not  yet  really  abated;  the  task  of 
elucidating  the  problems  involved  in  the  elusive 
interpretation  of  the  quantum  measuring  process 
has  benefited,  in  particular,  from  the  sustained 
attention  of  Wigner,  who  was  in  a  position  to  add 
to  his  own  penetrating  reflections  [12],  the  author¬ 
ity  of  his  lifelong  friendship  with  von  Neumann. 

As  far  as  the  treatment  of  the  mechanics  of 
quantum  systems  with  finitely  many  degrees  of  free¬ 
dom  is  concerned,  the  formulation  of  scattering 
theory  seems  to  be  the  only  postwar  conceptual 
and  mathematical  development  not  anticipated  in 
von  Neumann's  book.  This  lacuna  has  now  been 
tended  to  in  [13]  (see  also  its  predecessor  [14]),  and 
it  can  be  fairly  said  that  this  did  not  require  any 
drastic  reconsiderations  of  the  conceptual  mold  in 
which  von  Neumann's  book  had  cast  QM. 

Indeed,  after  the  publication  of  von  Neumann's 
book,  most  physicists  had  apparently  every  reason 
to  "know"  that  the  observables  of  QM  are  the 
self-adjoint  operators  acting  in  a  fixed  Hilbert 
space;  that  the  states  are  the  density  matrices  asso¬ 
ciated  to  this  Hilbert  space;  and  that  this  represen¬ 
tation  is  uniquely  determined  by  the  configuration 
space  of  the  system  to  be  considered. 

Well,  almost  every  reason,  as  we  shall  now  see. 
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2.  Two  Brief  Excursions  (1934-1936) 

It  is  commonly  acknowledged  that  somewhere 
around  the  turn  of  the  1930s,  something  went 
tangy  in  quantum  physics,  A  perception  might 
have  appeared  according  to  which  the  success  of 
quantum  spectroscopy  in  atomic  and  molecular 
physics  was  not  going  to  carry  smoothly  over  to 
quantum  electrodynamics.  The  rather  difficult  up¬ 
bringing  of  the  still  impetuous  quantum  field  the¬ 
ory  might  indeed  have  suggested  that  the  princi¬ 
ples  and/or  the  formalism  were  not  quite  right. 
For  instance,  in  an  article  that  we  shall  briefly 
review  later  in  this  section  [15],  we  read  of  an 
alternate  "suitable  starting  point  for  a  generaliza¬ 
tion  of  the  present  quantum  mechanical  theory. 
The  need  of  such  a  generalization  arises  from  the 
(probably)  fundamental  difficulties  resulting  when 
one  attempts  to  apply  quantum  mechanics  to 
questions  in  relativistic  and  nuclear  phenomena." 

While  the  above  were  powerful  physical  moti¬ 
vations  indeed,  it  seems  that  it  was  also  for  rea¬ 
sons  of  a  different  order  that  von  Neumann  be¬ 
came  dissatisfied  with  the  foundations  of  QM.  In 
one  of  his  other  modes  of  operation,  he  had  min¬ 
gled  mightily  into  the  foundations  of  pure  mathe¬ 
matics;  even  to  us,  it  is  clear  that  the  foundations 
of  quantum  physics  were  not  on  a  footing  accept¬ 
able  to  the  likes  of  Hilbert,  nor  were  they  on  par 
with  Kolmogorov's  axiomatization  of  probability 
theory.  Note  incidentally  that  von  Neumann  had 
himself  contributed  also  to  the  allied  field  of  math¬ 
ematics  called  measure  theory.  Therefore,  von 
Neumann's  reasons  to  probe  further  into  the  foun¬ 
dations  of  quantum  physics  participated  signifi¬ 
cantly  from  longings  ambient  in  the  mathematical 
community. 

Yet,  one  does  not  need  to  indulge  in  much 
guessing  or  tentative  reconstructions  of  his  deepest 
motivations:  the  published  record  shows  that  von 
Neumann  decided  to  concentrate  his  quest,  at  first, 
on  analyzing  the  necessity  of  Hilbert  spaces  in  the 
formulation  of  quantum  theory. 

The  first  excursion  in  this  direction  is  [15],  from 
which  the  above  quote  was  taken.  It  is  the  prede¬ 
cessor  of  the  modern  algebraic  approach.  The  pur¬ 
pose  was  to  focus  on  the  set  of  observables  and 
thus  to  try  to  abstract  the  structure  of  the  set 
of  all  (bounded)  self-adjoint  operators,  dispensing 
with  the  reference  to  an  underlying  Hilbert  space. 


Since  such  operators  do  not  necessarily  commute, 
their  products  are  not  necessarily  self-adjoint;  and 
therefore  do  not  necessarily  belong  to  the 
substitute  advanced  in  this  article  is  to  consider 
instead  of  the  ordinary  product,  AB  the  symmetric 
product 

AoB  =  +  BA)  =  |[(A  +  Bf  -  ^  -  B^]. 

While  this  product  is  clearly  commutative,  it  is  not 
associative;  thus,  points  to  the  necessity  of 
studying  a  new  algebraic  structure,  which  is  now 
referred  to  as  a  Jordan  algebra.  The  article  pursued 
this  new  theory  far  enough  to  be  able  to  classify  all 
the  "simple"  finite-dimensional  objects  in  this  cate¬ 
gory.  All  of  them,  except  one,  are,  in  fact,  the 
self-adjoint  part  of  some  matrix  algebra,  over  the 
reals,  the  complex  numbers,  or  the  quaternions. 
These  are  called  "special"  Jordan  algebras,  to  dis¬ 
tinguish  them  from  the  "exceptional"  one  they 
had  obtained,  involving  Caley  numbers;  the  latter, 
however,  has  not  yet  found  useful  physical  appli¬ 
cations.  Its  interest  is  mostly  to  point  to  the  exis¬ 
tence  of  Jordan  algebras  that  are  not  special.  For 
many  years,  to  determine  whether  a  Jordan  alge¬ 
bra  is  "special"  (i.e.,  is  the  self-adjoint  part  of  an 
associative  algebra)  was  an  open  problem  for  infi¬ 
nite — dimensional  Jordan  algebras,  even  when 
these  algebras  are  equipped  with  a  norm;  the  an¬ 
swer  [16]  somewhat  surprisingly  was  reduced  to 
the  question  of  whether  a  consistent  orientability 
condition  in  the  state  space  of  these  algebras  is 
satisfied  (for  an  elementary  description  of  what  is 
involved,  see,  for  instance,  pp.  378-381  in  [17]). 

Von  Neumann's  second  excursion  [18]  in  the 
direction  of  probing  further  the  role  of  the  Hilbert 
space  formulation  that  he  had  described  in  [11] 
was  to  abstract  the  structure  of  the  family  L  of  all 
projectors  on  a  Hilbert  space  or,  equivalently,  the 
structure  of  the  set  of  its  closed  subspaces.  Under 
the  usual  inclusion,  one  can  define  the  intersection 
of  two  projectors  as  the  largest  projector  contained 
in  both  of  them  and  the  union  of  two  such  projec¬ 
tors  as  the  smallest  projector  containing  them  both. 
There  is  a  smallest  element,  the  zero  projector,  and 
a  largest  element,  the  identity  map.  Finally,  a  com¬ 
plementation  is  defined,  attributing  to  every  pro¬ 
jector  P  the  projector  J  —  P,  its  orthogonal  comple¬ 
ment.  The  result  of  this  description  is  that  L  is  a 
complemented  lattice,  in  much  the  same  way  as 
the  collection  of  all  measurable  subsets  (or  their 
indicator  functions)  of  a  probability  space  is  a 
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complemented  lattice.  There  is,  however,  a  major 
difference:  the  latter  is  Boolean,  i.e.,  the  intersec- 
tion  and  union  enjoy  the  familiar  distributive  rela¬ 
tions.  This  property  is  not  true  for  L,  and  this 
reflects  an  essential  feature  of  QM,  the  noncommu¬ 
tativity  of  the  operators  acting  on  the  Hilbert  space. 
The  question  asked  in  [18]  was  how  to  characterize 
L  intrinsically  or,  phrasing  this  in  more  physical 
terms,  how  to  define  abstractly  the  quantum 
equivalent  of  a  classical  probability  space.  Birkhoff 
and  von  Neumann  only  found  a  partial  answer:  an 
abstract  property — called  modularity — seemed  to 
be  a  good  substitute  to  replace  the  classical  notion 
of  distributivity;  however,  Birkhoff  and  von  Neu¬ 
mann  pointed  out  that  this  property  is  not  suffi¬ 
cient  to  capture  the  structure  of  the  set  of  all  the 
projectors,  since  it  is  satisfied  only  when  the  Hilbert 
space  if  finite-dimensional,  clearly  not  an  acceptable 
restriction  for  the  purpose  of  QM,  A  weakened 
version  of  modularity  was  found  a  quarter  of  cen¬ 
tury  later  [19].  In  physical  terms  (i.e.,  here  in  purely 
lattice-theoretical  terms),  the  weak  modularity 
condition  requires  that  if  P  implies  Q  (i.e.,  P  c  Q, 
or,  equivalently,  in  the  language  of  projections 
operators  acting  on  a  Hilbert  space,  PQ  =  P),  then 
P  and  Q  are  compatible  (i.e.,  the  sublattice  of  L 
generated  by  P,  Q,  P  ,  Q  is  Boolean  or,  equiva¬ 
lently,  in  the  operator  language,  PQ  =  QP).  To 
obtain  that  this  weak  modularity  characterizes  the 
lattice  of  all  projectors  in  a  Hilbert  space,  one  must 
add  an  atomicity  condition  exploiting  the  presence 
of  minimal  projections  (also  called  "atoms"  or 
"points"),  a  condition  the  omission  of  which  led 
later  to  interesting  structures  von  Neumann  was  to 
explore;  see  next  section. 

The  lattice  theoretical  approach  to  QM  captures 
enough  of  the  fundamental  structure  of  the  theory 
that  it  is  sufficient  to  define  symmetries  as  auto¬ 
morphisms  of  L  preserving  its  structure,  in  order 
to  obtain — at  long  last — an  elementary  proof  for 
the  theorem  by  Wigner  [20]  according  to  which,  in 
a  Hilbert  space  formalism  where  all  bounded 
self-adjoint  operators  correspond  to  physical  ob¬ 
servables,  symmetries  are  implemented  either  by 
unitary  or  by  antiunitary  operators;  for  the  proof, 
see  [21],  and  for  a  critical  assessment,  [22]  and 
references  therein. 

By  1936  therefore,  one  could  hope  that  some 
useful  information  would  be  retrieved  from  the 
study  of  the  algebra  of  observables  or  from  the 
lattice  of  projections  that  generate  this  algebra;  the 
main  problem  was  to  find  the  topological  condi¬ 
tions  allowing  one  to  relax  the  conditions  enough 


to  encompass  infinite-dimensional  situations.  This 
is  a  story  for  the  next  section. 


3.  The  Great  Outdoors 
( 1 936-Onwards) 

Many  problems  that  the  traditional  QM  does 
not  seem  able  to  cope  with  in  a  consistent  manner 
are  related  to  the  appearance  of  infinitely  many 
(real  or  virtual)  degrees  of  freedom.  In  the  latter 
case,  and  in  contrast  with  von  Neumann's  unique¬ 
ness  theorem  (see  Section  2),  we  know  [23],  that 
the  Fock  space  representation  is  only  one  among 
uncountably  many  inequivalent  representations  of 
the  canonical  commutation  (or  anticommutation) 
relations.  We  further  know  that  the  Fock  represen¬ 
tation  is  attached  to  the  vacuum  state  and  that  it 
cannot  accommodate  the  finite  density  of  particles 
needed  for  many  applications  to  statistical  me¬ 
chanics.  A  manifestation  of  this  phenomenon  was 
pointed  out  in  connection  with  the  gauge-symme¬ 
try  breaking  occurring  in  the  BCS  model  for  super¬ 
conductivity  [24]. 

The  natural  frame  to  collect  these  phenomena 
was  prepared  by  Murray  and  von  Neumann  in  a 
series  of  articles  [25].  The  mathematical  strategy  of 
these  articles  can  be  explained  physically  as  a  way 
to  see  QM  as  a  generalization  of  classical  mechan¬ 
ics  in  the  following  manner. 

First,  let  us  agree  to  view  (a  large  class  of)  the 
observables  in  classical  mechanics  as  (essentially) 
bounded  functions  on  the  classical  phase  space, 
mathematically  a  measure  space  (H,  /x);  for  in¬ 
stance,  /A  could  be  the  Lebesgue  measure  on  the 
space  R^”.  These  functions  are  the  real  elements  of 
an  algebra  j/  =  Lcffl)  that  comes  naturally 
equipped  with  an  involution  (here,  the  complex 
conjugation)  and  a  norm  satisfying  ||  A*A||  =  ||  A||^. 
In  modern  terms,  this  is  to  say  that  is  a  C^-alge- 
bra.  Moreover,  is  the  dual  of  a  Banach  space, 
namely,  the  space  LHH,  fx)  of  integrable  functions 
on  (n,  fx).  The  latter  property  makes  j/  what  is 
called  a  -algebra.  Note  that  the  states  on  these 
classical  systems  are  given  by  the  positive  ele¬ 
ments  p  of  ifiPt,  pi)  that  are  normalized  by  requir¬ 
ing  that  their  total  weight  is  1,  so  that  p  is  a 
probability  measure.  Hence,  the  expectation  value 
of  the  observable  A  in  the  state  p  is  given  by 
(  p;  A)  fA((o)p((o)  dpi. 

We  can  now  take  advantage  of  this  perspective 
on  classical  mechanics  to  provide  a  natural  link  to 
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the  formalism  of  quantum  mechanics.  Let  us  in¬ 
deed  represent  the  algebra  ssf  as  the  algebra  of 
operators  on  jl)  operating  by  multiplication, 
i.e.,  for  every  vector  ^  e  ju.),  the  vector 
is  defined  by  (A^)(w)  =  It  is  easy  to 

see  that,  in  this  representation, 

where  c5/'  =  {B  ^^\AB  =  BA  VA  ej/}.  (3.1) 

This  relation  implies  that 

where  s/"  =  {sf'Y .  (3.2) 

(In  general,  is  called  the  commutant  of  j/,  and 
j/",  the  bicommutant  of  j/.)  Hence,  j/  is  a  von 
Neuman  algebra,  i.e.,  an  algebra  of  operators  such 
that  it  is  self-adjoint  (i.e.,  A  ej/  implies  that  its 
adjoint  A*  also  belongs  to  jf)  and  satisfies  (3.2) 
(i.e.,  it  coincides  with  its  bicommutant). 

Von  Neumann  algebras  are  the  mathematical 
objects  proposed  for  study  in  [25]  under  the  now 
obsolete  name  of  "rings  of  operators";  the  current 
name  seems  to  have  come  into  general  practice 
after  the  publication  of  [26]. 

It  is  of  utmost  importance  in  our  argument  that 
we  include  (3.2)  but  not  (3.1)  in  the  definition  of  a 
von  Neumann  algebra;  indeed,  (3.1)  requires  in 
addition  that  the  algebra  be  Abelian  [j/cj/'j, 
which  we  do  not  want  in  general,  since  we  want 
to  include  in  this  structure  algebras  that  are  not 
Abelian,  such  as  the  algebra  of  all  bounded  opera¬ 
tors  on  a  Hilbert  space.  It  is  significant  for  our 
purpose  that  classical  mechanics  and  ordinary  QM 
appear  in  this  language  as  two  extreme  cases:  to 
classical  systems  are  associated  algebras  which  co¬ 
incide  with  their  commutant  [j/=j/'],  whereas 
the  algebras  that  are  associated  with  ordinary 
quantum  systems  are  irreducible,  i.e.,  their  com¬ 
mutant  is  trivial  [sf'  =  CJ].  Von  Neumann  and 
Murray  concentrated  on  an  intermediary  case  ob¬ 
tained  when  the  center  of  ssf  is  trivial  [j/  = 

CJ];  these  von  Neumann  algebras  are  called  fac¬ 
tors.  Mathematically,  they  turn  out  to  be  the  build¬ 
ing  blocks  from  which  one  can  construct  all  von 
Neumann  algebras.  Physically,  they  occur  under 
very  specific  circumstances,  e.g.,  in  connection  with 
the  description  of  pure  thermodynamical  phases  as 
extremal  KMS  states  (see  below). 

Before  we  arrive  there,  we  should  return  briefly 
to  the  question  of  the  justification  for  the  role 
Hilbert  spaces  in  quantum  theory;  or,  stated  differ¬ 
ently,  whether  one  can  avoid  putting  Hilbert  spaces 
explicitly  in  the  very  axioms  of  the  theory;  or 


putting  it  still  differently,  whether  one  can  build  in 
the  theory  enough  flexibility  so  that  different  phys¬ 
ical  situations  may  be  described  by  different 
Hilbert  space  representations. 

The  answer  to  this  question  is  predicated  on  the 
prior  construction  of  a  C* -algebra  (see  the  begin¬ 
ning  of  this  section),  the  self-adjoint  elements  of 
which  are  the  observables  of  the  system  consid¬ 
ered.  For  instance,  in  the  study  of  a  Heisenberg 
magnet,  i.e.,  of  spins  distributed  on  a  lattice  A,  one 
can  associate  to  every  part  c  A  containing  N 
sites  (with  N  finite)  the  C* -algebra  of  all 
2^  X  2^  matrices.  The  set-theoretical  inclusion  in 
the  family  of  all  finite  subsets  of  A  induces  a 
natural  inclusion  relation  in  the  family  of  all  corre¬ 
sponding  C*-algebras.  It  is  then  possible  to  con¬ 
struct  (by  a  method  called  the  C* -inductive  limit) 
a  C*-algebra  and  mutually  consistent  injective 
maps  from  each  to  sf,  in  such  a  manner  that 
the  set  theoretical  union  of  their  images  in  j/  is 
dense  in  j/.  j/  is  then  identified  as  the  C*-algebra 
of  the  infinite  lattice  of  spins.  Note  that  no  mention 
of  a  Hilbert  space  has  been  made  in  connection 
with  the  global  algebra  j/. 

The  general  GNS  construction  associates  then  to 
every  state  on  any  C*-algebra  j/  a  specific  Hilbert 
space  representation.  As  this  construction  seems  to 
capture  much  of  the  physics,  let  us  sketch  it.  First, 
a  state  is  viewed  here  as  a  linear  map: 

0:  A  {(f);  A)  G  C  with 

<</>;  A*>  =  A>*;  <</>;  A*A>  >0  ^  A^s^ 

and  sup^e.^,IIA||<iKc/>;  A>  |=  1.  (3.3) 

Note  that,  while  the  above  properties  are  not 
mathematically  independent,  they  all  have  signifi¬ 
cance  for  the  physical  interpretation  of  the  theory; 
moreover,  in  the  usual  situation  where  j/  has  a 
unit  J,  the  last  of  the  above  conditions  reads  sim¬ 
ply  (</>;  J>  =  1. 

The  state  on  being  given,  a  sesquilinear 
form  on  is  naturally  defined  by 

(A,  B)  =  <</>;  A*B>.  (3.4) 

A  Hilbert  space 

^=^7^  (3.5) 

is  then  obtained  as  the  completion  of  the  pre-Hil¬ 
bert  space  (j//Ker^),  where  the  kernel  Ker^  =  {A 
Gj/|  A* A)  =  0}  is  a  closed  left-ideal  of  j/.  We 
identify  the  vector  the  equivalence 
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class  of  A  ej/  modulo  Ker^.  As  these  vectors 
form  a  dense  subspace  of  we  can  define  for 
every  A  e  ^  a  bounded  operator  ir^(  A)  by 

7t^(A)^s  =  ^ab  VBgj/.  (3.6) 
Note  that  the  map 

TT^:  A  (3.7) 

is  a  Hilbert  space  representation  of  s^\ 

77/CjAi  +  C^A^)  =  CiTT^fAj)  +  C27r^(A2) 
A/,(A^2)  =  '^4,(A^)t7^(A2) 

TT^iA*)  ^  TT^iAf  ,  (3.8) 

for  all  Cj,  C2  G  C  and  all  Aj,  A2  g  j/.  Note  also 
that 

=  {2r/A)a);|A  Gj/}  (3.9) 

an(J 

($j,ir/A)3>,)  =  {<j>-,  A)  V  A  Gj/,  (3.10) 

i.e.,  in  this  representation,  the  vector  is  cyclic 
and  represents  the  state  (f).  These  two  properties 
determine  the  representation  up  to  a  unitary 
equivalence. 

For  two  familiar  examples,  let  cf)  be  the  vacuum 
state  on  the  algebra  of  the  CCR;  if  the  sys¬ 
tem  considered  has  finitely  many  degrees  of  free¬ 
dom,  the  representation  is  the  Schrddinger  rep¬ 
resentation;  and  when  the  system  considered 
has  infinitely  many  degrees  of  freedom,  the  repre¬ 
sentation  77^  is  the  Fock  space  representation.  The 
familiar  fact  that  these  representations  are  irre¬ 
ducible  is  a  reflection  on  the  fact  that  the  vacuum 
state  is  a  pure  state  (i.e.,  cannot  be  decomposed 
into  a  convex  mixture  of  other  states).  Since  W*-al- 
gebras  are  C*-algebras,  the  GNS  applies  to  them 
as  well. 

As  indicated  above,  for  the  special  case  of  the 
Heisenberg  ferromagnet,  it  is  possible  to  construct 
abstractly  the  C*-algebra  corresponding  to  an  infi¬ 
nite  system.  Similar  procedures  can  be  followed  in 
much  more  general  circumstances;  for  instance,  an 
axiomatization  of  quantum  field  theory  was  pre¬ 
sented  along  such  line  in  an  article  that  injected  a 
new  vigor  in  the  algebraic  approach  to  quantum 
theory  [27].  The  general  idea  is  as  follows.  Suppose 
that  a  C*-algebra  s/  has  been  obtained  that  de¬ 
scribes  the  infinite  system  to  be  considered.  Sup¬ 


pose  further  that  a  state  (f)  as  been  selected,  e.g., 
the  thermodynamical  limit  of  the  canonical  equi¬ 
librium  states  of  the  corresponding  local  systems; 
we  shall  discuss  below  a  characterization  of  the 
states  obtained  in  this  manner.  One  then  performs 
the  GNS  construction  associated  to  this  state  to 
obtain  the  representation  that  is  thus  specifi¬ 
cally  based  on  the  state  0.  It  is  now  a  matter  of 
convenience  to  consider  the  von  Neumann  algebra 
acting  on  the  Hilbert  space 

The  main  advantage  of  considering  the  von 
Neumann  algebra  rather  than  the  C* -alge¬ 
bra  is  that  the  topology  of  von  Neumann 

algebras  is  better  adapted  to  apply  some  of  the 
methods  of  functional  analysis.  For  instance,  a  von 
Neumann  algebra  JK  is  always  generated  by  its 
projectors  (whereas  a  C*-algebra  might  have  no 
non-zero  projector  among  its  elements).  A  spectral 
theory  can  be  formulated  then,  entirely  at  the  level 
of  the  von  Neumann  algebra.  In  fact,  the  lattice 
L{jy)  of  the  projectors  of  a  von  Neumann  algebra  JK 
essentially  contains  all  the  intrinsic  information 
provided  by  itself.  In  particular,  when  ./T  is  a 
factor,  a  dimension  function  d  can  be  defined  on 
Li/y).  Up  to  a  multiplicative  constant,  the  range  R 
of  d  can  be  of  either  of  the  following  five  types: 

type  R  =  {1,2, . . . ,  n} 

type  4:  R  =  {l,2,...,oo} 

type /Ji:  R  =  [0,1] 

type  J4:  R  =  [0,oo] 

typeffl:  R  =  {0,oo}.  (3.11) 

From  a  mathematical  point  of  view,  we  empha¬ 
size  that  the  range  of  the  dimension  function  for 
the  factors  of  type  I  is  discrete:  the  lattice  of  their 
projectors  is  atomic.  By  contrast,  for  the  factors  of 
type  111,  the  dimension  function  takes  all  values  in 
the  finite  closed  interval  [0, 1];  these  factors  are  the 
candidates  for  the  "'geometries  without  points." 
The  search  for  such  geometries  seems  to  have  been 
one  of  the  motivations  von  Neumann  had,  not 
only  for  the  series  of  articles  under  discussion,  but 
also  for  the  article  with  Birkhoff  discussed  in  the 
previous  section,  where  they  had  isolated  the 
modularity  condition. 

For  the  purposes  of  physics,  factors  of  type  II 
and  111  nowadays  occur  routinely  and  typically  in 
the  study  of  systems  with  infinitely  many  degrees 
of  freedom,  be  they  obtained  in  relativisitic  quan¬ 
tum  field  theory  or  in  the  statistical  mechanics  of 
the  thermodynamical  limit.  We  will  concentrate 
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our  attention  on  the  latter  in  the  sequel.  To  prepare 
for  this,  the  case  of  canonical  equilibrium  states  for 
finite  systems  is  still  worth  considering  in  some 
detail.  Specifically,  we  want  to  illustrate  that,  as 
the  GNS  representation  depends  on  the  state  (/>, 
one  should  expect  that  it  expresses  also  some  of 
the  special  circumstances  of  the  physical  situation 
described  by  the  state  (p. 

The  following  explicit  particular  case  is  typical 
in  the  structures  that  it  brings  to  light.  Consider, 
indeed,  the  representation  obtained  by  the  GNS 
construction  for  the  canonical  equilibrium  state  of 
a  1-dimensional  harmonic  oscillator;  it  acts  on  the 
Hilbert  space  known  as  the  Liouville  space  of  the 
system  ^  =  ^  where ^  denotes  the  Hilbert 

space  of  the  Schrbdinger  representation.  This  is  the 
space  of  all  Hilbert-Schmidt  operators  on  i.e., 
the  Hilbert  space  spanned  by  the  density  matrices 
on  With  =- 0,1,2, ...}  denoting  an  or¬ 

thonormal  basis  of  eigenvectors  of  the  Hamilto¬ 
nian  H  of  our  oscillator,  (S>  ipilk,  I  = 

0,1,2,...}  gives  an  orthonormal  basis  in  We 
have  then 

77^(  A)  ^  A  0  I, 

i.e.,  TT^CAXipi^  0  ipi)  =  (Aip^)  0  ipi  (3.12) 

and 

<I)j  =  -/p  with  p  =  /Tre~^^ , 

i.e.,<E.=  (3.13) 

k 

This  representation  is  evidently  not  irreducible,  a 
reflection  of  the  fact  that  the  canonical  equilibrium 
state  p  is  a  mixture.  Moreover,  it  is  instructive  to 
notice  that 

=  {1  0  A\A  and,  thus, 

n  TT^i^y  =  C,  (3.14) 

i.e.,  is  a  factor  which  is  antiunitarily  equiva¬ 

lent  to  its  commutant,  through  the  antiunitary 
operator  /  defined  by 

(3.15) 

^  kl  ^  kl 

Notice,  furthermore,  that  the  operators  U^(0  de¬ 
fined  on  ^  by 

U^t)^  =  ^  V(t)AV(-t) 

where  y(f)  =  and  A  gj/  (3,16) 


satisfy  the  relations 

H^(0O  -  O  and 

U^it)7r(A)U^(-t)  ^  7T{V(t)AV(-t)) 

VAgj/.  (3.17) 

These  operators  are  realized  explicitly  by 

Li/0  =  V(t)  0  V(-t)  (3.18) 

so  that 

^  U^(-t)  and  H^  =  H0l~l0H, 

(3.19) 

where  H^,  the  generator  of  li^,  has  therefore  a 
spectrum  that  is  symmetric  around  zero  and,  in 
particular  is  not  bounded  below.  As  ^  is  called 
the  Liouville  space  of  the  system,  is  referred  to 
as  the  Liouville  operator. 

So  far,  we  are  still  in  somewhat  familiar  terri¬ 
tory.  Yet,  the  remarkable  feature  about  the  repre¬ 
sentation  that  we  just  discussed  is  that  much  of  the 
structure  to  which  it  has  pointed  persists  in  the 
thermodynamical  limit  where  the  usual  formalism 
of  QM  loses  its  power  since  the  Fock  space  does 
not  accommodate  states  with  a  finite  particle  den¬ 
sity  and  factors  do  appear  that  are  not  of  type  L 
This  was  first  noticed  for  the  Bose  gas  [28]. 

This  structure  is  now  known  indeed  to  be  a 
general  consequence  [29]  (for  a  textbook  account, 
see  [17])  of  a  characterization  of  the  canonical 
equilibrium  state  that  carries  over  to  the  thermo¬ 
dynamical  limit.  This  is  the  abstract  formulation 
[30]  (the  so-called  KMS  condition  [31])  of  an  ana- 
lyticity  property  of  thermal  Green  functions. 

Given  a  time-evolution  a  (i.e.,  a  continuous, 
one-parameter  group  of  automorphisms  of  the  al¬ 
gebra  of  observables  for  the  system  considered) 
and  given  also  a  temperature  (3  =  1/kT,  the  KMS 
condition  on  the  state  (p  expresses  that  the  correla¬ 
tion  functions  satisfy  (in  essence,  although  some 
caution  on  domains  should  be  exercised) 

{(p;  a(t)[  A]B)  =  (cp;  Ba(t  +  zjS)[  A]) 

V  A,  B  Gj/.  (3.20) 

The  states  that  satisfy  this  condition  are  called 
KMS  states. 

For  finite  systems,  it  is  immediate  to  see  that 
there  is  only  one  such  state  and  that  it  is  given  by 
the  usual  canonical  equilibrium  density  matrix. 
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For  infinite  systems,  however,  this  uniqueness  dis¬ 
appears,  but  the  set  of  the  states  that  satisfy 
the  KMS  condition  (for  the  evolution  a  and  the 
temperature  jS)  form  a  convex  set  that  is  a  sim¬ 
plex,  i.e.,  a  convex  set  the  elements  of  which  can 
be  decomposed  in  a  unique  manner  as  a  convex 
combination  of  their  extreme  points  (as  every 
chemist  who  has  dealt  with  mixtures  of  three  reac¬ 
tants  knows,  an  equilateral  triangle  is  a  simplex;  a 
circle  is  not).  This  additional  freedom  is  exactly 
what  was  looked  for  when  one  wished  the  thermo¬ 
dynamical  limit  would  allow  one  to  describe  phase 
transitions.  The  extremal  KMS  states  are  to  be 
interpreted  as  pure  thermodynamical  phases,  and 
it  seems  important  for  the  interpretation  that 
canonical  equilibrium  states  be  uniquely  decom¬ 
posable  in  their  extremal  components,  the  pure 
thermodynamical  phases.  This  very  special  prop¬ 
erty  of  quantum  canonical  equilibrium  states  is  a 
classical  feature  of  quantum  statistical  mechanics 
that  is  made  manifest  only  in  the  thermodynami¬ 
cal  limit.  The  GNS  representation  for  a  KMS  state 
(/),  which  does  not  describe  a  pure  thermodynami¬ 
cal  phase,  is  not  a  factor;  the  spectrum  of  its 
nontrivial  center  then  labels  the  pure  thermody¬ 
namical  phases  of  which  cf)  is  a.  mixture. 

The  canonical  decomposition  of  KMS  states  in 
extremal  states  satisfying  the  KMS  condition  has 
been  a  useful  tool  to  discuss  the  phenomena  of 
spontaneous  symmetry  breaking,  see,  e.g.,  [32].  For 
nonequilibrium  situations  as  well,  the  flexibility  of 
the  algebraic  approach  has  allowed  some  progress 
on  a  variety  of  fronts,  such  as 

1.  The  construction  of  a  model  in  which  one  can 
implement  the  four  requirements  Bohr 
wanted  to  impose  on  a  quantum  measuring 
process,  namely:  (i)  the  collapse  of  the  wave 
packet,  e.g.,  the  collapse  of  the  state  of  the 
system  from  a  coherent  superposition  of 
eigenstates  (relative  the  observable  mea¬ 
sured)  to  an  incoherent  mixture  of  these 
eigenstates;  (ii)  the  transfer  of  information 
from  the  system  to  the  apparatus;  (iii  a)  the 
measuring  apparatus  is  macroscopic  (a  rather 
vague  notion  that  can  be  sharpened  in  a 
formalism  where  the  passage  to  the  thermo¬ 
dynamical  limit  is  allowed);  (iii  b)  the  mea¬ 
suring  process  is  stable  under  small  (i.e.,  lo¬ 
cal)  perturbations  of  the  initial  state  of  the 
measuring  apparatus;  (iv)  the  process  is  gov¬ 
erned  by  the  laws  of  Quantum  Mechanics 
(which  as  applied  to  infinite  systems,  means 


that  the  dynamics  is  obtained  as  the  thermo¬ 
dynamical  limit  of  a  Hamiltonian  evolution 
describing  completely  the  coupling  between 
the  system  and  the  apparatus)  [33]. 

2.  The  extension  to  the  quantum  realm  of  the 
concept  of  Kolmogorov  flows,  inherited  from 
classical  probabilistic  ergodic  theory  (e.g.. 
Brownian  motion)  [34];  this  involves  discus¬ 
sions  of  the  physical  manifestations  of  such 
ergodic  behavior,  including  various  notions 
of  mixing  (i.e.,  various  forms  for  the  decay  of 
correlation  functions),  as  well  as  finer  notions 
such  as  the  Lebesgue  spectrum,  the  nonvan¬ 
ishing  of  dynamical  entropy,  and  quantum 
diffusion-type  equations. 

3.  The  exploration  of  quantum  equivalents  to 
the  classical  notions  of  Anosov  systems, 
chaos,  and/or  Liapunov  exponents;  the  latter 
project  is  still  involved  in  some  controversy 
in  which  the  algebraic  approach  helped  the 
formulation,  and  sorting  out,  of  various  op¬ 
tions  [35]. 

Quantum  field  theory,  elementary  particle  the¬ 
ory,  braid  groups,  and  knots,  and  noncommutative 
differential  geometry  are  other  fields  of  physics 
and  of  mathematics  where  the  algebraic  and  ana¬ 
lytic  methods  pioneered  by  von  Neumann  have 
opened  new  conceptual  vistas;  these,  however,  are 
not  within  the  scope  of  the  present  lecture. 
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ABSTRACT:  In  probability  theory  an  event  is  characterized  by  a  set  of  outcomes.  The 
probability  of  an  event  is  a  prediction  of  the  relative  frequency  of  the  outcomes  that  lie  in 
the  set,  when  the  experiment  is  repeated  independently.  In  quantum  mechanics  an  event 
is  identified  with  a  closed  subspace  of  Hilbert  space.  Do  the  probabilities  of  quantum 
events  predict  relative  frequencies  in  the  same  way?  The  famous  analysis  of  Bell  shows 
that  this  cannot  happen  in  general,  for  all  quantum  events.  Nevertheless  the  probabilities 
must  predict  something,  and  if  not  relative  frequencies  then  what?  There  are  several 
answers  to  the  question  of  how  to  interpret  quantum  probabilities  as  relative  frequencies, 
but  they  all  involve  restrictions  on  the  events  that  are  considered.  This  article  surveys 
three  such  interpretations,  which  correspond  to  three  possible  choices:  measured  events, 
macroscopic  events,  and  position  events.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem 
65:  389-398,  1997 


Introduction 

Does  quantrum  probability  predict  frequency? 

If  so,  frequency  of  what?  We  put  these  questions  in 
a  mathematical  framework  and  present  possible 
answers.  The  purpose  is  to  clarify  the  physical 
content  of  quantum  mechanics.  This  article  is  ex¬ 
pository;  the  goal  is  to  explain  the  issues  in  as 
elementary  a  way  as  possible. 

In  ordinary  probability  theory  there  is  a  fixed 
set  of  outcomes.  An  event  is  characterized  by  a  set 
of  outcomes.  The  probability  of  an  event  is  a  num- 
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ber  between  zero  and  one.  The  ordinary  interpre¬ 
tation  of  probability  in  physical  science  is  the 
frequency  interpretation.  Suppose  there  are  many 
independent  repetitions  of  the  same  situation.  Each 
time  there  is  an  actual  physical  outcome.  The  rela¬ 
tive  frequency  of  the  repetitions  for  which  the 
outcome  makes  the  event  happen  should  be  close 
to  the  probability  prediction.  If  this  frequency  in¬ 
terpretation  of  probability  is  to  be  viable,  then  the 
probability  assignment  must  satisfy  basic  princi¬ 
ples  of  probability  theory,  such  as  addivity. 

In  quantum  mechanics  an  event  is  identified 
with  a  closed  subspace  of  Hilbert  space.  When  the 
quantum  state  is  specified,  each  quantum  event 
has  a  probability.  This  again  is  a  number  between 
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zero  and  one.  However,  conjunctions  of  quantum 
events  are  defined  only  in  special  cases,  when  the 
events  are  compatible.  Can  the  probabilities,  for  all 
quantum  events  simultaneously,  be  regarded  as 
predictions  of  relative  frequency?  Can  the  quan¬ 
tum  events  be  reinterpreted  as  determining  sets  of 
outcomes?  The  answer  is  no.  The  probabilities  of 
quantum  events  violate  additivity. 

The  famous  analysis  of  Bell  gave  a  deeper  per¬ 
spective  on  this  fact.  If  the  probability  assignment 
depends  on  which  set  of  compatible  quantum 
events  is  considered,  then  the  ordinary  probability 
laws  can  be  satisfied.  The  physical  apparatus  that 
defines  the  particular  quantum  events  influences 
the  probabilities.  However,  Bell  found  a  new  con¬ 
straint.  Assume  that  the  system  satisfies  two  con¬ 
ditions:  active  locality  and  passive  locality.  Then 
again  the  quantum  probability  assignment  con¬ 
flicts  with  probability  laws.  Thus  if  quantum  me¬ 
chanical  probabilities  are  to  predict  frequencies  in 
the  usual  way,  then  either  active  locality  or  pas¬ 
sive  locality  must  fail. 

In  any  case,  if  quantum  probabilities  are  to 
predict  the  frequencies  of  certain  events,  then  these 
events  must  be  defined  in  terms  of  sets  of  out¬ 
comes.  There  are  several  answers  to  the  question 
of  how  to  pick  the  set  of  outcomes  and  the  corre¬ 
sponding  events,  corresponding  to  different  inter¬ 
pretations  of  quantum  mechanics.  We  survey  three 
such  interpretations. 

According  to  the  Copenhagen  interpretation,  the 
set  of  possible  outcomes  depends  on  an  external 
choice  of  what  to  measure.  Once  this  choice  has 
been  made,  then  there  is  only  a  small  collection  of 
compatible  events  for  which  the  probabilities  give 
frequency  predictions. 

Another  interpretation  attempts  to  describe  a 
single  set  of  outcomes  consisting  of  macroscopic 
histories.  The  laws  of  probability  apply  to  macro¬ 
scopic  events  for  which  a  decoherence  condition  is 
satisfied. 

Yet  another  class  of  interpretations  describes  the 
world  on  the  atomic  level,  and  the  set  of  possible 
outcomes  consists  of  particle  trajectories.  The  only 
quantum  events  that  persist  in  this  picture  are 
position  events. 

So  the  answer  is  that  quantum  probability  pre¬ 
dicts  frequencies,  but  only  of  certain  events.  In 
each  interpretation  it  predicts  the  frequencies  of 
events  characterized  by  sets  of  outcomes.  The 
problem  is  that  there  is  no  agreement  over  what 
the  outcomes  should  be. 


The  entire  discussion  here  is  in  the  context  of 
the  elementary  quantum  mechanics  of  particles.  Of 
course,  current  physical  theory  regards  quantum 
fields  as  the  fundamental  dynamical  entities.  In 
particular,  quarks  and  leptons  are  described  by 
Fermi  fields  and  interactions  between  them  by 
gauge  fields.  The  direct  physical  significance  of 
these  fields  is  subtle,  and  so  the  implication  for  the 
interpretation  of  quantum  mechanics  is  yet  ob¬ 
scure. 


Probability 

Our  first  task  is  to  review  standard  probability 
theory  in  order  to  contrast  it  with  quantum  me¬ 
chanics.  In  probability  there  is  a  set  fi  of  all 
outcomes  of  an  experiment.  An  event  A  is  a  condi¬ 
tion  that  determines  a  subset  of  fl  to  which  a 
probability  may  be  assigned.  The  logical  opera¬ 
tions  and,  or,  not  on  events  correspond  to  set 
theory  operations  intersection,  union,  complement 
on  subsets.  The  probability  P[  A]  of  an  event  A  is  a 
number  between  0  and  1.  The  probability  assign¬ 
ment  must  satisfy  the  usual  laws  of  probability. 
For  instance,  a  typical  condition  is  additivity  in  the 
form 

P[  A]  =  P[  A  and  B]  +  P[  A  and  not  B]. 

The  frequency  interpretation  of  probability  is  as  a 
prediction  of  relative  frequency.  Suppose  the  situa¬ 
tion  is  repeated  many  times  under  independent 
conditions.  Then  the  relative  frequency  of  the  repe¬ 
titions  for  which  a  given  event  happens  should  be 
close  to  the  probability  of  the  event. 

Here  is  an  example  of  a  probability  assignment. 
Consider  the  set  fl  consisting  of  the  six  triples  110, 
101,  100,  Oil,  010,  and  001.  Let  A  be  the  event  that 
the  first  element  is  a  1,  let  B  be  the  event  that  the 
second  element  is  1,  and  let  C  be  the  event  that  the 
third  element  is  1.  Thus  B,  for  example,  corre¬ 
sponds  to  the  three-element  subset  110,  011,  010. 
Assign  the  probability  uniformly  over  the  six 
points.  Then  the  events  A,  B,  and  C  and  their 
negations  each  have  probability  However,  note 
that  events  such  as  the  compound  event  (A  and 
not  B)  have  probability  |.  (The  corresponding 
subset  is  101,  100,  and  these  are  two  out  of  six.) 
Thus  there  is  a  conspiracy  that  makes  A  and  B 
tend  not  to  happen  together.  However,  there  is  a 
limit  to  the  possible  magnitude  of  such  a  conspir¬ 
acy.  This  limit  is  given  by  the  following  elemen¬ 
tary  probability  theorem. 
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ELEMENTARY  PROBABILITY  THEOREM 

Consider  an  arbitrary  probability  assignment. 
Then 

P[  A  and  not  B]  +  P[B  and  not  C] 

+  P[C  and  not  A]  <  1. 

This  theorem  is  completely  trivial  as  a  theorem 
of  probability  theory,  but  we  shall  see  that  there  is 
an  analogous  result  that  is  false  in  quantum  me¬ 
chanics. 


Quantum  IVIechanics 

We  briefly  review  this  theory.  The  mathematical 
framework  of  quantum  mechanics  is  Hilbert  space. 
Each  quantum  event  is  identified  with  a  closed 
subspace  A  of  the  Hilbert  space.  (Often  the  sub¬ 
space  is  identified  with  the  orthogonal  projection 
operator  onto  the  subspace.)  The  quantum  state  is  a 
fixed  one-dimensional  subspace  of  the  Hilbert 
space.  For  each  quantum  event  A,  the  probability 
of  A  is  given  by 

P[A]  =  cosHe), 

where  cosKd)  is  the  square  of  the  cosine  of  the 
angle  between  the  one-dimensional  subspace  de¬ 
termining  the  state  and  the  subspace  correspond¬ 
ing  to  the  A.  [Of  course,  the  Hilbert  space  is 
complex,  but  cos^(0)  may  be  defined  as  the  effect 
on  the  state  of  projecting  onto  A  and  then  back.] 

In  this  framework,  the  negation  not  A  of  the 
event  A  is  defined  by  the  orthogonal  complement 
A  of  the  subspace  corresponding  to  A,  so  that 

P[not  A]  =  sin^(0). 

In  particular, 

P[  A]  -I-  P[not  A]  =  1. 

If  A  and  B  are  closed  subspaces,  the  intersec¬ 
tion  A  n  B  is  a  closed  subspace.  We  say  that  A 
and  B  are  compatible  if  the  four  subspaces  A  n 
B,  AnB-‘-,A-''nB,  A-*-  nB-*-  have  direct  sum  the 
entire  Hilbert  space.  (This  geometrical  condition  is 
equivalent  to  having  the  corresponding  projections 
commute.)  When  A  and  B  are  compatible,  we 
define  their  conjunction  A  and  B  to  be  the  intersec¬ 
tion  A  n  B.  Thus,  when  A  and  B  are  compatible, 
we  have  all  the  usual  probability  laws,  for  exam¬ 


ple,  additivity  in  the  form 

P[  A]=P[A  and  B]  +  P[  A  and  not  Bj. 

It  would  be  unwise  to  define  conjunction  as 
intersection  of  subspaces  when  the  subspaces  are 
not  compatible.  To  see  this  in  a  dramatic  way,  take 
A  and  B  to  be  one-dimensional  subspaces  that  are 
not  parallel  or  perpendicular,  and  take  the  state 
along  A.  The  additivity  equation  then  reads  1  =  0, 
as  bad  a  failure  as  one  could  imagine. 

The  relation  of  compatibility  is  not  transitive. 
Say  that  A  is  compatible  with  B  and  also  B  is 
compatible  with  C.  It  does  not  necessarily  follow 
that  A  is  compatible  with  C. 

What  is  to  be  made  of  the  existence  of  incom¬ 
patible  quantities  A  and  B?  In  the  Copenhagen 
interpretation  of  quantum  mechanics  it  is  some¬ 
times  said  that  one  must  choose  to  measure  either 
A  or  B,  and  the  measurement  of  one  makes  impos¬ 
sible  the  measurement  of  the  other.  Furthermore,  a 
quantum  event  has  no  particular  truth  value  until 
it  is  measured.  So  from  this  perspective  it  is  rea¬ 
sonable  there  is  no  meaningful  quantum  event  that 
combines  A  with  B. 


Bell’s  Theorems 

Bell  discovered  a  remarkably  simple  way  of 
characterizing  some  of  the  puzzling  aspects  of 
quantum  mechanics.  His  essays  on  this  subject 
have  been  collected  [1].  In  this  account  we  draw  on 
the  appendix  to  Wick's  book  [2]  for  a  particularly 
elementary  account. 

Suppose  that  we  have  a  fbced  quantum  state 
and  two  compatible  events  A  and  A'.  We  say  that 
A  is  equivalent  to  A  with  respect  to  the  state  if 

P[A]  =P[Aand  A']  =P[A']. 

These  events  will  happen  or  not  happen  together, 
with  probability  1. 

Bell  made  a  remarkable  discovery  about  the 
quantum  system  consisting  of  two  spin  |  particles 
in  the  singlet  state.  There  are  events  A,  B,  and  C 
associated  with  the  spin  of  the  first  particle  and 
events  A',  B',  and  C  associated  with  the  spin  of 
the  second  particle.  They  have  the  following  prop¬ 
erties: 

1.  Each  of  the  events  A,  B,  C  is  compatible  with 
each  of  the  events  A',  B',  C'. 
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2.  The  event  A  is  equivalent  to  A,  the  event  B 
is  equivalent  to  B',  and  the  event  C  is  equiv¬ 
alent  to  C.  These  events  all  have  probability 

1 

2* 

3.  The  probabilities  of  the  events  (A  and  not 
B'),  (B  and  not  C'),  (C  and  not  A)  are  each 
|.  There  is  a  very  strong  conspiracy  for  A 
and  B'  not  to  happen  together. 

This  gives  what  we  shall  refer  to  as  BelTs  first 
theorem; 

BELL’S  FIRST  THEOREM 

There  can  be  no  single  probability  model  that 
reproduces  the  results  of  quantum  mechanics  for 
this  system. 

Proof.  If  there  were,  we  could  replace  the  events 
A,  B',  and  C  by  the  equivalent  events  A,  B,  C 
and  get 

P[  A  and  notB]  +  P[  B  and  not  C] 

-I-  P[C  and  not  A]  =  |. 

This  contradicts  the  elementary  probability  theo¬ 
rem. 

This  theorem  is  remarkable,  but  its  implication 
for  quantum  mechanics  is  not  quite  as  strong  as  it 
seems.  If  one  admits  that  quantum  events  only 
become  true  or  false  when  they  are  measured,  then 
one  must  be  prepared  to  admit  that  a  measure¬ 
ment  can  have  an  effect  on  the  system.  Thus  the 
probability  assignment  might  be  affected  by  what 
is  measured. 

In  the  example  of  the  two  particles,  there  is  a 
magnetic  field  in  the  vicinity  of  the  first  particle 
that  is  oriented  in  a  particular  direction.  It  can 
deflect  the  particle  in  one  of  two  different  ways. 
The  event  A  corresponds  to  one  of  the  two  possi¬ 
ble  deflections  (and  the  event  not  A  to  the  other 
one).  If  the  magnetic  field  is  oriented  in  another 
way,  we  get  the  event  B  and  its  negation.  A  third 
orientation  defines  C  and  its  negation.  Only  one  of 
these  orientations  is  physically  possible  in  a  given 
situation.  There  are  similar  possible  magnetic  field 
arrangements  for  the  second  particle,  with  corre¬ 
sponding  events  A',  B',  C'  and  their  negations. 

Thus  we  could  write  for  the  probability 
when  we  have  decided  to  introduce  magnetic  fields 
to  decide  between  A  and  not  A  for  the  first 
particle  and  between  B'  and  not  B'  for  the  second 
particle.  In  this  notation  the  three  probabilities 
would  be  P„i,[A  and  not  B'],  Pi„[B  and  not  C], 


and  P„[C  and  not  A'].  Even  though  these  proba¬ 
bilities  are  each  f,  there  is  no  way  of  combining 
the  events  in  a  situation  where  a  single  probability 
assignment  is  relevant.  It  would  seem  that  the 
argument  would  no  longer  apply  to  this  situation. 
Bell,  however,  showed  that  under  additional  hy¬ 
potheses  it  still  works. 

Let  <?”  be  events  associated  with  the  first  parti¬ 
cle,  and  let  be  events  associated  with  the  sec¬ 
ond  particle.  Let  be  another  collection  of  events; 
we  think  of  these  as  occurring  in  the  past.  Let  P^ 
be  the  probability  assignment  with  apparatus  set¬ 
ting  i  for  the  first  particle  and  j  for  the  second 
particle. 

It  seems  reasonable  that  there  should  be  no 
possibility  of  intervention  over  long  distances.  This 
would  say  that  the  P,y  probabilities  of  events  in  y 
associated  with  the  first  particle  do  not  depend  on 
the  apparatus  setting  j  for  the  second  particle. 
Correspondingly,  the  P,y  probabilities  of  events  in 
y  associated  with  the  second  particle  do  not 
depend  on  the  apparatus  setting  i  for  the  first 
particle. 

The  following  two  locality  conditions  refer  to  a 
chosen  collection  of  past  events.  Active  locality 
rules  out  intervention  and  also  rules  out  influence 
on  the  past  events.  Passive  locality  is  relative  to 
the  same  collection  of  past  events. 

AL:  Active  locality:  The  P,y  probabilities  of 
events  defined  by  events  in  y  and  do 
not  depend  on  j.  The  P,^  probabilities  of 
events  defined  by  events  in  y  and  do 
not  depend  on  i.  (An  intervention  on  one 
particle  cannot  affect  the  relation  of  the 
other  particle  to  its  past.) 

Note:  It  follows  that  probabilities  of  past 
events  in  are  computed  with  a  probabil¬ 
ity  assignment  P  that  depends  on  neither 
apparatus  setting. 

PL:  Passive  locality:  If  an  event  A  in  y  is 
equivalent  to  an  event  A'  in  y'  with  re¬ 
spect  to  P,y,  then  they  are  both  equivalent 
to  an  event  Aq  in  (A  perfect  coinci¬ 
dence  must  have  an  explanation  in  a  com¬ 
mon  past.) 

BELL’S  SECOND  THEOREM 

Under  the  hypotheses  of  active  and  passive 
locality,  there  can  be  no  family  of  probability  mod¬ 
els  that  reproduces  the  results  of  quantum  me¬ 
chanics. 
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The  proof  is  not  difficult,  and  here  is  a  sketch. 
Let  A  in  ^  be  associated  with  one  particle,  and  let 
K  in  9^  be  associated  with  the  other.  Take  them 
equivalent.  By  passive  locality,  A  is  equivalent  to 
a  past  event  Aq  in  ■^0  with  respect  to  the  probabil¬ 
ities  By  active  locality,  for  these  events  we  get 
the  same  results  if  we  compute  with  the  probabili¬ 
ties  Thus,  A  is  equivalent  to  A^  with  respect 
to  the  probabilities 

By  similar  reasoning,  B'  is  equivalent  to  Bq 
with  respect  to  the  probabilities 

It  then  follows  from  simple  probability  argu¬ 
ments  that  (A  and  not  B')  is  equivalent  to  (Aq 
and  not  Bq)  with  respect  to  the  probabilities  ^ab‘ 
However,  the  probability  of  the  past  event  (Aq 
and  not  Bq)  is  computed  with  a  probability  assign¬ 
ment  P  not  depending  on  the  apparatus. 

It  follows  that 

P^J  A  and  not  B']  =  P[  Aq  and  not  Bq]  . 

This  can  be  repeated  with  all  three  terms.  The 
probabilities  no  longer  depend  on  the  decision  on 
the  apparatus.  Thus  we  can  now  prove  Bell's  sec¬ 
ond  theorem  with  exactly  the  argument  used  for 
Bell's  first  theorem. 

The  conclusion  of  this  theorem  is  that  if  quan¬ 
tum  mechanics  is  a  correct  way  to  make  probabil¬ 
ity  predictions,  then  we  must  give  up  the  combi¬ 
nation  of  active  and  passive  locality.  We  shall  see 
how  this  happens  in  the  various  interpretations. 

Notice  that  in  order  to  apply  the  theorem  in  a 
specific  case  one  must  define  the  collection  of  past 
events  There  may  be  some  choice  in  this  defi¬ 
nition.  Let  us  look  at  two  extreme  examples. 

Example  1.  Assume  there  is  no  relevant  past. 
This  means  that  the  only  past  events  are  trivial:  the 
sure  event  with  probability  one  and  the  impossible 
event  with  probability  zero.  In  this  case  active 
locality  is  merely  the  condition  that  there  is  no 
possibility  of  intervention.  If  this  is  so,  then  it  is 
only  passive  locality  that  fails. 

Example  2.  Assume  that  for  each  pair  of  appara¬ 
tus  settings  z,  j  the  theory  is  deterministic  except 
for  random  initial  conditions  at  the  past  time.  The 
probability  assignment  P^j  makes  every  event 
equivalent  to  a  corresponding  event  in  9q.  Passive 
locality  obviously  holds,  so  active  locality  must 
fail.  How  does  this  happen?  Suppose  an  event  A 
in  9^  is  equivalent  to  Aq  in  9q  with  respect  to  the 
probability  measure  Then  A  may  be  equiva¬ 
lent  to  some  other  event  Aq  with  respect  to  the 
probability  measure  ^ab-  The  change  of  the  appara¬ 


tus  setting  for  the  second  particle  has  changed  the 
relation  between  past  and  future  for  the  first 
particle. 

Quantum  mechanics  has  been  well  tested,  and 
there  is  now  considerable  experimental  support  for 
the  strong  correlations  over  long  distances  pre¬ 
dicted  by  the  theory.  Quantities  accessible  to  ex¬ 
periment  seem  to  be  local  in  the  sense  that  the 
correlations  cannot  be  used  to  make  interventions. 
However,  according  to  Bell's  result,  there  is  no 
collection  of  past  events  that  simultaneously  sat¬ 
isfy  active  and  passive  locality.  The  correlations 
seem  unexplainable  by  a  common  past. 


Measured  Events 

The  traditional  textbook  interpretation  of  quan¬ 
tum  mechanics  is  the  Copenhagen  interpretation. 
In  this  interpretation  each  consistent  set  of  quan¬ 
tum  events  defines  probabilities  having  a  fre¬ 
quency  interpretation.  But  which  consistent  set  of 
events  is  chosen  comes  from  outside  the  theory,  by 
a  choice  of  external  measuring  device. 

The  problem  that  some  people  see  with  the 
Copenhagen  interpretation  is  that  it  gives  no  ac¬ 
count  of  a  reality  that  is  not  subject  to  measure¬ 
ment.  According  to  Heisenberg,  "the  idea  of  an 
objective  real  world  whose  smallest  parts  exist 
objectively  in  the  same  sense  as  stones  and  trees 
exist,  independently  of  whether  we  observe 
them... is  impossible."  (The  quote  is  from  W. 
Heisenberg,  Physics  and  Philosophy,  Harper  and 
Row,  New  York,  1958,  p.  129).  There  must  always 
be  a  measuring  apparatus  external  to  the  quantum 
system.  One  can  perhaps  draw  the  line  between 
quantum  system  and  measuring  apparatus  at  vari¬ 
ous  levels,  but  there  is  never  a  theory  of  what 
happens  in  a  system  that  is  not  undergoing 
measurement. 

Furthermore,  it  is  not  evident  what  counts  as  a 
suitable  external  measurement.  Clearly  a  person 
with  a  Ph.D.  in  physics  and  a  research  grant  is 
allowed  to  perform  a  measurement.  Maybe  the 
measurement  can  be  conducted  automatically  with 
the  experimental  apparatus  purchased  on  the  grant. 
But  how  about  the  world  before  grants.  Can  a 
dinosaur  perform  a  measurement?  And  how  about 
the  world  on  a  grander  scale?  Does  the  universe 
have  a  quantum  description? 

The  Copenhagen  interpretation  does  not  evade 
the  locality  issues.  It  is  trivially  true  that  different 
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probability  measures  correspond  to  different 
choices  of  measurement.  The  event  (A  and  not  B') 
is  not  compatible  with  the  event  (B  and  not  C). 
Thus  pjA  and  not  B']  is  computed  with  a  differ¬ 
ent  probability  measure  from  that  used  for  B 
and  not  C].  However,  the  coincidences  are  still 
present,  and  according  to  BelTs  second  theorem 
they  have  no  explanation  in  a  common  past. 


Macroscopic  Events 

Perhaps  there  should  be  an  interpretation  that 
does  not  depend  on  the  notion  of  measurement. 
There  are  a  number  of  more  recent  proposals,  due 
to  various  authors,  that  I  will  put  in  one  category. 
The  Gell-Mann  and  Hartle  version  is  perhaps  rep¬ 
resentative  [3].  They  key  concepts  are  consistent 
history,  macroscopic  event,  and  decoherence. 

In  Bell's  theorem  the  original  conjunctions  of 
quantum  events  are  between  compatible  events. 
So  any  peculiar  features  of  quantum  mechanics  are 
already  present  in  this  case.  Nevertheless,  some 
authors  have  proposed  that  problems  of  interpre¬ 
tation  may  be  clarified  by  defining  the  probabili¬ 
ties  of  conjunctions  of  certain  incompatible  events. 
Griffiths  was  one  of  the  pioneers  in  the  direction 
[4].  Here  is  a  brief  description  of  how  this  works. 

We  now  describe  an  operation  known  as  reduc¬ 
tion  or  collapse  of  the  state.  Consider  a  fixed  state, 
given  by  a  one-dimensional  subspace  of  the  Hilbert 
space.  Let  B  be  an  event  with  nonzero  probability 
P[B].  The  collapsed  state  is  the  projection  of  the 
one-dimensional  subspace  onto  the  subspace  cor¬ 
responding  to  B.  It  is  a  new  quantum  state.  Let  us 
denote  the  probability  of  some  other  event  A 
computed  with  respect  to  the  collapsed  state  by 
P[  A|  B].  We  might  want  to  call  this  the  conditional 
probability  of  A  given  B.  Sometimes  we  may 
think  of  B  as  being  defined  before  A,  so  that  this 
is  a  probability  computed  in  the  course  of  a 
history. 

Given  the  conditional  probability,  we  might  try 
to  define  the  probability  of  the  history  by  the  usual 
formula 

P[  Aand  B]  =P[A|B]P[B]. 

It  is  important  to  realize  that  the  probability  of  the 
conjunction  is  thought  of  as  being  computed  with 
respect  to  the  original  state.  The  collapse  is  merely 
a  stage  in  the  mathematics. 

When  the  events  are  not  compatible,  this  defini¬ 
tion  can  lead  to  paradoxes.  Consider  A  and  B  to 


be  one-dimensional  subspaces  at  an  angle  of  7r/4. 
Take  the  state  to  be  perpendicular  to  A  but  at  an 
angle  7r/4  from  B.  Then  P[  B]  =  P[  A|B]  =  f  so 
P[  A  and  B]  =  Yet  P[  A]  =  0. 

All  that  remains  of  the  additivity  condition  is 
the  equation 

P[A]  =  P[A  and  B]  +  P[A  andnot  B]  +  I(A,  B). 

The  interference  term  satisfies  -^</(A,  B)<^ 
and  measures  the  failure  of  additivity. 

Let  A  and  B  be  arbitrary  quantum  events.  We 
say  that  B  followed  by  A  form  a  consistent  history 
with  respect  to  the  given  quantum  state  if 

P[  A]  =  P[  A  and  B]  +  P[  A  andnot  B]. 

This  says  that  for  this  particular  state  the  interfer¬ 
ence  term  is  zero. 

The  idea  now  is  that  this  should  apply  to  a 
certain  class  of  macroscopic  events.  For  this  class, 
the  decoherence  effect  should  make  the  interference 
terms  vanishingly  small.  Decoherence  is  a  prop¬ 
erty  of  large  systems  that  says  that  it  is  impossible 
to  distinguish  quantum  mechanics  from  ordinary 
probability  without  examining  in  detail  every  par¬ 
ticle  in  the  system.  One  of  the  early  formulations 
of  decoherence  is  by  Hepp  and  Lieb  [5];  see  [6]  for 
extensions.  There  is  survey  of  the  subject  and  its 
history  in  the  article  by  Zurek  [7].  The  book  of 
Omnes  [8]  has  an  extensive  discussion.  The  deco¬ 
herence  effect  is  not  automatic;  it  is  supposed  to  be 
a  consequence  of  the  dynamics  of  the  particular 
system. 

To  see  how  this  might  work,  consider  a  system 
with  two  rather  separate  parts.  One  part  is  associ¬ 
ated  with  collective  events,  typically  macroscopic 
events  defined  by  the  values  of  collective  coordi¬ 
nates  (center  of  mass  and  so  on).  The  other  part  is 
an  environment,  presumably  part  of  the  atomic 
world.  This  part  is  associated  with  environmental 
events.  It  is  assumed  that  the  collective  events  are 
compatible  with  the  environment  events. 

We  consider  a  situation  where  the  two  parts 
have  interacted  in  such  a  way  that  the  environ¬ 
mental  part  has  become  correlated  with  the  collec¬ 
tive  part.  We  say  that  the  state  makes  a  collective 
event  B  decoherent  if  B  is  equivalent  to  some 
environmental  event  £  with  respect  to  the  state. 
The  environment  thus  contains  a  record  of  the 
event  B,  in  that  when  B  happens  or  does  not 
happen,  the  corresponding  £  happens  or  does  not 
happen. 
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DECOHERENCE  THEOREM 

Let  A  and  B  be  collective  events.  Suppose  the 
state  makes  B  decoherent.  Then  the  history  B 
followed  by  A  is  consistent  with  respect  to  the 
state. 

Proof.  Let  E  be  the  environmental  event  that 
makes  B  decoherent  in  the  given  state.  Since  A 
and  E  are  compatible,  we  have 

P[  A]  =  P[A  and  E]  -\r  P[  A  and  not  E]. 

Since  B  and  E  are  equivalent  in  the  state,  the 
collapsed  state  with  respect  to  B  is  the  same  as  the 
collapsed  state  with  respect  to  £.  Not  only  P[  B]  = 
P[E],  but  also  P[A\B]  =  P[  A|E].  Hence  we  can 
replace  £  by  B  (and  similarly  not  E  by  not  B)  and 
obtain 

P[A]  =P[AandB]  +  P[  A  and  not  B]. 

The  prediction  of  such  a  theory  consists  of  prob¬ 
abilities  of  sets  of  histories  of  macroscopic  objects. 
The  histories  are  part  of  the  world  independent  of 
anyone's  choice  of  what  to  measure.  The  outcome 
in  a  particular  situation  is  the  history  that  actually 
occurs. 

The  construction  of  such  a  theory  in  full  detail 
is  a  formidable  task.  The  purpose  of  the  theory  is 
to  make  frequency  predictions  about  sets  of  consis¬ 
tent  histories,  independent  of  any  notion  of  mea¬ 
surement.  Therefore  it  must  specify  which  particu¬ 
lar  collection  of  consistent  histories  is  relevant. 
Gell-Mann  and  Hartle  [3]  have  given  criteria  that 
such  a  collection  must  satisfy.  However,  questions 
of  the  existence  and  of  the  uniqueness  of  such  a 
collection  are  difficult. 

The  puzzling  aspect  of  such  an  interpretation  is 
that  it  gives  little  role  to  the  subatomic  world 
except  as  a  way  of  making  predictions  about  the 
world  on  a  large  scale.  If  there  were  a  universe  of 
only  a  few  particles,  in  which  decoherence  effects 
did  not  play  a  role,  then  such  an  interpretation 
would  be  vacuous. 

Note  also  that  any  such  interpretation  must 
confront  the  implications  of  Bell's  theorems.  By 
Bell's  first  theorem,  there  will  not  be  a  macroscopic 
event  in  the  set  of  consistent  histories  correspond¬ 
ing  to  each  quantum  event.  There  may  be  macro¬ 
scopic  events  corresponding  to  certain  atomic  level 
quantum  events,  those  for  which  a  suitable  cou¬ 
pling  to  the  macroscopic  world  exists.  An  example 
of  this  might  be  the  pointer  reading  corresponding 
to  the  deflection  of  a  particle  by  a  magnetic  field. 


Perhaps  there  is  no  possibility  of  intervention  over 
long  distances.  However,  according  to  Bell's  sec¬ 
ond  theorem,  the  interpretation  must  admit  the 
violation  of  the  combination  of  active  and  passive 
locality. 


Position  Events 

Another  class  of  interpretation  describes  the 
world  on  the  atomic  scale,  but  only  to  events 
defined  in  terms  of  the  position  of  particles.  One 
can  look  at  the  positions  over  time,  so  one  can 
assign  probabilities  to  sets  of  particle  trajectories. 
The  outcome  in  a  given  situation  is  a  particular 
particle  trajectory. 

Two  variants  are  Bohm's  theory  (Bohmian  me¬ 
chanics)  and  Nelson's  theory  (stochastic  mechan¬ 
ics).  The  literature  on  Bohm's  theory  can  be  traced 
back  from  a  recent  collection  [9],  and  work  is 
ongoing  [10,  11].  Nelson's  theory  is  explained  in 
his  books  [12,  13].  The  fact  that  there  are  two 
competing  theories,  giving  different  predictions  for 
the  particle  trajectories,  is  a  weakness.  Perhaps 
there  is  some  comfort  in  the  fact  that  the  fixed  time 
position  probabilities  are  the  same  in  both  theories 
and  agree  with  those  of  quantum  mechanics.  And 
this  may  be  all  that  is  possible  to  measure  in 
practice. 

These  theories  discard  the  usual  elaborate  su¬ 
perstructure  of  quantum  mechanics,  in  which  there 
are  events  of  all  sort.  Probability  is  only  assigned 
to  position  events  (at  various  times).  It  is  surpris¬ 
ing  how  little  this  seems  to  matter.  It  is  wise  to 
consider  these  theories  before  making  general  dec¬ 
larations  on  the  interpretation  of  quantum  me¬ 
chanics,  since  often  they  provide  a  counterexam¬ 
ple. 

The  Bell  theorems  again  apply  to  these  theories. 
Spin  is  determined  by  introducing  certain  mag¬ 
netic  fields,  which  deflect  the  particles,  eventually 
resulting  in  a  change  in  position.  The  introduction 
of  the  magnetic  fields  does  change  the  probabili¬ 
ties,  as  envisaged  in  Bell's  second  theorem  [14]. 
The  main  defect  of  these  theories  is  that  with  the 
natural  definition  of  past  events  they  violate  active 
locality,  at  least  on  the  level  of  the  trajectories. 
However,  if  one  looks  only  at  the  final  positions  of 
the  particles,  the  predictions  agree  with  quantum 
mechanics.  So  again  there  is  no  practical  possibility 
of  active  intervention  for  such  events,  and  the 
situation  is  as  for  the  Copenhagen  interpretation. 
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I  find  stochastic  mechanics  attractive,  so  I  will 
give  a  brief  account.  I  will  begin  with  the  descrip¬ 
tion  of  the  time-reversible  stationary  diffusion  pro¬ 
cess  associated  with  the  quantum  mechanical 
ground  state.  Then  I  will  give  some  indication  of 
the  diffusion  process  associated  with  a  quantum 
state  that  evolves  in  time.  In  both  cases  the  out¬ 
come  of  the  probability  experiment  is  a  particular 
particle  trajectory. 

In  ordinary  quantum  mechanics  the  position 
events  at  fixed  time  are  compatible,  but  position 
events  at  different  time  are  usually  not  compatible. 
With  the  diffusion  process  we  can  define  the  con¬ 
junction  of  position  events  at  different  times.  Of 
course,  this  is  a  deviation  from  the  usual  interpre¬ 
tation  of  quantum  mechanics. 

Consider  the  Hilbert  space  of  square  integrable 
wave  functions  and  the  Schrddinger  operator 

1  h} 

H=  +  y(x). 

2  m  ^ 

Assume  that  H  has  a  lowest  energy  eigenfunction 
il/>  0  with 

Hip  =  Eip. 

A  position  event  at  fixed  time  is  defined  by  a 
subset  of  position  space.  If  A  is  such  a  subset,  then 
there  is  a  corresponding  indicator  function.  This 
function  is  1  on  the  set  A  and  0  on  its  comple¬ 
ment.  We  also  denote  the  indicator  function  by  A. 
Multiplication  by  the  indicator  function  is  the  pro¬ 
jection  onto  the  corresponding  closed  subspace  of 
Hilbert  space. 

We  let  A(s)  be  the  event  that  the  trajectory  of  a 
particle  is  in  the  set  A  at  time  s.  Let  A^  be  the 
operator  exp[-s(H  -  E)/fi)Aexp(s(H  -  E)/fi)]. 

Now  consider  two  such  position  events,  A  and 
B  and  corresponding  times  s  <  t.  Then  the  proba¬ 
bility  of  the  particle  being  A  at  time  s  and  in  B  at 
time  t  is  given  by 

P[  A(s)  and  B(t)]  =  (ip,  A^B^ip) 

=  {ip, 

Note  that  this  is  a  well-defined  probability  for  the 
position  of  the  particle  at  two  different  times.  For 
t  =  s  the  particles  have  not  moved,  so  this  is  just 
the  probability  that  the  particle  is  in  the  intersec¬ 
tion  of  the  two  sets.  However,  as  f  —  s  becomes 
infinite,  the  probability  approaches  {ip,  Aip){ip, 
Bip).  This  says  that  the  two  particle  positions  be¬ 
come  independent. 


The  probability  calculation  may  be  reproduced 
in  the  language  of  diffusion  theory.  Define  the 
backward  diffusion  operator  L  and  forward  diffusion 
operator  L*  by 

1 

L  =  --(H  -  E)/hiP 

and 

1 

L*  =  -ip(H  -  E)/h~. 

Note  that  LI  =  0  and  L^p  —  0,  where  p  =  ip^. 

The  explicit  form  of  these  operators  is  as  fol¬ 
lows.  Define  the  diffusion  coefficient  cr^  =  h/m. 
Define  the  drift  vector  field  u  to  be 

u  =  a  - . 

Then  the  backward  diffusion  operator  is 

The  backward  diffusion  operator  gives  the  ex¬ 
pected  values  at  future  times  as  a  function  of  the 
initial  point.  Thus  e^^f(x)  is  the  expected  value  of 
/  of  the  particle  position  at  time  t,  when  the 
particle  is  started  at  point  x.  The  forward  diffusion 
operator  is 

L*  =  -  V^*(w  ). 

The  forward  diffusion  operator  describes  how 
probability  evolves.  Thus,  if  g  is  the  probability 
density  at  time  zero,  then  is  the  probability 
density  at  time  t. 

The  quantum  mechanical  operator  ~(H  —  E)/h 
and  the  diffusion  operators  L  and  L*  all  have  the 
same  spectrum,  since  they  are  similar.  So  this  is  a 
perfect  correspondence  between  quantum  mechan¬ 
ics  and  diffusion  theory. 

We  can  now  rewrite  the  probability  that  a  dif¬ 
fusing  particle  is  in  A  at  time  s  and  in  B  at  time  t. 
This  is 

P[  A(s)  and  Bit)]  =  {  p,  Ae^^~^^^B) 

The  backward  operator  maps  events  from  future 
to  present,  while  the  forward  operator  maps  prob¬ 
abilities  from  present  to  future. 
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We  may  write  the  forward  diffusion  equation 
explicitly  as  a  partial  differential  equation 

dp  1  ,  , 

_  =  L*p  =  -o-^  •  (up). 

This  describes  how  probability  evolves  when  a 
particle  is  diffusing  randomly  with  diffusion  con¬ 
stant  and  drift  u.  The  drift  vector  field  u  is 
space-dependent.  If  we  start  the  particle  at  a  fixed 
point  Xq,  then  the  solution  is  a  fimction  p(xQ,  x,  t) 
that  describes  the  probability  density  after  time  t. 

If  we  start  the  particle  at  a  random  position 
with  density  p,  then  the  probability  density  at  a 
later  time  is  fp(xQ)p(xQ,  x,  t)  dxQ.  The  condition 
for  a  stationary  density  is 

Jp(xQ)p(xg,  X,  t)  dxo  =  p(x). 

This  is  equivalent  to 

0  =  L*p=icrW2p-V,-(iip). 

As  we  have  seen,  the  solution  is  p  ^  if/^.  The 
equilibrium  for  the  diffusion  is  the  quantum  me¬ 
chanical  density.  In  fact,  the  equilibrium  is  main¬ 
tained  by  the  mechanism  of  detailed  balance,  which 
says  that  the  equilibrium  is  due  to  a  local  balance 
between  the  density  gradient  and  the  drift.  This  is 
expressed  mathematically  as 

0  =  |cr^  ^xP  ~ 

This  leads  to  the  following  physical  picture  for  a 
particle  in  a  quantum  mechanical  eigenstate.  The 
particle  diffuses  randomly,  but  the  drift  is  such 
that  it  compensates  for  the  diffusion  and  leads  to 
an  equilibrium.  This  probability  for  the  position  of 
the  particle  in  this  equilibrium  agrees  at  every 
time  with  the  prediction  of  quantum  mechanics. 
However,  the  diffusion  model  provides  a  picture 
of  the  random  trajectory  of  the  diffusing  particle. 
Such  a  picture  is  forbidden  in  the  ordinary  inter¬ 
pretation  of  quantum  mechanics. 

In  the  general  case  one  gets  a  diffusion  process, 
but  it  is  no  longer  homogeneous  in  time.  Consider 
a  solution  ip  of  the  time-dependent  Schrodinger 
equation.  Such  a  solution  is  in  general  complex. 
Define  two  vector  fields,  a  drift  u  and  a  velocity, 
V,  by 

2^^  . 

CT  -  ^  U  -\-  IV. 


These  are  both  drift  terms  in  the  diffusion  equa¬ 
tion.  Sometimes  u  is  called  an  osmotic  velocity 
and  V  a  current  velocity.  In  any  case,  the  corre¬ 
sponding  diffusion  process  is  given  by  the  forward 
equation 

^  =  -0-2  v^p  -  V^-  ((m  +  v)p) 

with  diffusion  coefficient  cr  ^  and  drift  u  +  v.  The 
coefficients  are  now  both  space  and  time  depen¬ 
dent.  Consider  the  solution  where  the  particle  is 
started  at  point  Xq  at  time  Iq.  The  solution 
P(xq,  X,  tQ,  t)  describes  the  probability  density  for 
diffusion  to  x  at  time  t. 

If  we  start  the  solution  at  time  with  probabil¬ 
ity  density  p(xq,  Iq)  then  the  probability  at  a  later 
time  is 

p(x,t)  =  jp(Xf,,tQ)p(xo,  xJqJ)  dxQ. 

We  can  find  such  density  explicitly  by  taking  p  = 
lipf',  the  quantum  mechanical  prediction.  Then  it 
turns  out  that  there  are  separate  equations  involv¬ 
ing  u  and  V.  The  equation  involving  u  is  just  the 
detailed  balance  equilibrium  we  had  before,  but 
with  time-dependent  coefficient: 

0  =  P  ~ 

The  equation  involving  v  is  the  transport  equa¬ 
tion: 

dp 

Together  these  equations  imply  that  p  satisfies  the 
forward  diffusion  equation  with  diffusion  coeffi¬ 
cient  cr  ^  and  total  drift  u  -\-  v. 

The  fact  that  the  diffusion  equation  involves 
two  vector  fields  u  and  v  at  first  seems  difficult  to 
motivate.  However,  u  does  not  change  under  time 
reversal,  while  v  changes  sign  under  time  reversal. 
Thus  the  role  of  the  drift  u  is  to  attempt  to  estab¬ 
lish  equilibrium,  while  the  role  of  the  velocity  v  is 
to  effect  overall  movement. 

Bohm's  theory  differs  from  stochastic  mechanics 
in  that  the  motion  is  deterministic,  except  for  the 
random  initial  conditions.  In  Bohm's  theory  the 
motion  is  given  by  the  velocity  field  v  alone. 
Clearly  there  is  a  sense  in  which  Bohm's  theory  is 
much  simpler.  However,  stochastic  mechanics  has 
the  attractive  feature  that  it  gives  a  natural  role  to 
the  diffusion  constant  cr^  =  fi/m  that  is  derived 
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from  Planck's  constant  and  the  mass.  It  is  amusing 
to  note  that  when  m  is  the  mass  of  an  electron,  the 
diffusion  constant  cr^  has  a  rather  memorable  and 
reasonable  value,  about  one  square  centimeter  per 
second. 


Conclusion 

According  to  the  mathematical  apparatus  of 
quantum  mechanics,  each  quantum  event  has  a 
probability.  The  question  is  whether  these  events 
correspond  to  sets  of  outcomes,  so  that  the  proba¬ 
bility  has  a  frequency  interpretation.  Here  is  the 
situation  in  the  three  interpretations,  (i)  In  the 
Copenhagen  interpretation  each  compatible  collec¬ 
tion  of  quantum  events  has  a  corresponding  set  of 
outcomes.  However,  the  possible  outcomes  are  not 
intrinsic  to  the  system  but  depend  on  a  choice  of 
external  measurement,  (ii)  In  interpretations  where 
events  consist  of  sets  of  consistent  histories  subject 
to  a  decoherence  condition,  the  outcomes  are 
macroscopic  histories.  However,  there  need  be  no 
frequency  interpretation  for  the  probabilities  of 
quantum  events  on  the  atomic  scale,  (iii)  In  the 
interpretations  in  which  the  outcomes  are  particle 
trajectories,  the  only  relevant  probabilities  are  those 
corresponding  to  position  events  at  various  times. 
So  the  probabilities  of  other  quantum  events  either 
have  no  frequency  interpretation  or  their  interpre¬ 
tation  is  ultimately  expressed  by  position  events. 

The  subspaces  of  Hilbert  space  describes  a  huge 
collection  of  events.  Any  interpretation  of  quan¬ 
tum  mechanics  must  provide  an  additional  crite¬ 
rion  to  select  a  relatively  small  collection  of  events 
for  which  the  probabilities  have  a  frequency  inter¬ 


pretation.  In  the  Copenhagen  interpretation  this 
is  given  by  choice  of  a  measurement  and  a  cor¬ 
responding  set  of  outcomes.  Unfortunately,  this 
choice  is  external  to  the  theory.  On  the  other  hand, 
the  macroscopic  event  and  position  event  interpre¬ 
tations  have  a  single  set  of  outcomes  and  assign 
probabilities  to  at  least  some  quantum  events.  They 
point  the  way  to  a  possible  understanding  of  quan¬ 
tum  mechanics  as  a  description  of  an  isolated 
system.  Perhaps  that  is  enough. 
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ABSTRACT:  Recent  discoveries  show  that  phenomena  in  highly  excited  states  of 
atoms  and  molecules  are  qualitatively  different  from  the  standard  ''planetary"  model  of 
the  atom  and  the  near-rigid  model  of  molecules,  characteristic  of  these  systems  in  their 
low-energy  states.  Highly  excited  states  of  two-electron  atoms  show  "molecular"  behavior 
of  a  nonrigid  linear  structure  undergoing  collective  rotation  and  vibration.  Highly  excited 
states  of  molecules  display  strong  effects  of  normal  mode  anharmonicity  and  coupling 
that  induce  bifurcations  on  the  route  to  molecular  chaos  and  isomerization.  Hints  of  a 
unified  view  for  both  systems  are  emerging  in  terms  of  approximate  dynamical  symmetry 
principles  and  methods  of  nonlinear  classical  mechanics  such  as  bifurcation  theory. 
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Introduction 

A  surge  of  new  knowledge,  experimental  and 
theoretical,  of  highly  excited  quantum  states 
of  few-body  systems  is  leading  to  a  new  view  of 
the  motion  of  electrons  in  atoms  and  of  atoms  in 
molecules.  We  are  accustomed  to  thinking  of  atoms 
in  their  ground  states  in  terms  of  the  "planetary'" 
model  of  independent  particles,  described  by  an 
electronic  configuration,  e.g.,  the  configuration 
of  the  ground  state  of  helium.  In  contrast  to  this 
nearly  uncorrelated,  nonrigid  structure  of  an  atom 
in  its  ground  state,  the  ground-  and  low-energy 
rotation-vibration  states  of  molecules  are  de- 
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scribed  in  terms  of  the  normal  mode  picture  of 
small  amplitude,  nearly  harmonic  motion  about  a 
rigid  equilibrium  structure.  However,  efforts  of 
many  workers  in  recent  years  show  that  in  highly 
excited  states  each  system  at  high  energy  shows 
some  resemblance  to  the  behavior  that  describes 
the  other  at  low  energy.  In  highly  excited  states, 
the  motion  of  electrons  in  atoms  takes  on  charac¬ 
teristics  similar  to  molecular  rotation-vibration 
modes,  while  the  motion  of  atoms  in  highly  ex¬ 
cited  vibrational  states,  especially  in  systems  un¬ 
dergoing  isomerization,  shows  drastic  departures 
from  the  low-energy  normal  mode  picture.  More¬ 
over,  the  tools  used  to  describe  high-energy  behav¬ 
ior,  such  as  effective  Hamiltonians,  are  beginning 
to  give  signs  of  a  unified  formal  description  of  the 
two  systems. 

My  purpose  here  was  to  highlight  some  of  the 
most  important  of  these  developments.  I  give  ref¬ 
erence  to  some  of  the  key  original  sources,  in 

CCC  0020-7608  /  97  /  050399-1 1 


KELLMAN 


which  can  be  found  many  other  important  refer¬ 
ences  that  must  be  left  out  here.  Elsewhere  [1,  2],  I 
have  presented  distinct  points  of  view  regarding 
some  aspects  of  the  work  discussed  here. 


The  ''Molecular  Atom:’’  Rotations, 
Vibrations,  and  Supermultiplets 

The  electrons  in  atoms  and  molecules  do  not 
have  fixed  positions  relative  to  the  atomic  nuclei, 
and  it  is  not  evident  that  they  have  a  "structure" 
in  any  geometrical  sense.  In  the  classical  model  of 
the  hydrogen  atom,  the  electron  orbits  the  nucleus 
in  an  ellipse.  In  a  multielectron  atom,  the  Coulomb 
repulsion  between  electrons  substantially  perturbs 
the  elliptical  orbits.  This  is  the  problem  of  electron 
correlation.  Two-electron  atoms  continue  to  elicit 
intense  interest  as  the  simplest  prototype  of  the 
correlation  problem.  Conventionally,  one  describes 
an  excited  state  of  the  atom  in  terms  of  a  configu¬ 
ration.  For  example,  a  state  with  both  electrons  in 
a  2p  orbital  has  a  configuration  2 This  indepen¬ 
dent  particle  configuration  description  is  the  quan¬ 
tum  mechanical  analog  of  the  planetary  model.  By 
the  late  1970s,  accumulating  evidence  was  show¬ 
ing  that  the  configuration  description  is  severely 
inadequate  for  highly  excited  states.  Nonetheless, 
it  came  as  something  of  a  shock  [3-6]  to  discover 
that  the  excited  atom  had  many  properties  in  com¬ 
mon  with  the  motion  of  a  triatomic  molecule. 
There  is  now  compelling  evidence  that  in  states 
with  both  electrons  excited  from  the  ground  level 
the  atom  behaves  according  to  a  "molecular" 
model  of  collective  rotations  and  vibrations,  simi¬ 
lar  to  a  highly  nonrigid  linear  XYX  molecule,  with 
the  electrons  correlated  on  opposite  sides  of  the 
nucleus. 

I  will  outline  how  this  picture  has  developed, 
emphasizing  the  physical  ideas,  but  giving  high¬ 
lights  of  the  formal  development.  The  interested 
reader  should  consult  the  references  for  the  many 
technical  details. 

APPROXIMATE  DTOAMICAL  SYMMETRY  OF 

THE  TWO-ELECTROX  ATOM 

Classically,  the  electron  in  a  one-electron  atom 
with  Coulomb  potential  follows  an  elliptical  (Kep¬ 
ler  or  planetary)  orbit,  in  which  the  angular  mo¬ 
mentum  L  is  an  exact  constant  of  motion.  The 
three  components  of  the  angular  momentum  vec¬ 


tor  L  generate  the  group  of  rotations  in  three- 
dimensional  space,  denoted  0(3). 

The  elliptical  Kepler  orbit  has  far  more  symme¬ 
try  than  demanded  merely  by  rotational  invari¬ 
ance,  however,  a  planetary  orbit  that  conserves 
angular  momentum  L  could  still  have  a  precessing 
perihelion.  The  Kepler  orbit  has  a  constant  of  the 
motion  [7]  in  addition  to  the  angular  momentum, 
the  Runge-Lenz  vector  A.  This  vector  points  along 
the  semimajor  axis  of  the  ellipse.  It  is  defined  in 
terms  of  the  instantaneous  position  and  momen¬ 
tum  at  each  point  in  the  orbit,  but  as  a  constant  of 
motion,  it  is  invariant  in  size  and  direction.  The 
vector  A  together  with  the  angular  momentum  L 
generates  the  group  0(4)  of  rotations  in  an  abstract 
four-dimensional  space,  which  contains  the  ordi¬ 
nary  angular  momentum  group  0(3)  as  a  sub- 
group. 

In  the  two-electron  atom,  because  the  system  is 
invariant  under  rotations  in  space,  the  total  angu¬ 
lar  momentum 

L  =  Li  +  L2  (1) 

is  an  exact  constant  of  motion  that  generates  a 
group  0(3)i2-  By  analogy  with  the  one-electron 
atom,  one  might  think  to  combine  L  with  the  total 
Runge-Lenz  vector 

A  =  Ai  +  A2  (2) 

to  form  a  two-electron  0(4)i2  group.  If  the  two- 
electron  system  behaved  like  a  "planetary"  atom, 
each  electron  would  move  in  an  independent  ellip¬ 
tical  orbit,  with  A  ^  A^  -h  A2  a  conserved  quan¬ 
tity.  However,  this  is  not  the  case,  because  A  is 
badly  broken  as  a  constant  of  motion  by  the  strong 
Coulomb  repulsion  between  electrons. 

Even  though  this  ©(4)^2  exact 

symmetry,  it  is  reasonable  to  try  to  use  it  as  an 
approximate  symmetry.  One  might  hope  this  would 
be  of  use  in  ordering  the  spectrum,  but  in  a  less 
complete  way  than  if  O  (4)^2  were  exact.  This  would 
be  very  much  in  the  spirit  of  the  use  of  approxi¬ 
mate  symmetries  earlier  in  nuclear  and  elementary 
particle  physics,  where  these  kinds  of  ideas  led  to 
developments  such  as  the  quark  model.  Unfortu¬ 
nately,  when  this  was  tried  for  the  two-electron 
atom,  it  was  found  [8]  that  0(A)^2  was  not  of  much 
use  even  in  this  limited  sense  of  an  approximate 
symmetry. 

A  key  advance  came  when  Wulfman  [9]  and, 
independently,  Herrick  and  Sinanoglu  [10,  11] 


400 


VOL  65,  NO.  5 


NONRIGID  SYSTEMS  IN  CHEMISTRY 


tried,  instead  A  of  (2),  the  operator 

B  -  Ai  -  A2,  (3) 

where  the  crucial  difference  is  the  minus  sign,  A 
heuristic  physical  picture  of  the  electron  correla¬ 
tion  is  shown  in  Figure  1.  The  idea  is  that  overall 
maximizing  the  difference  between  the  Runge-Lenz 
vectors  A^  and  A  2  minimizes  the  Coulomb  repul¬ 
sion  between  the  electrons.  It  was  found  that  B  is 
an  approximate  constant  of  motion,  unlike  the 
experience  with  A,  Furthermore,  it  is  easily  shown 
from  the  commutation  relations  [7]  that  L  and  B 
together  generate  a  distinct  O  (4)  group.  I  will  call 
this  approximate  symmetry  group  0(4)i2  to  dis¬ 
tinguish  it  from  the  unsuccessful  O  (4)^2  of  Eqs,  (1) 
and  (2). 

ROTATIO^IS  AND  VIBRATIONS  OF  THE 

“MOLECULAR’’  ATOM 

Of  course,  the  heuristic  picture  in  Figure  1  of 
electrons  moving  in  stable  ellipses  is  far  from  a 
literal  image  of  either  the  classical  or  quantum 
mechanics  of  the  atom.  A  more  realistic  depiction 
of  a  quantum  state  of  the  atom  with  both  electrons 
in  the  same  shell  is  given  in  Figure  2.  The  atomic 
structure  is  like  that  of  an  elongated  cloud  or  blob 
of  charge,  with  the  density  of  the  two  electrons 
concentrated  at  the  ends.  This  can  be  rationalized 
by  saying  that  the  repulsion  of  the  electrons  tends 
to  correlate  them  on  opposite  sides  of  the  nucleus. 
At  the  same  time,  a  residual  effect  of  the  quantum 
mechanical  configuration  (with  both  electrons  in 
the  same  shell  in  Fig.  2)  is  to  concentrate  the 
electrons  very  loosely  at  a  most  favored  distance 
from  the  nucleus.  This  structure  of  the  electron 
cloud  surrounding  the  nucleus  is  that  of  a  highly 
nonrigid  system.  Nonetheless,  the  correlation  gives 


FIGURE  1.  Heuristic  picture  of  electron  correlation  in 
two-electron  atonn  with  both  electrons  in  the  same  shell. 
Because  of  the  electronic  repulsion,  thejnosHavorable 
configuration  maximizes  the  difference  B  =  -  A2  of 

Runge-Lenz  vectors. 


FIGURE  2.  A  depiction  of  the  two-electron  atom  with 
both  electrons  in  the  same  shell  (which  may  be  an 
excited  shell).  The  atom  has  the  structure  of  a  “blob”  of 
charge.  The  electrons  are  concentrated  on  opposite 
sides  of  the  nucleus,  because  of  the  electronic  repulsion, 
and  are  concentrated  at  some  most  probable  distance, 
because  of  the  shell  structure.  The  overall  structure  has 
the  symmetry  of  a  linear,  symmetric  XYX  molecule,  with 
X’s  the  electrons  and  Y  the  nucleus. 

the  atom  a  distinct  shape — in  fact,  a  shape  with 
the  symmetry  of  the  molecular  point  group 

If  the  atom  has  a  structure  with  a  symmetrical 
shape,  it  is  natural  to  wonder  if  this  structure  can 
exhibit  collective  modes,  e.g.,  rotations  and  vibra¬ 
tions.  The  beginning  of  the  "molecular"  model  of 
the  atom  came  when  Kellman  and  Herrick  [3]  used 
the  0(4)12  group  structure  to  classify  the  atomic 
levels  into  "rotor  series,"  i.e.,  O  (4)i2  multiplets  or 
sets  of  states  that  behaved  like  a  nonrigid  rotor. 
Next  [4-6],  the  0(4)i2  multiplets  were  combined 
into  "supermultiplets."  The  collection  of  the  O  (4)12 
rotor  series  into  the  more  comprehensive  super- 
multiplet  classification  describes  bending  vibra¬ 
tions  of  the  "molecular"  atom.  (The  role  of  radial 
excitations  in  this  picture,  which  came  to  be  better 
understood  later,  is  discussed  below.)  Figure  3 
depicts  the  rotations  and  bending  vibrations  of  the 
electron  cloud  in  the  "molecular"  atom.  The  su¬ 
permultiplets  give  the  classification  of  "intrashell" 
states  of  the  atom  with  both  electrons  in  the  n  =  2, 
3,  4,  and  5  shells  and  also  "intershell"  states  with 
the  electrons  in  different  shells.  The  supermulti¬ 
plets  for  both  electrons  in  n  =  3  are  shown  in 
Figure  4,  indicating  the  role  in  the  classification  of 
rotational  excitations,  bending  vibrations  with  vi¬ 
brational  angular  momentum  along  the  "molecu¬ 
lar"  symmetry  axis,  and  bending  vibrations  with 
no  angular  momentum.  Clearly,  this  classification 
reveals  a  great  deal  of  order  in  a  spectrum  which 
does  not  at  first  show  much  order  at  all.  Not  only 
is  the  spectral  pattern  of  Figure  4  similar  to  that  of 
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FIGURE  3.  Collective  rotation-vibration  modes  of  the 
“molecular”  atom.  The  structure  has  degenerate  bending 
vibrations  and  overall  rotations,  like  a  linear  triatomic 
molecule. 

a  linear,  nonrigid  triatomic  molecule,  but  the  pat¬ 
tern  of  terms  matches  that  of  a  symmetric 

triatomic  molecule  with  rotor-vibrator  states  obey¬ 
ing  the  constraints  imposed  by  Fermi  statistics  of 


the  outer  particles,  i.e.,  the  spin-orbital-parity  clas¬ 
sifications  of  the  terms  derived  from  atomic  shells 
and  the  symmetry  restrictions  of  the  XYX  molecule 
dovetail  perfectly  [6]. 

There  have  been  significant  developments  since 
the  work  of  Herrick  and  Kellman.  Some  highlights: 
I  showed  [12]  that  the  molecular  picture  could  be 
applied  to  atoms  with  two  electrons  outside  a 
closed  shell,  i.e.,  alkaline  earth  atoms.  Berry  and 
co-workers  [13,  14]  analyzed  the  atomic  wave 
functions  in  terms  of  a  basis  with  bending  vibra¬ 
tion  and  stretching  modes.  In  the  original  model  of 
Kellman  and  Herrick,  the  molecule  was  taken  to 
have  rotations  and  bending  vibrations,  but  not 
stretching  excitations,  inasmuch  as  the  radial  de¬ 
grees  of  freedom  seemed  quite  reasonably  to  be 
encompassed  in  the  n  =  2, 3, . . .  shell  structure. 
Ezra  et  al.  [15]  developed  a  semiclassical  treatment 
of  the  radial  motion  of  a  collinear  atom  and  found 
evidence  for  radial  motions  that  can  be  described 
as  an  antisymmetric  stretch.  Rost  et  al,  [16,  17] 
gave  a  unified  "molecular  orbital"  treatment  of 
the  radial,  bending,  and  rotational  motion.  They 
found  an  intricate  interweaving  of  bend  and  stretch 
excitations.  These  investigations  strongly  support 
the  idea  of  the  atom  as  being  like  a  highly  nonrigid 
triatomic  molecule,  but  now  understood  to  have  a 
systematic  involvement  of  radial  motion  in  the 
classification.  Richter  et  al.  [18]  discovered  the 


FIGURE  4.  Supermultlplet  rotor-vibrator  “molecular”  classification  of  doubly  excited  states  of  He  with  both  electrons 
excited  to  the  n  =  3  shell,  from  [4].  In  the  largest,  d  =  2  supermultlplet,  the  column  labeled  [4, 0]"^  is  a  multiplet  of 
0  (4)',2.  This  multiplet  is  a  “rotor  series”  of  pure  rotational  excitations  with  L  =  0, 1 , 2, 3, 4,  built  on  the  lowest  energy 
state,  which  is  Adjacent  columns  of  the  of  =  2  supermultlplet  are  rotor  series  built  on  excitations  of  the  bending 
vibration,  which  have  /  =  1  unit  of  angular  momentum  along  the  symmetry  axis  of  the  “molecular”  atom.  The  d  =  1 
supermultlplet  is  built  on  an  "“S®  state  with  two  bending  quanta  and  /  =  0  units  of  angular  momentum  along  the 
symmetry  axis.  It  has  its  own  rotor  series  and  /  =  1  excitations.  The  d  =  0  supermultlplet  is  a  lone  ^S®  state  with  four 
bending  quanta  and  /  =  0.  See  [4,  6]  for  more  details. 
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existence  of  ''frozen  planet"  states,  in  which  the 
two  electrons  are  locked  in  a  stable  configuration 
on  the  same  side  of  the  nucleus.  Ostrovsky  and 
Prudov  [19]  found  frozen  planet  states  in  which 
radial  and  bending  modes  of  the  "molecule"  are  in 
an  approximate  2:1  frequency  ratio,  suggestive  of 
molecular  Fermi  resonance!  Finally,  as  described 
later,  I  have  been  investigating  some  connections 
of  the  atomic  supermultiplet  classification  with 
recent  models  of  molecular  vibrations  that  use  Lie 
group  symmetries. 


Order  in  Chaos  in  Highly  Excited 
Molecules? 

We  have  seen  that  the  spectrum  of  a  highly 
excited  two-electron  atom  resembles  that  of  a  lin¬ 
ear  molecular  structure  undergoing  rotation  and 
vibration.  This  can  be  turned  around  to  ask.  What 
happens  to  a  real  molecule  when  it  is  excited  high 
above  the  ground  state?  How  does  one  describe 
the  changes  that  must  occur  from  the  low-energy 
picture  of  small-amplitude  oscillations  about  a 
near-rigid  structure? 

The  spectrum  of  vibrational  excitations  is  tradi¬ 
tionally  described  in  terms  of  normal  modes  [20]. 
However,  at  high  energies,  the  modes  can  become 
strongly  coupled  to  each  other.  As  the  atoms  per¬ 
form  progressively  large  excursions  from  the  equi¬ 
librium  rigid  structure,  the  motion  can  become 
chaotic,  in  the  sense  of  nonlinear  dynamical  sys¬ 
tems.  Eventually,  the  atoms  may  rearrange  into  a 
different  molecular  structure  or  the  molecule  can 
fragment.  The  normal  modes'  description,  ade¬ 
quate  for  a  rational  account  of  the  spectrum  at  low 
energies,  is  no  longer  physically  valid.  But  it  is 
precisely  here  that  the  spectrum  may  become  fear- 
somely  complex.  Spectra  may  be  our  window  on 
the  molecular  world,  through  which  to  obtain  in¬ 
formation  such  as  the  forces  between  atoms  which 
govern  molecular  dynamics  and  reactivity,  but  as 
with  all  visualization,  a  great  deal  of  sophisticated 
processing  is  necessary  to  extract  the  information 
encoded  in  the  raw  data  of  experience.  The  search 
for  new  ways  to  model  spectra  and  organize  them 
into  meaningful  patterns,  if  any  such  exist,  has 
been  a  concerted  effort  of  fundamental  chemical 
science  of  the  past  two  decades. 

Even  though  the  standard  approach  starts  with 
the  notion  of  a  set  of  normal  modes  [20],  at  all  but 
the  lowest  energies,  it  is  necessary  to  take  into 


account  anharmonicity  of  the  modes.  In  addition, 
there  are  important  couplings:  In  general,  the  vi¬ 
brations  are  not  truly  separable  into  independent 
modes.  (For  simplicity,  I  will  neglect  rotation.  Most 
of  the  statements  carry  over  when  rotation  is  in¬ 
cluded.)  At  low  energy,  the  combination  of  anhar¬ 
monicity  and  coupling  for  the  most  part  can  be 
"swept  under  the  rug."  But  this  becomes  unten¬ 
able  at  higher  energy,  where  anharmonicity  and 
coupling  become  inexorably  more  important. 

Here,  I  will  discuss  two  aspects  of  this  problem 
of  complex  spectra  of  polyatomics:  (1)  V\^en  the 
most  important  couplings  are  taken  into  account, 
are  there  any  remaining  approximate  constants  of 
motions?  If  so,  these  intact  dimensions  of  the 
molecular  phase  space  act  as  "bottlenecks"  to  en¬ 
ergy  flow  within  the  molecule.  This  will  be  impor¬ 
tant,  e.g.,  in  efforts  to  control  molecular  dynamics 
with  intense  laser  pulses.  (2)  What  happens  to  the 
dimensions  of  phase  space  that  are  not  left  intact? 
This  is  the  problem  of  the  branching  of  the  normal 
modes  in  bifurcations  on  the  route  to  molecular 
chaos.  I  will  outline  some  promising  approaches  to 
these  problems,  developed  through  intense  efforts 
of  many  workers. 

THE  SPECTROSCOPIC  FITTING 

HAMILTOXIAN  WITH  NORMAL  MODE 

COUPLINGS 

I  begin  by  describing  the  standard  effective 
Hamiltonian  used  to  fit  experimental  spectra.  The 
spectroscopic  Hamiltonian  (SH)  is  intended  as  an 
"effective  Hamiltonian"  to  bridge  the  gap  between 
experimental  spectral  data  and  the  fundamental 
description  of  the  true  molecular  quantum  me¬ 
chanics.  A  natural  question  is  how  the  SH  is  re¬ 
lated  to  the  true  Hamiltonian.  Fried  and  Ezra  [21] 
and  independently  Sibert  [22]  showed  how  to  ob¬ 
tain  the  SH  by  beginning  with  a  potential  surface 
and  transforming  approximately  to  SH  form  by 
means  of  perturbation  theory. 

The  SH  is  a  quantum  mechanical  model  of  vi¬ 
brational  normal  modes  and  their  couplings.  The 
SH  contains  a  number  of  variable  parameters. 
These  are  optimized  to  give  the  best  fit  to  experi¬ 
mental  data.  The  SH  is  in  no  sense  a  fundamental 
description  of  the  molecule.  That  involves  treating 
it  as  a  collection  of  nuclei  and  electrons  and  essen¬ 
tially  solving  for  the  exact  quantum  mechanics. 
When  feasible,  as  for  a  few  small  molecules,  that 
leads  to  a  molecular  potential  energy  surface,  with 
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vibrations  emerging  as  quantum  mechanical  mo¬ 
tion  of  the  atoms  on  this  potential.  This  is  very 
difficult  to  carry  out  in  practice. 

The  starting  point  of  the  SH  is  a  set  of  elemen¬ 
tary  normal  modes  described  by  a  Hamiltonian 
function  Hq,  which  depends  on  the  number  of 
quanta  in  each  of  the  modes  i.  The  simplest 
model  is  just  a  set  of  N  harmonic  normal  modes. 
However,  anharmonicity  is  evident  from  the  spec¬ 
tra  even  at  low  energy.  It  is  therefore  standard  to 
use  an  Hq  with  harmonic  terms  proportional  to  the 
number  of  quanta  and  frequency  co^  in  each 
mode,  but  also  anharmonic  terms  proportional  to 
higher  powers  of  the  quantum  numbers,  including 
cross-terms: 

N 

Ho  =  L  co^int)  +  'Lxiiinf 

i=\  i 

N 

**’  •  (^4) 

i<J 

One  can  try  to  fit  an  experimental  spectrum  by 
optimizing  the  parameters  etc.,  in  Hq.  For 
many  systems,  especially  at  low  energy,  this  may 
suffice  to  give  a  good  match  with  experiment. 
However,  a  Hamiltonian  of  the  form  Hq  often  is 
not  adequate.  In  such  cases,  coupling  between  the 
normal  modes  must  be  introduced  beyond  the 
cross-anharmonic  terms  For  example. 

Darling  and  Dennison  [23]  found  that  an  interac¬ 
tion  must  be  added  to  Hq  which  couples  the  stretch 
normal  modes  in  a  molecule  such  as  H2O.  This  is 
called  "Darling-Dennison  resonance"  because  of 
the  near-resonant  frequencies  of  the  symmetric  and 
antisymmetric  stretches  s  and  a.  Addition  of  the 
Darling-Dennison  coupling  gives  a  Hamilto¬ 
nian  H  =  Hq  -h  which  is  no  longer  separable 
into  modes  s  and  a. 

The  Darling-Dennison  coupling  Vdd  is  for  two 
modes  s  and  a  which  have  approximately  the 
same  frequency.  This  is  sometimes  called  a  1:1 
resonance  coupling.  Other  couplings  can  arise  be¬ 
tween  modes  with  frequency  ratio  n:m.  These  are 
referred  to  as  Fermi  resonances  [24].  For  example, 
there  is  a  prominent  2:1  Fermi  resonance  in  H2O 
between  the  symmetric  stretch  and  bend.  A  global 
fit  of  the  spectrum,  including  states  with  promi¬ 
nent  excitation  of  all  three  modes,  requires  a  mul¬ 
tiresonance  Hamiltonian: 

H  =  Ho  +  (5) 


The  multiresonance  character  of  the  spectra  has 
an  important  connection  with  identifying  approxi¬ 
mate  constants  of  motion  of  the  molecule.  These 
act  to  channel  energy  flow  within  the  molecular 
phase  space,  simplifying  the  spectra  and  the  corre¬ 
sponding  intramolecular  dynamics.  However, 
within  the  energy  flow  charmels  or  pathways  de¬ 
fined  by  the  constants,  the  resonances  can  produce 
bifurcations  of  the  normal  modes  and  classical 
chaos,  with  considerable  complications.  I  will  con¬ 
sider  these  in  turn. 

POLYAD  SYMMETRY,  APPROXIMATE 
CONSTANTS,  AND  CHANNELS  FOR 

MOLECULAR  ENERGY  FLOW 

Approximate  constants  of  motion  are  quantum 
numbers  that  are  left  intact  when  the  resonance 
couplings  needed  to  fit  the  spectrum  are  taken  into 
account.  If  these  exist,  they  are  very  important 
because  they  are  related  to  patterns  that  can  be 
uncovered  in  the  experimental  spectrum.  In  dy¬ 
namical  terms,  on  the  time  scale  corresponding  to 
the  level  of  energy  resolution  in  the  spectral  fit, 
they  channel  energy  flow  in  the  molecular  phase 
space  when  energy  is  deposited  at  some  specific 
location  in  the  molecule.  This  could  be  crucial  for 
efforts  to  control  molecules  with  intense  beams  of 
coherent  light  pulses.  The  theory  [21,  25]  for  find¬ 
ing  approximate  constants  and  energy  channels 
uses  a  simple  "resonance  vector"  analysis  of  the 
SH.  The  quantum  numbers  that  are  preserved  are 
"poly ad"  numbers  which  are  special  combinations 
of  the  zero-order  quantum  numbers. 

The  resonance  vector  method  was  first  applied 
for  experimental  data  [25]  to  absorption  spectra  of 
acetylene  (C2H2)  of  Smith  and  Winn  [26]  that 
probe  the  flow  of  energy  initially  deposited  in  the 
C — H  stretch  motions.  There  are  a  multiplicity  of 
resonance  couplings  at  high  energy.  Nonetheless, 
two  good  quantum  numbers  are  found  [25]  to 
remain  from  the  resonance  vector  analysis.  The 
polyad  analysis  was  applied  by  Jonas  et  al,  [27]  to 
spectra  which  probe  the  bends  of  acetylene,  with  a 
bearing  on  observation  of  hierarchies  of  time  scales 
for  the  energy  flow.  Temsamani  and  Herman  [28, 
29]  analyzed  the  polyads  in  constructing  an  effec¬ 
tive  SH  Hamiltonian.  Solina  et  al.  [30,  31]  directly 
"unzipped"  the  C2H2  spectrum  into  polyads. 
When  viewed  as  individual  polyads,  the  spectra 
reveal  a  great  deal  of  order  that  is  not  otherwise 
evident.  Most  recently,  direct  absorption  and  fluo¬ 
rescence  data  were  jointly  analyzed  [32]  from  the 
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point  of  view  of  polyad  constants,  with  the  goal  of 
constructing  the  SH  from  as  global  a  view  of  the 
molecular  phase  space  as  possible.  An  open  ques¬ 
tion  is  whether  the  polyads  can  be  further  ''un¬ 
zipped"  into  subpolyads,  revealing  still  more  hid¬ 
den  order  in  the  spectrum. 

BRANCHING  OF  THE  NORMAL  MODES  AND 

THE  STRUCTURE  OF  MOLECULAR  CHAOS 

As  shown  by  the  need  to  use  multiresonance 
Hamiltonians,  it  is  evident  that  the  couplings  pre¬ 
clude  using  the  normal  modes  by  themselves  in  all 
but  the  coarsest  description  of  the  spectra.  Some 
approximate  constants  of  motion  may  be  left  intact 
as  polyad  numbers.  As  seen  above,  these  are  ex¬ 
tremely  important,  yet  easy  to  abstract  from  the 
SH  fit  to  the  experimental  spectrum.  However, 
knowledge  of  the  intact  polyad  numbers  leaves 
crucial  questions  unanswered.  What  happens  when 
the  individual  normal  modes  are  destroyed  or 
modified  by  couplings?  What  are  the  signatures  of 
this  in  the  spectrum?  What  are  the  dynamical 
consequences,  e.g.,  the  changes  in  the  natural  mo¬ 
tions  of  the  molecule?  These  questions  are  impor¬ 
tant  for  using  spectra  to  infer  information  about 
molecular  potentials  and  may  be  crucial  for  efforts 
at  coherent  control  of  polyatomic  molecules.  From 
the  viewpoint  of  classical  mechanics,  the  destruc¬ 
tion  of  the  normal  modes  occurs  when  they  un¬ 
dergo  bifurcations.  When  a  normal  mode  bifur¬ 
cates,  new  types  of  anharmonic  or  nonlinear  mo¬ 
tions  are  formed  in  place  of,  or  in  addition  to,  the 
original  modes.  The  goal  of  recent  work  [1,  33-37] 
has  been  to  characterize  the  bifurcations  associated 
with  the  SH  applied  to  data  of  specific  molecules, 
in  order  to  identify  the  new  nonlinear  modes  and 
related  spectral  patterns  that  result  at  high  energy. 

A  difficult  aspect  of  using  bifurcation  theory  in 
spectral  analysis  is  the  high  number  of  degrees  of 
freedom  in  a  polyatomic  molecule.  One  confronts 
the  not-well-understood  problem  of  bifurcations  in 
a  high-dimensional  classical  phase  space.  In  addi¬ 
tion,  the  classical  dynamics,  in  general,  shows  a 
mixture  of  chaotic  and  regular  motion,  so  one  also 
faces  the  notorious  problem  of  the  connection  be¬ 
tween  chaotic  classical  dynamics  and  quantum 
states  in  this  "mixed"  regime. 

Completely  solving  for  the  bifurcations  of  a 
chaotic  Hamiltonian  is  presently  impossible  and 
probably  not  very  useful.  Instead,  using  as  an 
exemplar  the  three-mode  spectroscopic  Hamilto¬ 
nian  of  (5)  for  H2O,  work  has  focused  [36-40]  on 


obtaining  and  exploiting  the  large-scale  bifurcation 
structure.  By  this  is  meant  the  principal  branchings 
of  the  original  normal  modes  as  the  system  be¬ 
comes  chaotic.  This  gives  the  principal  structure 
around  which  the  chaotic  phase  space  is  orga¬ 
nized.  This  information  has  been  used  in  several 
methods  for  assignment  of  spectra  and  analysis  of 
energy-level  patterns  [37-40].  Although  success¬ 
fully  carried  out  for  H2O,  the  assignment  with 
spectral  pattern  recognition  is  largely  unchartered 
territory  for  larger  molecules,  with  investigations 
currently  in  progress. 


More  Alike  at  Higher  Energy? 

We  have  seen  that  at  high  energy  the  two-elec¬ 
tron  atom  has  spectral  patterns  similar  to  the  rota¬ 
tion-vibration  modes  of  a  molecule,  while  in  ac¬ 
tual  molecules  at  high  energy  the  vibrational 
modes  are  transformed  in  bifurcations  because  of 
resonant  couplings.  To  a  certain  extent,  the  "plane¬ 
tary"  atom  has  become  like  a  "molecular"  atom 
and  the  rigid  molecule  less  regular.  Still,  the  two 
systems  could  become  even  more  similar.  A  real 
molecule  can  have  excursions  of  the  atoms  com¬ 
pletely  out  of  its  equilibrium  configuration,  with 
the  possibility  of  multiple  minima  in  the  potential 
energy  surface,  as  in  isomerization.  Could  the  mo¬ 
tion  of  these  "wandering  atoms"  come  to  resemble 
the  correlated  but  nonrigid  motion  of  the  electrons 
in  an  atom?  On  the  other  hand,  could  electrons  in 
atoms  sometimes  become  rigidly  localized  to  such 
an  extent  as  to  speak  of  Fermi  resonances  between 
modes  of  well-defined  frequency?  Also,  could 
atoms  show  multiple  quasi-rigid  configurations, 
analogous  to  an  isomerizing  molecule?  There  is 
evidence  that  two-electron  atoms  have  "frozen 
planet"  states  [18]  in  which  the  electrons  really  are 
locked  into  a  second  "molecular"  configuration, 
with  the  electrons  on  the  same  side  of  the  nucleus. 
Furthermore,  there  are  indications  [19]  that  the 
frequencies  of  the  normal  modes  of  these  states  are 
well  defined  and  may  sometimes  be  in  Fermi  reso¬ 
nance,  i.e.,  have  close  to  integral  ratios.  On  the 
other  hand,  evidence  about  isomerizing  molecules 
is  now  starting  to  accumulate  [41-47],  and  distinct 
experimental  and  theoretical  hallmarks  are  appear¬ 
ing.  So,  there  is  the  prospect  of  soon  understand¬ 
ing  just  what  are  the  similarities  and  correspon¬ 
dences  between  truly  nonrigid  molecules  and 
highly  correlated  atoms. 
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ISOMERIZING  MOLECULES 

If  spectra  are  one  of  our  clearest  windows  on 
the  microscopic  world  of  molecules,  one  of  the 
most  elusive  behaviors  to  ''catch  in  the  act"  is 
molecular  rearrangement  or  isomerization.  How¬ 
ever,  rapid  progress  [41-47]  is  being  made  on  two 
prototype  systems:  the  isomerization  of  HCP  to 
HPC  and  that  of  HCN  to  HNC,  In  particular,  a 
bifurcation  of  the  type  known  as  "tangent"  or 
"saddle  node"  has  been  predicted  from  numerical 
studies  [46]  in  connection  with  the  approach  of 
HCP  to  the  isomerization  barrier.  Most  interest¬ 
ingly,  distinct  changes  in  the  spectrum,  quite  pos¬ 
sibly  associated  with  this  bifurcation,  have  re¬ 
cently  been  observed  [46].  Calculated  vibrational 
wave  functions  [47]  on  a  potential  surface  also 
show  interesting  behavior.  An  exciting  challenge 
will  be  the  development  of  spectroscopic  Hamilto¬ 
nians  for  isomerizing  systems,  including  the  de¬ 
scription  of  the  new  bifurcation  phenomena. 

OF  ‘FROZEN  ’  PLANETS  AND  FERMI 

RESONANCES  IN  ATOMS 

There  have  been  important  developments  in  the 
study  of  collective  behavior  of  the  electrons  in 
atoms  since  the  rotor-vibrator  supermultiplet 
model.  Some  highlights  include  investigation  of 
molecular-type  electronic  wave  functions,  with 
distinct  approaches  pursued  by  Berry  and  co¬ 
workers  [13,  14]  and  Rost  and  co-workers  [16,  17]. 
Another  development  is  the  investigation  of  radial 
motion  [15]  and  how  radial  motion  fits  [16, 17]  into 
the  supermultiplet  scheme.  Finally,  a  most  inter¬ 
esting  development  [18]  is  the  prediction  of  "frozen 
planet"  states.  In  these  states,  both  electrons  are  on 
the  same  side  of  the  nucleus!  As  illustrated  in 
Figure  5,  the  outer,  "frozen"  electron  hardly  moves 
at  all  and  the  inner  electron  oscillates  along  the 
axis  connecting  the  outer  electron  to  the  nucleus. 
Ostrovsky  and  Prudov  [19]  investigated  the  rela- 


FIGURE  5.  “Frozen  planet"  states  of  a  two-electron 
atom.  Electron  2  is  nearly  frozen  in  place,  while  electron 
1  oscillates  along  the  axis  between  electron  2  and  the 
nucleus.  See  [18,  19,  48]  for  more  discussion  of  frozen 
planet  states. 


tive  frequencies  of  radial  and  bending  modes  of 
the  "molecular"  atom  in  the  frozen  planet  configu¬ 
ration.  They  predicted  circumstances  in  which  the 
modes  are  in  an  approximate  2:1  frequency  ratio, 
suggestive  of  molecular  Fermi  resonance.  This 
suggests  an  analogy  with  bifurcation  phenomena 
in  molecular  vibrations.  Interestingly,  recent  classi¬ 
cal  calculations  of  Ostrovsky  and  Prudov  [48]  on 
the  frozen  planet  states  show  phase  space  behavior 
similar  to  that  observed  by  Li  et  al.  [33,  34]  in 
resonant  molecular  vibrations.  Further  interesting 
parallels  may  be  expected  between  the  frozen 
planet  states  and  the  isomerization  of  linear 
molecules,  including  the  novel  bifurcation  phe¬ 
nomena  in  the  latter. 


A  Unified  Model? 

We  have  seen  that  highly  excited  vibrations  and 
rotations  of  molecules  involve  significant  depar¬ 
tures  from  the  orderly,  rigid  structures  of  the 
ground  state,  while  highly  excited  states  of  elec¬ 
trons  in  atoms  behave  in  some  ways  like  highly 
nonrigid  molecular  structures.  The  relationships 
discussed  so  far  consist  of  analogies  between  phys¬ 
ical  descriptions  that  are  rooted  in  very  different 
formal  descriptions:  in  the  case  of  atoms,  the  O  (4) 
hydrogenic  shell  structure,  and  in  the  case  of 
molecules,  the  motion  of  atoms  on  a  potential 
energy  surface,  abstracted  in  the  spectroscopic 
Hamiltonian  (SH).  But  if  both  types  of  systems 
display  more  and  more  features  in  common,  such 
as  collective  modes,  resonances  with  bifurcations, 
and  multiple  stable  configurations,  it  is  reasonable 
to  ask.  Can  there  be  a  unified  formal  description 
encompassing  both  types  of  system?  These  ques¬ 
tions  are  in  the  early  stages  of  exploration,  and  I 
do  not  believe  definitive  answers  or  conclusions 
about  the  choice  of  formalism  are  anywhere  near 
at  hand.  Nonetheless,  I  would  like  to  discuss  one 
possible  approach  to  at  least  a  partially  unified 
description.  This  approach  uses  Lie  algebras  to 
describe  the  spectra  of  both  atoms  and  molecules. 
For  atoms,  this  is  not  surprising,  given  the  promi¬ 
nence  of  the  0(4)  algebra  in  the  work  on  atoms 
already  described.  Perhaps  more  surprising,  for¬ 
mally  related  approaches  have  been  developed  for 
molecular  rotation-vibration  spectra.  One  of  these 
has  come  to  be  known  as  the  vibron  model  [49-51]; 
another  uses  so-called  quantum  groups  [52,  53].  I 
have  used  a  vibron-type  model  for  an  investiga- 
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tion  of  the  atomic  supermultiplets  [54]  that  is  de¬ 
scribed  below.  The  reason  that  such  models  are 
candidates  for  a  unified  picture  is  that  they  start 
with  a  group  theoretical  shell  structure,  similar  to 
the  O  (4)  description  of  the  atom,  yet  they  encom¬ 
pass  an  effective  rotation-vibration  Hamiltonian, 
much  like  the  standard  spectroscopic  Hamiltonian 
(SH). 

THE  VIBRON  MODEL 

The  '"vibron"  model  [49-51]  is  related  to  early 
work  on  rigid  and  nonrigid  molecular  systems  in 
terms  of  correlation  diagrams  [55,  56].  The  basic 
idea  is  to  represent  the  rotation-vibration  spec¬ 
trum  of  each  bonded  pair  of  atoms  in  a  molecule 
in  terms  of  representation  of  a  continuous  group. 
The  group  is  the  unitary  group  LJ(4),  which  is 
related  to  the  O  (4)  rotation  group  described  above 
for  atoms.  The  unitary  group  can  be  thought  of  as 
the  generalization  of  rotations  to  spaces  of  com¬ 
plex  numbers.  In  the  vibron  model,  a  set  of  n 
bonded  pairs  is  described  by  first  coupling  a  sepa¬ 
rate  Lf(4)  for  each  bond:  U(4)i  X  Lf(4)2  X  •••  X 
LI(4)„. 

Then,  subgroup  chains  are  used  to  describe  op¬ 
erators  which  resemble  terms  in  the  standard 
zero-order  Hamiltonian  (4).  Couplings  analogous 
but  not  identical  to  the  Darling-Dennison  or  Fermi 
resonance  couplings  can  also  be  defined  in  terms 
of  the  group  operators.  Although  this  procedure 
might  appear  far  different  from  the  SH,  there  is  a 
term-by-term  correspondence  of  a  vibron  Hamilto¬ 
nian  to  a  similar  Hamiltonian  built  in  the  standard 
way.  It  is  therefore  not  surprising  that  sufficiently 
elaborate  versions  of  the  vibron  model  (with 
enough  independent  terms  in  the  Hamiltonian) 
can  give  spectral  fits  of  comparable  accuracy  to  the 
SH. 

A  RELATION  TO  THE  SUPERMULTIPLETS? 

The  vibron  model  also  turns  out  to  have  a 
possible  relation  to  the  supermultiplet  description 
of  two-electron  atoms.  Despite  the  spectral  pat¬ 
terns  revealed  by  the  supermultiplets  and  the  com¬ 
pelling  physical  motivation,  one  fundamental  as¬ 
pect  of  the  supermultiplets  was  mysterious  from 
the  beginning.  This  was  the  formal  group  theoreti¬ 
cal  basis  of  the  supermultiplet  classification.  The 
supermultiplets  were  constructed  by  collecting  to¬ 
gether  the  approximate  two-electron  O  (4)i2  multi- 
plets  into  larger  entities — the  supermultiplets.  The 


mystery  was  that  despite  the  beauty  of  their  struc¬ 
ture  and  ''molecular"  interpretation  the  supermul¬ 
tiplets  did  not  appear  to  correspond  to  any  larger 
mathematical  group  structure  encompassing  the 
two-electron  0(4)[2-  ^  recent  article  [54],  I  was 

able  to  shed  some  light  on  this  by  embedding  the 
O  (4)  group  structure  in  a  larger  group,  LI(4).  The 
(J(4)  lends  its  structure  to  a  larger  organization 
scheme,  the  supermultiplets,  than  can  be  obtained 
from  0(4)  alone.  This  embedding  of  the  atomic 
problem  in  a  [i(4)  structure  is  similar  to  the  way 
the  vibron  model  embeds  the  molecular  states.  In 
both  cases,  the  suitability  of  the  embedding  seems 
related  to  the  observation  that  the  pattern  of  corre¬ 
lation  from  nonrigid  to  rigid  states  in  linear 
molecules  [56]  results  in  groupings  analogous  to 
the  atomic  supermultiplets. 

FUTURE  PROSPECTS 

One  aspect  of  the  LI(4)  embedding  of  the  super¬ 
multiplets  is  still  puzzling,  but  possibly  connected 
to  important  future  insight  into  the  atomic  prob¬ 
lem  and  maybe  even  isomerizing  molecules.  This 
is  that  the  LI(4)  embedding  leads  to  "extra"  states 
beyond  those  obtained  from  the  0(4)^  X  0(4)2 
structure  assumed  in  starting  from  the  single-par¬ 
ticle  "planetary"  model  of  the  atom.  In  the  Ui4\ 
X  11(4)2  scheme  [54],  I  took  pains  to  try  to  ratio¬ 
nalize  away  these  extra  states.  However,  it  would 
not  be  necessary  to  do  this  if  the  "extra"  states 
really  were  new  physical  states  not  accounted  for  by 
assuming  an  O  (4)^  X  O  (4)2  planetary  structure  as 
a  starting  point.  Is  there  any  evidence  for  such 
states?  Some  but  not  all  of  the  "frozen  planet" 
states  really  appear  to  be  new  states.  This  is  not 
surprising,  because  it  is  hard  to  see  why  frozen 
planet  states  would  originate  from  an  0(4)^  X 
0(4)2  independent  particle  picture.  Suggestively, 
the  first  frozen  planet  state  in  a  shell  corresponds 
to  the  highest  ordinary  supermultiplet  state  in  the 
shell,  i.e.,  the  supermultiplet  state  with  the 
most  bending  excitation.  It  is  as  if  the  frozen  planet 
sequences  are  taking  off  from  where  the  shell-based 
supermultiplets  leave  off — hinting  that  both  are 
part  of  some  larger  classification.  Of  course,  this 
does  not  prove  that  the  frozen  planet  states  are  the 
"extra"  states  of  the  Ui4\  X  11(4)2  scheme.  In¬ 
deed,  that  scheme  predicts  many  more  new  states 
than  the  frozen  planet  states  already  known.  It  is 
unknown  how  many  dynamically  stable  states  will 
eventually  be  discovered  or  what  the  systematics 
of  their  classification  will  look  like.  However,  it  is 
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an  exciting  possibility  to  try  to  use  group  theoreti¬ 
cal  schemes,  such  as  the  U(4:\  X  11(4)2  model,  to 
make  predictions  for  computational  and  experi¬ 
mental  searches  for  novel  states. 

Another  challenge  is  the  development  of  spec¬ 
troscopic  Hamiltonians  for  isomerizing  systems, 
including  the  description  of  the  novel  bifurcation 
phenomena  now  being  uncovered  in  experiments 
and  numerical  studies.  This  opens  up  many  excit¬ 
ing  possibilities  for  the  development  not  only  of 
spectroscopic  Hamiltonians  of  the  standard  molec¬ 
ular  form,  but  also  effective  Hamiltonians  using 
Lie  algebraic  methods.  It  will  be  fascinating  in 
coming  years  to  watch  the  interplay  of  these  new 
developments  in  highly  excited  electron  correla¬ 
tion  in  atoms  and  highly  excited,  nonrigid 
molecules.  These  undertakings  are  likely  to  lead  to 
a  new  understanding  of  fundamental  problems  of 
correlation,  of  general  interest  not  only  for  atoms 
and  molecules  but  throughout  the  many  areas  of 
physical  science  where  correlation  problems  are 
important. 
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ABSTRACT:  The  self-consistent  Thomas-Fermi  atom  satisfying  Poisson's  equation  in 
D  dimensions  has  a  functional  derivative  of  the  kinetic  energy  T  with  respect  to  the 
ground-state  density  n(r)  proportional  to  But  the  Poisson  equation  relates 
to  "reduced"  density  derivatives  \  Thus  8T/8n  can  be  written  also,  quite 

compactly,  solely  in  terms  of  these  derivatives.  An  analytic  solution  to  the  Thomas-Fermi 
equation  in  D  dimensions  can  be  presented  as  an  expansion  about  the  known  analytic 
solution  at  D  =  2,  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  411-413,  1997 


Introduction 


h^D  I  D 

.(D)  =  _  _ 

2miD  +  2)\2Ko 


(1) 


There  is  a  good  deal  of  interest  currently  in  the 
role  played  by  electron  density  gradients  in 
density  functional  theory  [1].  The  purpose  of  the 
present  study  is  to  explicitly  demonstrate  that  the 
self-consistent  Thomas-Fermi  (TF)  atom  discussed 
in  D  dimensions  by  Kventsel  and  Katriel  [2,  3]  has 
the  following  property.  The  kinetic  energy  T,  of 
the  form  [4] 


where  the  constant  satisfies  the  recurrence 
relation 


K 


D+2 


2ttD' 


K, 


1 

277 


(2) 


has  a  functional  derivative  8T/8n  that  can  be 
written  either  in  the  conventional  "local"  form 
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gp(D)  /  2  \ 

_  =  (3, 

or  in  a  quite  equivalent  form  in  terms  solely  of 
n~^(dn/dr)  and  n~^(d^n/dr^). 
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Differential  Equation  for 
Ground-State  Density 


The  first  step  in  the  proof  of  the  above  state¬ 
ment  is  to  establish  the  differential  equation  satis¬ 
fied  by  the  ground-state  density  n(r)  in  D  dimen¬ 
sions,  The  starting  point  to  do  this  is  the  D-dimen¬ 
sional  Poisson  equation  [1,  2],  which  from  phase 
space  considerations  takes  the  form 


^2^  D  -  1 

- — 1_ - 

r  dr 


(4) 


Here  the  chemical  potential  jx,  constant  through¬ 
out  the  entire  electronic  charge  cloud  of  the  atom, 
and  the  self-consistent  potential  energy  V(r),  are 
related  to  ^  in  Eq.  (4)  by 


IJL-V  =  e'^ 

while  the  constant  Pp  is  given  by 


jSp  =  Srre 


2TTme\°^^  I  D 

r  1  +  — 


(5) 


(6) 


Explicitly,  for  a  point  charge  of  magnitude  Ze, 
4^(r)  behaves  as  r  -»  0  as 


'P(r) 


477  Ze 


(D  -  2)0,  r 


r^-^,  D>3  (7) 


with 


O 


D  /  D 

r  1  — 


(8) 


The  integral  of  n(r)  through  all  space  diverges 
for  D  >  4,  but  as  one  of  us  has  pointed  out  previ¬ 
ously  [3],  this  trouble  can  be  "cured"  by  introduc¬ 
ing  a  finite  nucleus. 

Evidently,  from  Eq.  (4),  the  electron  density 
n(r)  is  related  to  '^(r)  by  [4] 


n(r) 


qfD/2 

4:776 


(9) 


Writing  Eq.  (9)  as  "^(r)  a  n{rY^^,  one  can  sub¬ 
stitute  for  ^(r)  in  Eq.  (4)  to  obtain  the  differential 
equation  satisfied  by  the  electron  density  n(r)  in 
D  dimensions.  After  a  short  calculation,  this  can  be 


obtained  in  the  form 

'  1  d^n  D  -1  n  dn\  I  2  Ul  ^n\^' 

n  dr^  r  \n  dr )  \D  )\n  dr  j 

=  (10) 

This  generalizes  the  known  result  [5]  for  the 
original  TF  atom  with  D  =  3,  namely 


1 1 

Wn 

„l/3 

1 1 

'=4! 

-1 

n 

’  3' 

[  n  j  _ 

;  ' 

(11) 

since  from  Eq.  (6),  ^  (327r^e/3h^)i2mey^^ ,  to  D 

dimensions. 


Analytic  Solution  to  the  TF  Equation  in 
D  Dimensions 


Using  the  spatial  dimension  D  as  a  free  param¬ 
eter  has  been  widely  employed  to  obtain  an  ap¬ 
proximate  results  for  D  =  3  for  a  variety  of  physi¬ 
cal  problems  [6].  The  TF  equation  in  D  =  3  is  a 
nonlinear  equation.  But  the  case  D  =  2  from  Eq. 
(9)  yields  a  linear  equation  which  has  been  solved 
previously  in  terms  of  Bessel  functions  [2]. 

Recently,  Bender  et  al.  [7]  have  proposed  a  new 
approach  to  the  analytic  solution  of  a  variety  of 
nonlinear  problems.  Laurenzi  [8]  has  shown  that 
one  can  use  this  approach  to  obtain  an  analytic 
solution  for  the  TF  equation. 

In  this  section  we  would  like  to  show  that  one 
can  use  the  D-perturbation  expansion  method  to 
obtain  an  analytic  solution  for  the  D  dimensional 
TF  equation.  We  do  this  by  replacing  Eq.  (4)  with 


26  +  1 

_  _j_ _ = 

dx^  X  dx 


(12) 


where  x  =  ^l/^r  and  e  =  (D  -  2)/2  is  a  measure 
of  the  nonlinearity  of  the  TF  equation.  The  poten¬ 
tial  4^  is  then  expanded  in  a  power  series  in  e: 

^  %  -I-  +  e'^%  -t-  ••• .  (13) 


This,  in  turn,  produces  a  set  of  inhomogeneous 
equations  for  the  perturbation  functions 


dx^ 


1  'i'n 


(14) 
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where  the  first  few  inhomogeneous  terms  are  given 
by: 

A'o  =  0,  (15) 

(16) 

2  d%  1 

^2  =  %  ln%  -  -  —  +  %  +  -%  (17) 

The  zero-order  equation,  which  is  the  TF  equa¬ 
tion  at  D  =  2,  has  been  solved  previously  in  terms 
of  modified  Bessel  functions  1q(x)  and  Kq(x) 

%  =  a^JoCx)  +  a2Xo(x),  (18) 

where  the  constants  and  a2  can  be  fixed  by  the 
boundary  conditions  at  zero  and  infinity  [2].  By 
eliminating  the  first  derivative  d'^^^/dx  in  the  set 
of  linear  equations,  Eq.  (14),  one  can  use  the  Green's 
function  approach  to  construct  a  general  solution 
for  [8].  This  generalizes  to  D  dimensions  the 
previous  work  of  Laurenzi  [8]  for  the  three-dimen¬ 
sional  solutions  of  the  TF  equation. 


Functional  Derivative  of  Kinetic  Energy 
r  in  D  Dimensions 


Returning  to  Eq.  (3),  one  can  raise  Eq.  (9)  to  the 
power  2/(D  —  2)  to  obtain  a  different,  though,  of 
course,  equivalent,  form  of  the  functional  deriva¬ 
tive  8T^^^/dnir),  The  result  is 


57(D) 

8n(r) 


7(D) 


1 

n  dr^ 

2 
D 


+ 


D  "  1  \  1  dn 
n  dr 


1  dn 

lll“  — 

n  dr 


2/D-2 


,  (19) 


where 


1  + 


D 


.(D) 


2  ^2/(D-2) 

D 

X(477e)”'/^)3^/°<2-°>.  (20) 


This  is  the  main  result  of  the  present  work  and 
generalizes  to  D  dimensions  the  previous  work  of 
one  us  for  the  three-dimensional  TF  neutral  atom, 
namely 


57(3) 

8n{r) 


n 


1 

3 


(21) 


Conclusion 

The  conclusion  is  that  for  the  self-consistent  TF 
atom  in  D  dimensions,  the  functional  derivative  of 
the  kinetic  energy  T  can  be  expressed  in  terms  of 
"reduced"  density  gradient  terms  (l/n)(dn/dr) 
and  {l/n)(d^n/dr^).  The  local  form  (3)  and  the 
density  gradient  (nonlocal)  form  (21)  are,  of  course, 
entirely  equivalent.  This  is  because  of  the  differen¬ 
tial  Eq.  (10)  satisfied  by  the  self-consistent 
ground-state  electron  density  in  the  TF  atom  in  D 
dimensions. 
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The  detour  matrix  was  introduced  into  the 
mathematical  literature  in  1969  [1]  and  25 
years  later  into  the  chemical  literature  under  the 
name  of  the  maximum  path  matrix  of  a  molecular 
graph  by  Ivanciuc  and  Balaban  [2]. 

The  detour  matrix  can  be  used  to  compute  the 
so-called  detour  index  [3,  4]  in  the  same  way  as 
the  distance  matrix  [5]  can  be  employed  to  gener¬ 
ate  the  Wiener  index  [6].  The  detour  index,  which 
is  a  Wiener-like  index,  was  also  introduced  by 
Ivanciuc  and  Balaban  [2]  as  the  half-sum  of  the 
maximum  path  sums  and  independently  by  John 
[7]  and  us  [3].  Lukovits,  who  introduced  the  term 
''the  detour  index,"  studied  the  properties  of  this 
index  [4,  8].  He  was  also  first  to  use  this  index  in 
structure-property  modeling  [4].  The  present  re¬ 
port  was  stimulated  by  Lukovit's  remark  [8a]:  "The 
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usefulness  of  the  detour  index  is  deflated  hy  the  fact  that 
to  date  no  method  (but  inspection)  is  available  to 
compute  this  index."  Or,  in  other  words,  there  is  no 
method  available  to  compute  the  detour  matrix 
but  by  hand.  In  attempting  to  remedy  the  above, 
we  propose  here  a  computer  method  for  construct¬ 
ing  the  detour  matrix  for  graphs  of  moderate  sizes 
and  a  limited  number  of  cycles. 

The  detour  matrix  A  =  A(G)  of  a  labeled  con¬ 
nected  (molecular)  graph  G  is  a  real  symmetric 
N  X  N  matrix  whose  (/,  ;)-entry  is  the  length  of 
the  longest  path  [9a]  from  vertex  i  to  vertex  j: 


(A).7 


o-ij  if  i  #  j 
0  ili=j' 


(1) 


where  cr^  is  the  number  of  steps  in  a  longest  path 
(i.e.,  the  maximum  number  of  edges)  in  G  between 
vertices  i  and  j. 

This  definition  is  exactly  the  "opposite"  of  the 
definition  of  the  traditional  graph-theoretical  dis¬ 
tance  matrix,  whose  off-diagonal  entries  are  the 
lengths  of  the  shortest  paths  between  the  vertices 
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in  G  [5,  9b]: 


(D)f;  - 


if  i  ¥=  j 
if  i  -  j 


(2) 


where  is  the  number  of  steps  in  a  shortest  path 
(i.e.,  the  minimum  number  of  edges)  in  G  between 
vertices  i  and  j.  As  examples,  the  distance  and 
detour  matrices  for  a  simple  bicyclic  labeled  graph 
G  are  given  in  Figure  1. 

The  detour  index  II  is  defined  in  the  same  as  is 
the  Wiener  index  W  [10],  i.e.,  the  detour  index  is 
equal  to  the  half-sum  of  the  elements  of  the  detour 
matrix  A  [2-4]: 


a  =  (1/2)E  E(A).7  (3) 

'  ; 

W=  (1/2)E  E(D).7.  (4) 

i  j 

The  Wiener  index  and  the  detour  index  for  a 
bicyclic  graph  G  are  also  given  in  Figure  L 

It  is  obvious  from  (3)  and  (4)  that  the  Wiener 
index  and  the  detour  index  are  identical  for  acyclic 
structures.  For  polycyclic  structures,  they  are  not 
particularly  intercorrelated  indices.  For  example, 
the  linear  correlation  between  W  and  H  for  a  set  of 
37  diverse  polycyclic  structures  has  a  modest  cor¬ 
relation  coefficient  (r  =  0.79)  [3a]. 


The  computation  of  the  detour  matrix  is  a  non¬ 
trivial  problem  of  long  standing  [11].  We  propose  a 
procedure  for  computing  the  (molecular)  detour 
matrix  from  the  (molecular)  distance  matrix  since 
there  are  several  algorithms  available  in  the  litera¬ 
ture  for  computing  the  distance  matrix  [9b].  We 
will  also  use  the  adjacency  matrix  to  generate  the 
distance  matrix.  The  procedure  is  based  on  the 
computation  of  distance  matrices  for  sets  of  se¬ 
lected  subgraphs  of  a  considered  graph  G.  These 
subgraphs  are  created  from  G  by  removing  succes¬ 
sively  edges  whose  vertices  are  at  least  divalent 
until  the  set  of  spanning  trees  is  reached.  For  each 
subgraph,  the  distance  matrix  is  computed  and  is 
compared  to  the  starting  distance  matrix  corre¬ 
sponding  to  G.  If  any  of  the  distances  in  the 
newly  created  distance  matrix  is  larger  than  the 
corresponding  element  (d/Po  initial  distance 

matrix,  then  (rff^o  should  be  replaced  with 
The  final  distance  matrix,  obtained  after  the  con¬ 
sidered  set  of  subgraphs  is  inspected,  is  the  detour 
matrix.  After  the  detour  matrix  is  obtained,  the 
computation  of  the  detour  index  is  straightfor¬ 
ward. 

The  block  diagram  of  our  computational  proce¬ 
dure  is  given  in  Figure  2.  As  an  illustrative  exam¬ 
ple,  we  computed  the  detour  matrix  for  the  same 
bicyclic  graph  G  that  was  considered  before  in 
Figure  1.  This  computation  is  shown  in  Figure  3. 
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FIGURE  1 .  The  distance  matrix  (D)  and  the  detour  matrix  (A)  of  a  labeied  bicyclic  graph  G. 
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FIGURE  2.  A  block-diagram  of  the  computer  program. 
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1  2  3 


6  5  4 

G 

1 )  Generate  the  adjacency  matrix  Aq  and  calculate  distance  matrix  Do  of  graph  G. 
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2)  Find  edges  e(G)  with  both  vertices  at  least  divalent  and  copy  them  to  array  of  m-tuples  of 
edges  (m  =  1). 
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7  6 

edge-labels 


3)  For  every  m-tuple,  disconnect  edge(s)  In  the  m-tuple,  disregarding  graphs  that  contain 
disconnected  vertices  and  paths  of  length  one. 


m-tuple 

labels 


(1) 


(2) 


(5) 


m^  =  1  n^  (number  of  m-tuples)  =  7 


4)  From  m-tuples,  form  the  (m-i-1  )-tuples  by  adding  the  edges  (found  in  step  2)  with  the  edge 

serial  numbers  higher  than  the  serial  number  of  the  last  m-tuple  edge. 
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m2  (number  of  m+1 -tuples)  =  2  n2  (number  of  m-i-1 -tuples)  =  15 

(a) 

FIGURE  3.  Computation  of  the  detour  matrix  for  the  bicyciic  graph  G  using  the  proposed  method. 
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5)  Copy  (m+1  )-tuples  to  array  of  m-tuples,  and  go  to  the  step  3.  Continue  repeating  steps  3  and 
4  until  spanning  trees  disconnecting  level  is  achieved  (i.e,,  m+1  =  cyclomatic  number). 

In  the  present  example,  there  is  no  need  to  go  back  to  the  step  3. 


6)  Calculate  the  distance  matrix  D  for  every  subgraph  obtained  by  m+1  disconnections.  If  any 
of  the  distances  dy  is  larger  than  the  corresponding  element  in  Do,  replace  (djj)o  with  dy. 


7)  The  detour  matrix  A  is  equal  to  the  final  distance  matrix. 


A  = 


0  5  4  5  4  5 
5  0  5  4  3  4 

4  5  0  5  4  5 

5  4  5  0  5  4 

4  3  4  5  0  5 

5  4  5  4  5  0 


8) 


Calculate  the  detour  index, 
n  =  67 


(b) 


FIGURES.  (Continued). 


The  CPU  time  need  to  construct  the  detour  matrix 
for  G  in  Figure  3  was  1  s  on  the  workstation 
HP-735. 


ACKNOWLEDGMENT 

This  work  was  supported  by  the  Ministry  of 
Science  and  Technology  of  the  Republic  of  Croatia 
through  Grant  No.  00980606.  We  thank  the  referees 
for  constructive  comments. 


References 

1.  F.  Harary,  Graph  Theory  (Addison-Wesley,  Reading,  MA, 
1969),  p.  203. 

2.  O.  Ivanciuc  and  A.  T.  Balaban,  Math.  Chem.  (MATCH)  30, 
141  (1994). 

3.  (a)  D.  Amic  and  N.  Trinajstic,  Croat.  Chem.  Acta  68,  53 
(1995).  (b)  S.  Nikolic,  N.  Trinajstic,  A.  Juric,  and  Z,  Mihalic, 


Croat.  Chem.  Acta  69,  1577  (1996).  (c)  N.  Trinajstic,  S. 
Nikolic,  B.  Lucic,  D.  Amic,  and  Z.  Mihalic,  J.  Chem.  Inf. 
Comput.  Sci.  (in  press). 

4.  I.  Lukovits,  Croat.  Chem.  Acta  69,  873  (1996). 

5.  (a)  Z.  Mihalic,  D.  Veljan,  D.  Amic,  S.  Nikolic,  D.  Plavsic, 
and  N.  Trinajstic,  J.  Math.  Chem.  11,  223  (1992).  (b)  Z. 
Mihalic,  S.  Nikolic,  and  N.  Trinajstic,  J.  Chem.  Inf.  Comput. 
Sci.  32,  28  (1992). 

6.  (a)  H.  Hosoya,  Bull.  Chem.  Soc.  Jpn.  44,  2332  (1971).  (b)  S. 
Nikolic,  N.  Trinajstic,  and  Z.  Mihalic,  Croat.  Chem.  Acta  68, 
105  (1995). 

7.  P.  E.  John,  Math.  Chem.  (MATCH)  32,  207  (1995). 

8.  (a)  I.  Lukovits,  J.  Chem.  Inf.  Comput.  Sci.  36,  65  (1996).  (b) 
H.-Y.  Zhu,  D.  J.  Klein,  and  I.  Lukovits,  J.  Chem.  Inf.  Com¬ 
put.  Sci.  36,  420  (1996).  (c)  I.  Lukovits  and  M.  Razinger,  J. 
Chem.  Inf.  Comput.  Sci.  37,  283  (1997), 

9.  N.  Trinajstic,  Chemical  Graph  Theory,  2nd  ed.  (CRC  Press, 
Boca  Raton,  FL,  1992).  (a)  pp.  9-11;  (b)  pp.  52-57. 

10.  H.  Wiener,  J.  Am.  Chem.  Soc.  69,  17  (1948). 

11.  F.  Buckley  and  F.  Harary,  Distance  in  Graphs  (Addison-Wes- 
ley,  Reading,  MA,  1990),  pp.  213-214. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


419 


Many-Body  Valence-Bond  Theory 


D,  J.  KLEIN/  H.  ZHU/  R,  VALENTI/  M.  A.  GARCIA-BACH" 

^Texas  A&M  University-Galveston,  Galveston,  Texas  77553-1675 

^Sealy  Center  for  Biomolecular  Structure,  University  of  Texas  Medical  Branch,  Galveston,  Texas 

^Institut  fur  Physik,  Universitat  Dortmund,  44221  Dortmund,  Germany 

^Fisica  Fonomental,  Universitat  de  Barcelona,  Diagonal  647,  08028-Barcelona,  Spain 

Received  29  March  1997;  accepted  1  June  1997 


ABSTRACT:  The  classical  theory  of  chemical  valence,  first,  is  naturally  formalized  in 
mathematics  in  the  area  of  graph  theory  and,  second,  finds  an  extension  in  quantum 
mechanics  in  terms  of  the  Heitler-London-Pauling  "valence-bond"  (VB)  theory.  Thus, 
VB  theory  stands  in  a  fairly  unique  position,  although  in  quantum  chemistry,  there  often 
has  been  a  preference  for  the  alternative  (perhaps  even  "complementary")  molecular 
orbital  (MO)  theory,  presumably  in  large  part  because  of  computational  efficacy  for 
general  molecular  structures.  Indeed,  as  formulated  by  Pauling  and  others,  VB  theory 
was  described  as  a  configuration  interaction  (Cl)  problem  when  there  were  multiple 
relevant  classical  valence  structures  for  the  same  molecular  structure.  Also,  as  now 
recognized,  a  direct  assault  on  Cl  is  computationally  intensive,  prone  to  size-inconsistency 
problems,  and  effectively  limited  to  smaller  systems — whereas  indirect  approaches,  e.g., 
via  wave-function  cluster  expansions  or  renormalization-group  theory,  often  neatly  avoid 
these  problems.  Thus,  what  is  (and  perhaps  always  has  been)  needed  is  "many-body" 
schemes  for  VB  computations  (as  well  as  for  higher-order  MO-based  approaches,  too). 
Here,  then,  certain  such  many-body  VB-amenable  computational  schemes  are  to  be 
discussed,  in  the  context  of  semiempirical  (explicitly  correlated)  graphical  models.  The 
collection  of  models  are  described  and  interrelated  in  a  fairly  comprehensive  systematic 
manner.  A  selection  of  many-body  cluster  expansion  methods  are  then  discussed  with 
special  reference  to  resonating  VB  wave  functions  and  the  fundamental  graph-theoretic 
nature  of  the  consequent  problems  (such  as  also  are  noted  to  arise  in  lattice-discretized 
statistical-mechanical  problems,  too).  Some  examples  are  described  incorporating 
resonance  among  exponentially  great  numbers  of  VB  structures  as  applied:  for  large 
icosahedral-symmetry  fullerenic  structures,  for  the  (polyacetylenic)  linear  chain,  and  for 
ladderlike  conjugated  polymers.  It  is  contended  that  practicable  many-body  VB-theoretic 
methods  are  now  available,  retaining  clear  links  to  classical  chemical  valence  theory. 
Hopefully,  too,  these  methods  may  soon  find  use  beyond  the  semi-empirical  framework. 
©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  421-438,  1997 
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1.  Introductory  Survey 

The  idea  of  valence  structures  goes  back  to 
classical  chemistry.  Also,  indeed,  then  it  was 
at  the  heart  of  chemistry,  although  the  theory  was 
primitive  in  a  quantitative  mode.  There  are  both 
brief  surveys  [1]  encompassing  this  history,  as  well 
as  whole  books  on  the  subject  [2]. 

Soon  after  the  advent  of  quantum  mechanics, 
several  valence-bond  developments  were  made: 

■  First,  with  the  ab  initio  H2  computations  of 
Heitler  and  London  [3]  who  presented  a  fun¬ 
damental  quantum  mechanical  description 
of  the  covalent  bond; 

«  Second,  with  Rumer's  [4]  realization  of  a 
basis  of  such  covalent  electronic  configura¬ 
tions  in  suggestive  correspondence  with 
classical  chemical-bonding  structures; 

■  Third,  with  Pauling's  identification  [5]  of  ori¬ 
ented  hybrid  orbitals  as  precursors  for  bond 
orbitals  such  as  could  be  used  in  conjunction 
with  Rumer  basis  functions;  and 

■  Fourth,  with  Pauling  and  others'  develop¬ 
ment  [6]  of  such  Rumer-type  valence-bond 
bases  as  a  foundation  for  a  classically  con¬ 
nected  theory  of  molecular  electronic  struc¬ 
ture  with  extension  to  incorporate  resonance 
to  describe  delocalized  bonding. 

Thus,  a  quantum  mechanically  founded  va¬ 
lence-bond  (VB)  theory  had  auspicious  beginnings, 
although  as  reviewed  just  a  bit  more  completely 
elsewhere  [7,  8],  a  slow  period  of  development 
followed.  The  alternative  molecular  orbital  (MO) 
theory  began  development  roughly  at  the  same 
time  by  Hund,  Fiiickel,  and  Mulliken,  but  contin¬ 
ued  with  further  theoretical  developments  by 
Coulson  and  co-workers  while  a  significant  com¬ 
putational  advance  was  made  by  Roothaan  and 
Hall  in  the  form  of  a  discrete-orbital-basis 
self-consistent-field  theory.  Several  supposed  diffi¬ 
culties  with  VB,  both  computationally  and  concep¬ 
tually  (e.g.,  involving  nonorthogonality  and  size- 
consistency),  were  brought  to  attention.  Presum¬ 
ably,  largely  due  to  the  greater  computational  effi¬ 
cacy  of  MO  theory,  it  came  to  dominance — then 
further  various  semiempirically  formulated  MO- 
based  theorems  were  developed,  e.g.,  by  Longuet- 


Higgins  and  Dewar,  and,  finally,  the  MO-based 
Woodward-Hoffmann  rules  only  further  en¬ 
hanced  this  dominance  of  MO  theory.  The  fact  that 
VB  theory  related  so  nicely  in  a  conceptual  (but 
seemingly  noncomputational)  way  back  to  classi¬ 
cal  chemical-bonding  ideas  may  have  actually  dis¬ 
inclined  some  researchers  to  this  approach — it  be¬ 
ing  viewed  that  the  numerically  correct  answers 
should  come  out  of  "pure"  quantum  mechanics 
(i.e.,  solely  out  of  physics)  without  folding  in  any 
classical  chemical  concepts,  at  least  if  one  expends 
sufficient  computational  effort. 

Moreover,  in  physics,  many-body  methods  were 
developed  largely  in  terms  of  the  MO  (or  band- 
orbital-theoretic  or  momentum-space)  picture. 
Some  conceptual  problems  concerning  size- 
consistency  were  resolved  in  this  context  and  re¬ 
lated  to  powerful  computational  procedures: 
many-body  perturbation  theory,  Green's-function 
decoupling  techniques,  and  cluster-expanded  wave 
functions.  Also,  ultimately,  these  computational 
techniques  were  all  adapted  (in  the  1970s  and 
1980s)  for  extensive  ab  initio  molecular  computa¬ 
tions.  Also,  MO-related  density  functional  theory 
was  developed  in  a  powerful  computational  for¬ 
mat. 

Through  the  1950s,  1960s,  and  1970s,  there  were 
still  a  few  researchers  in  the  area  of  VB  theory 
(including  Coulson,  Daudel,  Gallup,  Goddard, 
Lowdin,  Matsen,  McConnell,  McWeeny,  Ooster- 
hoff,  and  Simonetta,  although  most  of  these  re¬ 
searchers  had  much  diversified  interests).  But  es¬ 
pecially  starting  in  the  1980s  (primarily  with  a  new 
group  of  researchers),  it  became  clear  that  VB 
theory  is  far  from  dead.  On  the  ab  initio  side,  there 
are  the  especially  notable  accurate  small-molecule 
computations  of  Gerratt  and  co-workers  [9, 10].  On 
a  more  theoretical  side,  Anderson  [11]  suggested 
that  a  VB  picture  may  be  crucial  to  understanding 
high-temperature  superconductivity,  so  that  now 
there  are  a  variety  of  formal  many-body  resonating 
VB  developments  made  by  physicists,  e.g.,  as  re¬ 
viewed  in  [12].  Also,  on  the  semiempirical  chemi¬ 
cal  side,  there  have  been  (and  are  reviewed  [13]) 
many  developments,  amusingly  often  linking  up 
with  the  ideas  now  developed  in  conjunction  with 
high-temperature  superconductivity. 

Now,  it  seems  that  there  are  many-body  tech¬ 
niques  which  are  fairly  widely  recognized  to  en¬ 
compass  either  MO-  or  VB-based  approaches.  One 
such  is  the  renormalization-group-theoretic  ideas, 
although  an  efficacious  general  computational 
scheme  for  general  molecular  structures  has  yet  to 
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have  reached  any  general  recognition.  Another 
powerful  computational  approach  is  via  the  Monte 
Carlo  statistical  techniques,  which,  again,  have  yet 
to  reach  any  general  utility,  although  in  the  hands 
of  experts,  there  are  some  high-accuracy  results  on 
selected  systems.  Also,  more  and  more,  it  seems 
that  techniques  earlier  thought  to  be  MQ-based 
can  be  formulated  as  well  in  a  VB-format. 

Here,  wave-function  cluster  expansions  devel¬ 
oped  in  the  context  of  semiempirical  VB-theoretic 
models  are  to  be  considered.  Notably,  it  is  be¬ 
lieved  that  the  semiempirical  context  is  not  neces¬ 
sary.  But  before  describing  the  different  types  of 
cluster  expansions  based  on  local  structures,  some 
further  general  theoretical  characterization  of  VB- 
theoretic  ideas  in  the  next  section  is  appropriate. 
Beyond  the  description  of  different  types  of  locally 
represented  cluster-expanded  wave  functions.  Sec¬ 
tion  3  goes  on  to  indicate  the  form  of  the  overlap 
and  Hamiltonian  matrix  elements,  as  generating 
functions  for  the  counting  of  certain  types  of  sub¬ 
graphs  of  the  parent  molecular  graph.  Section  4 
notes  certain  qualitative  features  of  the  different 
types  of  wave  functions,  with  attention  to  the 
structural  circumstances  for  best  applicability  and 
to  types  of  long-range  order  which  can  arise.  Fi¬ 
nally,  Section  5  reports  and  compares  numerical 
results  on  three  different  types  of  structures:  icosa- 
hedral-symmetry  fullerenes,  the  infinite  linear 
(poly acetylenic)  chain,  and  the  infinite  ladder. 


2.  VB  Theory:  Bases,  Models, 
and  Resonance 

As  the  first  step,  some  of  the  nomenclature  and 
framework  should  be  clarified.  A  VB  basis  consists 
of  configurations  of  spin-paired  localized  orbitals, 
at  least  for  the  overall  spin  singlet  case,  while  for 
higher  spins,  there  is  augmentation  by  unpaired 
spins.  Typical  VB  spin-pairing  diagrams  in  corre¬ 
spondence  with  the  basis  elements  for  the  case  of 
six  electrons  in  six  orbitals  (as  for  the  six  occupied 
atomic  7r-orbitals  of  benzene)  appear  as  in  Figure  1 
and  display  an  evident  close  analogy  to  the  pri- 


I 


FIGURE  1.  Five  independent  covalent  VB  structures 
for  six  orbitals.  These  VB  structures  are  also  the  Burner 
structures. 


mary  classical  valence  structures  for  benzene,  espe¬ 
cially  if  augmented  with  the  rest  of  the  spin¬ 
pairing  diagram  for  the  cr-orbitals.  Rumer  [4] 
focused  on  the  singlet  case  with  orbitals  singly 
occupied,  it  being  recognized  that  the  extension  to 
states  with  doubly  occupied  orbitals  is  straightfor¬ 
ward.  Also,  Pauling  [6]  pointed  out  early  the  ready 
extension  to  the  overall  spin  doublet  case,  and  the 
approach  taken  hinted  at  further  extension,  e.g.,  as 
done  in  Simonetta's  group  [14].  But  it  is  to  be 
emphasized  that  there  are  other  possibilities — an¬ 
ticipated  to  be  most  significant  when  the  VB  struc¬ 
tures  as  formalized  by  Rumer,  Weyl,  and  Teller 
[15]  do  not  correspond  so  closely  to  classical  VB 
structures.  I.e.,  this  Rumer  basis  is  built  in  terms  of 
pairings  around  a  cycle,  and  in  many  cases,  a 
chemical  structure  may  have  little  to  do  with  a 
simple  cycle.  More  important  from  a  chemical 
viewpoint  should  be  spin-pairing  patterns  involv¬ 
ing  pairing  between  nearest  pairs  of  localized  or¬ 
bitals,  independently  of  a  relation  to  the  formal 
basis  of  Rumer  et  al.  [15]. 

Granted  a  VB  basis,  a  semiempirical  model  rep¬ 
resented  on  such  a  basis  often  is  described  as  a 
"  VB"  model.  Of  course,  such  models  can  be  repre¬ 
sented  on  any  basis  spanning  the  same  space, 
although  the  semiempirical  integral  approxima¬ 
tions  are  (usually)  motivated  from  considerations 
in  terms  of  the  localized  (atomic)  orbitals  as  they 
appear  in  VB  basis  states.  Here,  a  VB  model  is 
taken  to  be  a  model  where  the  atomic  orbitals 
which  appear  are  viewed  to  be  non-orthogonal. 
One  such  circumstance  is  the  VB  model  of  Pauling 
and  Wheland  [16]  expressed  by  the  covalent  space 
of  configurations  built  from  singly  occupied  or¬ 
bitals.  Indeed,  the  model  so  developed  turns  out  to 
be  essentially  the  Heisenberg  spin  Hamiltonian, 
more  often  expressed  by  the  basis  of  what  is  essen¬ 
tially  Slater  determinants  of  atomic  orbitals,  and 
each  determinant  is  distinguished  solely  via  the 
spin  assignment.  Now,  most  commonly,  the  model 
is  re-represented  equivalently  to  act  just  on  spin 
space;  hence,  the  Hamiltonian  takes  the  general 
form 


(2.1) 

p 

where  are  spin  space  permutations  and  Jp 

are  parameters.  The  simplest  (and  usual)  approxi¬ 
mation  retains  just  the  terms  involving  a  transposi¬ 
tion  P  =  (ij)  exchanging  the  ith  and  jth  indices; 
hence,  the  exchange  parameter  can  [17]  be  approx- 
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imated  to  be  of  the  form 


=  {ES,j -  S,^.},  (2.2) 

with  E  =  (OIHIO),  S^j  -  <4)|(z/)|$>,  and  O  most 
simply  being  taken  as  a  product  of  nonorthogonal 
atomiclike  orbitals,  one  assigned  to  each  center. 
Further,  the  interaction  may  be  represented  in 
terms  of  ordinary  spin  operators: 

=  2Si  •  sj  +  1/2 

=  2sfs/  +  s+s/  +  srs+  +1/2.  (2.3) 

Then,  the  Hamiltonian  (with  a  further  shift  of  the  0 
of  energy)  is  written  as 

H=  (2.4) 

i^j 

which  is  the  more  commonly  presented  form; 
hence,  it  is  known  as  the  Heisenberg  spin  Hamilto¬ 
nian,  often  with  i  and  j  restricted  to  nearest  neigh¬ 
bors.  Evidently,  reference  to  Rumer  or  VB  bases  is 
now  quite  hidden,  and  many  other  bases  are  used. 
Bases  involving  "spin-waves"  are  not  uncommon 
in  physics,  but  working  with  representations  on 
such  alternative  bases  can  lead  to  nonlocally  inter¬ 
preted  wave  functions,  and  results  can  end  up 
being  rather  far  divorced  from  apparent  classical 
chemical  connection  and  interpretation. 

A  variety  of  VB  models  arise,  with  possibly  the 
most  natural  hierarchy  indicated  in  Figure  2.  There, 
also,  a  systematic  derivational  scheme  is  indi¬ 
cated,  based  on  successive  steps  which  are  either 
restrictions  to  ever  smaller  spaces  or  orthogonal- 
izations  of  natural  (initially  nonorthogonal)  many- 
particle  bases  for  these  spaces.  It  should  be  noted 
that  older  criticisms  concerning  non-orthogonality 
or  size-consistency  "catastrophes"  connected  with 
several  of  the  intermediate  models  are  largely  ir¬ 
relevant  or  misleading,  as  addressed  elsewhere 
[18]. 

Resonating  VB  theory  should  make  explicit  use 
of  VB  states,  especially  the  more  chemical  corre¬ 
sponding  to  the  more  local  spin-pairing  patterns. 
This  really  is  the  focus  of  this  review,  especially 
for  the  many-hody  case.  The  use  of  the  phrase  VB 
theory  (without  the  adjective  resonating)  should  be 
more  general,  but  it  might  best  be  taken  to  de¬ 
scribe  a  treatment  with  connection  back  to  the 
classical  chemical-bonding  ideas,  using  a  VB  basis 
adapted  to  the  molecular  structure  under  consider¬ 
ation,  although,  often,  researchers  refer  to  any- 


2"^  orthogonalization 


FIGURE  2.  A  hierarchy  of  VB-theoretIc  models. 


thing  using  the  Rumer  basis  as  "VB  theory,"  i.e., 
when  a  system  has  a  structure  little  related  to  a 
single  cycle,  then  the  standard  Rumer  basis  often 
does  not  include  many  locally  spin-paired  struc¬ 
tures.  The  Pauling-Wheland  resonance-theoretic  model 
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(of  Fig.  2)  is  obtained  as  the  restriction  of  the 
covalent-space  (Heisenberg)  VB  model  just  to  the 
subspace  of  Kekule  structures  which  are  those  VB 
structures  with  spin-pairings  solely  between  near¬ 
neighboring  pairs  of  sites.  The  molecular  structure 
is  conveniently  mathematically  encoded  in  terms 
of  a  molecular  graph  G  which  is  specified  by  the 
sites  (which  here  are  the  various  Ti-orbitals)  and  by 
the  set  of  pairs  of  neighbor  sites.  The  Kekule  struc¬ 
tures  are  nonorthogonal,  but  they  may  be  sub¬ 
jected  to  Lowdin  symmetric  orthogonalization  in  a 
systematic  fashion  [19]  to  yield  what  is  known  as 
the  Herndon-Simpson  model  [20],  which  most 
simply  takes  the  form 

Hhs=^iEA„  +  R2I:A^,  (2.5) 

a  P 

where  and  R2  are  parameters,  the  a-  and 
/3-sums  are  over  the  6-  and  10-cycles  of  G,  and 
and  are  operators  which  give  a  nonzero  result 
only  if  they  act  on  a  Kekule  structure  with  a 
pattern  of  alternating  tt  bonds  around  the  cycle  (a 
or  j8),  in  which  case  they  reverse  the  pattern  of 
alternation  around  that  cycle.  Then,  utilizing  a 
ground-state  wave-function  Ansatz  as  an  equally 
weighted  sum  of  all  Kekule  structures,  there  re¬ 
sults  [21]  a  resonance  energy  expressed  in  an  ele¬ 
gant  graph-theoretic  form,  which,  in  fact,  can  also 
be  motivated  [22]  from  purely  classical  chemical¬ 
bonding  ideas. 

Additional  models  are  also  indicated  in  Figure 
2.  Any  of  these  models  can  be  expressed  in  a 
graph-theoretic  form,  and  all  involve  explicit  elec¬ 
tron  correlation.  Also,  although  a  number  of 
many-body  solution  techniques  are  possible  as 
briefly  discussed  elsewhere  [23],  focus  here  is  to  be 
on  a  class  of  wave-function  cluster  expansions 
where  local  structures  are  manifest.  Indeed,  three 
types  of  such  wave-function  cluster  expansions  are 
here  described  in  some  detail. 


3.  Wave-function  Cluster  Expansions 

One  general  type  of  approach  to  the  solution  of 
such  models  is  via  the  localized  cluster  expansion 
of  system  wave  functions.  Now,  wave-function 
cluster  expansions  seem  (at  least  in  an  ab  initio 
framework)  most  commonly  to  be  carried  out  in 
the  coupled-cluster  format  [24]  based  on  a  single- 
determinantal  MO  reference,  although  it  is  under¬ 


stood  [25]  that  the  technique  can  be  applied  in  a 
local  format  based  on  a  reference  built  from  local¬ 
ized  orbitals.  Also,  such  localized-orbital  coupled- 
cluster  techniques  have  quite  successfully  been 
utilized  on  the  Heisenberg  model  (as  well  as  the 
Hubbard  model).  But,  especially  for  the  lower 
models  in  the  hierarchy  of  Figure  2,  this  has  not 
been  done,  and,  indeed,  it  is  often  not  so  clear 
what  role  such  orbitals  would  play.  Here,  then,  we 
consider  a  general  type  of  local  substructure  clus¬ 
ter  expansion  for  wave  functions.  Considerations 
here  are  limited  to  developments  for  the  Heisen¬ 
berg  spin  Hamiltonian  and  the  Herndon-Simpson 
Hamiltonian,  although  much  the  same  sort  of  ideas 
should  be  rather  much  more  generally  applicable. 
Even  within  the  stated  limitations,  there  are  differ¬ 
ent  types  of  local  cluster  expansions,  and  they  are 
here  discussed  separately,  but  the  three  discussed 
are  formulated  in  graph-theoretic  language. 

One  presumes  a  molecular  graph  G,  which  may 
be  thought  of  as  the  classical  molecular  connectiv¬ 
ity  pattern.  The  graph  G  is  presumed  to  be  con¬ 
nected  in  the  sense  that  there  exists  a  sequence  of 
edges  leading  from  any  site  to  any  other.  Further, 
often,  it  is  here  assumed  that  G  is  bipartite  in  the 
sense  that  the  set  of  sites  may  be  partitioned  into 
two  subsets  termed  starred  and  unstarred  subsets 
such  that  sites  of  one  subset  have  neighbors  solely 
in  the  other  subset.  For  instance,  all  the  classical 
benzenoids  form  bipartite  networks,  although  sev¬ 
eral  lattices  arising  in  solid-state  problems  are  also. 
It  is  to  be  emphasized  that  the  presumption  of  a 
graph  does  not  in  itself  engender  any  limita¬ 
tions — but,  rather,  it  guides  the  form  of  the  expan¬ 
sions,  which  when  truncated  in  low  order  engen¬ 
ders  an  approximation.  Different  types  of  sub¬ 
graphs  composed  from  subsets  of  sites  and  subsets 
of  edges  of  G  are  of  relevance,  and  often  these  can 
be  disconnected.  In  particular,  each  Kekule  struc¬ 
ture  K  is  represented  by  a  subgraph  where  the 
subset  of  sites  for  K  is  the  full  set  of  G  and  the 
subset  of  edges  of  K  is  to  be  such  that  every  site 
has  exactly  one  edge  incident  thereon 
in  K. 

3.1.  NEEL-STATE-BASED 

CLUSTER  EXPANSION 

The  first  type  of  wave-function  cluster  expan¬ 
sion  to  be  considered  is  the  most  nearly  conven¬ 
tional  and  has  at  least  in  its  simple  two-site  form 
been  proposed  several  times  [26]  independently 
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and  now  used  [27-30]  quite  successfully.  For  a 
bipartite  graph  G  for  the  Heisenberg  model,  this 
systematic  cluster  expansion  is  based  on  the  so- 
called  Neel  state  which  is  a  product  of  spins  with 
a  spins  on  the  starred  subset  of  sites  and  p  spins 
on  the  unstarred  subset  of  sites: 

|3>>  ^  OaCofliSf;).  (3.1.1) 

^  i 

Even  by  itself,  this  may  be  viewed  to  be  a  zero- 
order  estimate  for  the  ground  state  of  the  antiferro- 
magnetically  signed  Heisenberg  model.  But  build¬ 
ing  from  this,  a  cluster-expanded  wave  function 
may  be  considered  of  the  form 

|^>  =  exp^(r)|(l)>,  (3.1.2) 

where  the  subscript  U  indicates  a  slightly  modi¬ 
fied  exponential  function  to  be  explained  a  little 
later  and  T  is  an  excitation  operator  which  may  be 
written  as  a  sequence  of  terms  involving  sub¬ 
graphs  of  one-,  two-,  three-,  four-sites  and  beyond: 

T^=  Y.XiSf'  +  ZxijSfsf 

i  i-'j 

+  E  (^i/itsrsfs^"  +  ^,'(;)tsrsr) 

i~j~k 

+  E 

+  ^Wki sf^JSk  +  sfsf )  +  ...,  (3.1 .3) 

where  the  various  are  variational  parameters 
and  the  are  spin-lowering  or  spin-raising  opera¬ 
tors  s~  or  for  site  m,  i.e.,  the  change  the 
spin  occurring  in  the  Neel  state  O  at  site  m.  Also, 
note  that  within  the  three-  and  four-site  terms 
there  are  what  would  usually  be  called  pair  excita¬ 
tions  but  involve  pairs  of  sites  as  far  apart  as  those 
which  appear  in  the  ''proper"  three-  and  four-site 
terms  appearing  within  these  same  terms.  This 
may  be  recast  in  a  general  form  thusly: 

cG  y 

7’^- E  E^,.^nsr,  (3.1.4) 

y  ^  i 

where  the  first  sum  is  over  (connected)  subgraphs 
y  of  G,  the  second  sum  is  over  subsets  of  sites 
of  y  such  that  the  set  occurs  in  no  smaller 
subgraph,  and  the  product  is  over  the  sites  in 
Then,  the  wave  function  may  be  expanded  in  terms 
of  subgraphs  y  which  have  many  components 


disconnected  from  one  another,  it  now  being  un¬ 
derstood  that  the  subscript  U  on  the  exponential  in 
Eq.  (3.1.2)  indicates  the  deletion  of  any  terms  in  the 
Taylor  series  expansion  of  the  exponential  involv¬ 
ing  products  of  subgraph  terms  with  any  sites  in 
common.  Then,  the  resultant  "disentangled"  form 
for  the  wave  function  is 

G  y  ^ 

l^>  =  E  EX(7,^)nsrl4»>.  (3.1.5) 

7  ^ 

where  is  a  subset  of  the  sites  of  y  and  the 
coefficient  X(7,<5^)  breaks  up  into  a  product  of 
^  labeled  by  the  different  components  y  of  7 
and  by  the  sets  ^  of  sites  of  5^  which  occur  in  y: 

7  ^&y 

x(7,^)  =  n  n  (3.1.6) 

y  .5^ 

Really,  Eqs.  (3.1.5)  and  (3.1.6)  specify  the  present 
Ansatz  quite  completely,  the  form  of  Eqs. 
(3.1.1)-(3.1.4)  simply  being  more  like  the  usual 
manner  of  presentation  in  the  conventional  cou¬ 
pled-cluster  literature.  But,  a  further  important 
point  (which  much  simplifies  matters)  is  that  if  the 
net  number  of  spins  is  to  be  conserved  (as  for  the 
isotropic  Heisenberg  model)  then  the  excitations 
not  so  conserving  this  number  may  be  omitted, 
i.e.,  all  the  terms  involving  an  odd  number  of 
excitations  may  be  omitted,  and,  in  addition,  real¬ 
izing  that  the  graph  G  is  presumed  to  be  bipartite, 
terms  without  a  balance  of  starred  and  unstarred 
sites  [such  as  that  associated  with  may  be 

omitted.  Thus,  Eq.  (3.1.4)  reduces  to 

r=E^,;srsr+  E  ix,j„srsfs^sr 

+ ...,  (3.1.7) 

although,  again,  all  the  simplification  may  be  en¬ 
visaged  in  the  form  of  Eq.  (3.1.5).  Finally,  with 
symmetry,  many  of  the  x-parameters  may  be  iden¬ 
tified  as  equal.  With  all  sites  translationally  equiv¬ 
alent,  all  the  x^y  can  be  taken  as  the  same. 

3.2.  HULTHErV-KASTELEYN  AJXSATZ 

This  approach  originally  considered  by  Hulthen 
[31]  and  Kasteleyn  [32]  and  evidently  indepen¬ 
dently  by  Huse  and  Elser  [23]  may  be  viewed  to 
be  also  based  on  Neel-state-related  ideas.  Again, 
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special  note  of  neighbor  pairs  of  a  and  /3  spins  is 
made  as  they  occur  in  a  general  N-site  product, 

\(r(N))  =  \(T^(1)  0-2(2)  a^(3)  (3.2.1) 

of  such  sites  spins  =  a,  jS;  i.e.,  with  sCcrCN),  G) 
the  number  of  /3  spins  assigned  to  unstarred  sites 
in  a  (NX  the  Hulthen-Kasteleyn  cluster-expansion 
Ansatz  is 

Gr(N) 

(3.2.2) 

where  the  variational  ^-parameters  take  the  value 
1  if  <jj  aj.  The  phase  factor  here  is  convenient 
since  it  is  known  [34]  that  upon  inclusion  of  this  in 
the  product  basis  vectors  the  ground-state  wave 
function  (for  the  nearest-neighbor  Heisenberg  on  a 
bipartite  G)  appears  nodeless,  in  the  sense  that 
the  correct  expansion  coefficients  for  the 
(_l)s(o-{N),G)|^(jy)>  chosen  non-nega¬ 

tive.  If  all  the  neighbor  pairs  are  translationally 
equivalent  (while  also  there  is  spin-symmetry  un¬ 
der  the  interchange  of  a  and  jS),  then  all  the 
Xij(aa)  =  Xjy(/3j3)  =  x  for  i  —  ;.  Now,  if 
#(aj8;  o-(N),  G)  is  defined  as  the  number  of  a,  jS 
pairs  in  cr(N)  which  are  neighbors  in  G,  then  the 
ground-state  Ansatz  becomes 

O-(N) 

(3.2.3) 

Moreover,  one  can  imagine  a  way  to  generalize 
this  Ansatz  to  higher  orders,  with  variational  pa¬ 
rameters  Xy(a(y))  dependent  on  the  spin  configu¬ 
ration  a(y)  on  a  (cormected)  subgraph  of  type  7, 
i.e.,  we  let  #(cr(y),  y;  cr(N),  G)  denote  the  number 
of  subgraphs  of  type  y  in  G  such  that  the  spin 
configuration  cr(y)  appears  in  |cr(N)),  and  then 
we  take  the  general  ground  state  to  be  expressed 
as 

o-CN)  r 

(3.2.4) 

If  subgraphs  up  to  the  size  of  G  are  kept,  then  this 
includes  the  exact  wave-function.  In  fact,  in  the 


form  given,  there  is  an  overabundance  of  parame¬ 
ters — one  need  not  use  subgraphs  y  which  are 
included  in  any  other  retained  subgraphs.  (Or  if 
such  smaller  subgraphs  are  kept,  the  associated 
variational  parameters  can  be  chosen,  say,  to  take 
the  values  as  determined  when  there  were  no 
larger  subgraphs  included  in  the  earlier  truncated 
cluster  expansion.) 

3.3.  RESONATING  VB  CLUSTER  EXPANSION 

Neither  of  the  more  nearly  conventional 
cluster-expansion  Ansdtze  of  the  preceding  two 
sections  really  incorporate  VB-theoretic  ideas  for 
wave  functions,  but  this  now  is  to  be  done. 
Strangely,  though  the  most  closely  related  to  clas¬ 
sical  chemical  ideas,  it  seems  to  be  the  most  re¬ 
cently  considered  [29,  35“37].  This  VB-theoretic 
Ansatz  is  definable  in  terms  of  standard  singlet 
spin-pairing  functions 

©[/,/]  ^a(i)p(j)  -  p(i)a(j),  (3.3.1) 

Then,  presuming  that  G  has  at  least  a  nonzero 
number  of  fully  paired  Kekule  structures  K,  the 
lowest-order  resonating  VB  wave  function  is  taken 
to  be 

I^rVb)  =  Z)  (  n  ^n(K))  O  7/ (3.3.2) 

where  the  sum  is  over  Kekule  structures  of  G,  the 
two  products  are  over  the  starred  sites  of  G,  and 
i(K)  denotes  the  site  to  which  i  is  paired  in  K.  For 
a  higher-order  Ansatz,  one  considers  the  Rumer 
structures  R  with  spin-pairing  between  more  dis¬ 
tant  sites  but  always  such  that  starred  and  un¬ 
starred  sites  are  paired.  The  wave  function  makes 
use  of  the  idea  of  convex  subgraphs  y  of  G,  which 
is  such  that  for  every  pair  of  vertices  i  and  j  of  y 
every  shortest  path  of  G  between  i  and  j  is  also  in 
y.  Then,  the  general  resonating  VB  wave-function 
takes  the  form 

G  /RAG  \  ★ 

I^RVB>  ^  E  n  X  ln®[7.7(^)].  (3.3.3) 
R  \  y  I  j 

where  the  y-product  here  is  over  the  components 
of  the  smallest  (convex)  subgraph  of  G  such  that  R 
entails  the  pairing  of  no  sites  in  different  compo- 
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nents.  Of  course,  the  variational  parameters  are 
expected  to  become  ever  smaller  as  the  (con- 
nected)  subgraph  component  y  becomes  ever 
larger,  and  usually  one  will  truncate  the  Ansatz  to 
keep  just  smaller  size  y, 

3.4.  OTHER  POSSIBLE  CLUSTER-EXPANSION 

ANSATZE 

There  are  slightly  different  ways  to  set  up  each 
of  the  types  of  cluster  expansions  noted  in  the 
preceding  sections.  First,  for  the  Neel-state-based 
Ansatz,  one  need  not  utilize  the  '"unlinked'"  sub¬ 
script  U  on  the  exponential,  although,  then,  the 
"disentangling"  in  matrix  elements  as  in  the  next 
subsection  would  not  be  quite  so  complete  (and 
the  results  would  be  numerically  different  at  a 
given  order  of  truncation),  at  least  as  far  as  the 
higher-order  Ansatze  are  concerned.  Second,  for 
the  lattice-gas  Ansatz,  one  could  seek  to  utilize  the 
extra  freedom  in  the  Xy(a(y))  imposing  extra  con¬ 
ditions,  say  by  "factoring"  the  variational  coeffi¬ 
cients  to  pair  factors  Xijia^a^)  independently  of 
whether  i  and  ;  are  neighbors  or  not;  hence,  the 
overlap  reduces  to  the  partitions  function  for  an 
lattice-gas  model  solely  with  pair  interactions. 
Third,  for  the  reasoning  VB  model,  the  freedom 
connected  with  the  overcompleteness  of  the  full 
set  of  Rumer  structures  might  be  sought  to  be 
utilized,  say  again  by  a  similar  sort  of  factorization 
to  pair  terms.  Although  neither  of  these  pair  factor¬ 
ization  assumptions  is  known  to  include  the  exact 
ground  state,  there  is  some  empirical  evidence  [29, 
33]  of  very  high  accuracy. 

But  there  are  at  least  a  couple  of  other  some¬ 
what  different  sorts  of  possibilities  for  cluster- 
expansion  Ansatze,  although  seemingly  less  well 
developed.  One  "sequentially"  spin-adapted  clus¬ 
ter  expansion  is  related  to  the  Young- Yamanouchi 
basis  in  somewhat  the  same  sense  that  the  resonat¬ 
ing  VB  Ansatz  is  related  to  the  set  of  locally  paired 
VB  structures.  In  this  approach,  the  product  basis 
|o-(N)>  again  is  considered,  and  the  z-component 
of  spin  at  site  i  is  denoted  by  (=  +1/2  for 
cTj  =  a  and  =  —  1  /2  for  <j^  =  ^).  Also,  it  is  conve¬ 
nient  to  define 

i 

Znij  (3.4.1) 

and  we  let  denote  a  total  spin  (quantum  num¬ 
ber)  for  indices  1  ^  i.  Then,  in  a  lower-order  ap¬ 


proximation, 
l^seq>-  E  E  ( 

ct(N)  S(N)  I  '  =  1 

X(S,.M,.  X  (3.4.2) 

where  S(N)  =  (S^,  S2, . . .,  S^^),  the  x(Sj-,  S^^^)  are 
variational  parameters,  and  the  (S^M^-  X  cr^+i\ 
are  Clebsch-Gordon  (or  spin-coupling) 
coefficients.  More  generally,  in  place  of  the  two-site 
approximation  with  just  5^  + j),  one  would  (at 
the  k  +  1-site  approximation)  instead  have 
XiiSi,  Sj  +  2/  • » •  /  But,  further,  in  invoking 

an  approximation,  one  limits  the  values  which  the 
Sj  may  take — e.g.,  variational  x-parameters  with 
Sj  arguments  exceeding  some  selected  value  are 
taken  as  0.  Presumably,  though,  because  of  the 
manner  of  the  spin-coupling,  this  Ansatz  (in  its 
lower-order  forms)  is  really  only  well  adapted  to 
quasi-1 -dimensional  systems  (i.e.,  the  chemical 
graph-theoretic  circumstance  is  rather  special). 

Also,  finally,  a  cluster-expansion  on  a  "basis"  of 
cluster-expanded  wave  functions  might  be 
mentioned  as  a  possibility.  Granted  a  localized 
cluster-expansion  as  any  in  the  preceding  sections, 
the  local  x-parameters  can  be  viewed  as  separate 
variables  which  in  the  ground  state  for  translation- 
ally  symmetric  systems  fall  into  a  relatively  few 
equivalence  classes,  so  that  changing  one  at  a  time 
away  from  the  ground-state  value  to  give  a  result 
orthogonal  to  the  ground  state  yields  what  may  be 
viewed  as  candidate  excited  states  or,  alterna¬ 
tively,  as  new  basis  states.  But  in  the  local  picture, 
this  gives  ^  N  such  "single"  excitations  with 
changes  to  just  one  local  x-parameter,  and  one  can 
inquire  what  happens  when  two  or  more  such 
changes  are  made.  Of  course,  when  such  changes 
are  made  for  well-separated  regions,  additional 
"multiple"  excitations  arise  so  that  something  ap¬ 
proximating  a  new  cluster-expanded  basis  seems 
to  be  arising.  With  such  a  new  basis,  a  new  repre¬ 
sentation  for  the  Hamiltonian  can  be  made,  such 
new  representation  still  being  essentially  local,  and 
any  standard  many-body  solution  technique  might 
be  attempted  on  this  new  basis.  In  particular,  a 
new  cluster  expansion  might  be  made.  Sometimes, 
the  basis  functions  may  not  fully  span  the  original 
space,  as  is  the  case  with  the  Neel-state-based 
Ansatz  if  the  excitations  are  built  from  a  low-order 
primary  cluster  expansion,  although,  even  so,  the 
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secondary  cluster  expansion  can  provide  some  de¬ 
gree  of  improvement.  The  same  limitation  occurs 
with  a  low-order  resonating  VB  Ansatz,  but  no¬ 
tably  for  the  lattice-gas  Ansatz  for  a  linear  chain, 
the  new  basis  is  essentially  complete,  as  discussed 
elsewhere  [38]. 

3.5.  OVERLAP  MATRIX-ELEMENT 
COMBINATORICS 

Of  course,  the  manner  in  which  one  computes 
with  the  cluster-expanded  wave  functions  of  the 
preceding  sections  is  of  crucial  importance.  The 
most  standard  approach  at  least  for  the  more  con¬ 
ventional  MO-based  cluster  expansions  is  via 
Cizek's  coupled-cluster  method  [24].  Also,  indeed, 
this  can  conceivably  be  done  for  the  localized-site 
cluster  expansions  as  suggested  previously  [39] 
and  as  implemented  [25,  30]  in  practice  for  (what 
is  a  slightly  modified  version  of)  the  Neel-state- 
based  Ansatz  of  Section  3.1.  But,  in  fact,  at  least  for 
quasi-l-dimensional  (or  finite)  systems,  all  the 
Ansatze  of  the  preceding  subsections  can  be  treated 
in  a  purely  variational  manner,  which  emphasizes 
interesting  statistical  mechanical  analogies.  Hence, 
one  sees  clearly  possibilities  for  different  types  of 
long-range  orders  (as  discussed  in  a  following  sec¬ 
tion) — and  there  are  possibilities  for  novel  types  of 
phase  transitions. 

Here,  the  variational  combinatorial  form  might 
be  briefly  considered  for  the  Neel-state-based 
Ansatz  of  3.1.  Most  simply,  restriction  might  be 
made  to  just  the  two-site  Ansatz,  for  which  the 
overlap  matrix  element  takes  the  form 

=  i:eeex(7.^)* 

7 

X  <<&i(nsr)^nsfi^>x(7',^') 

^  i  ^  j 

=  E  E  EX(7,^)*X(7',^).  (3.5.1) 

7^7' 

This  may  be  refined  further  in  a  general  form  in 
terms  of  "superposition"  graphs  (composed  from 
pairs  7  and  7O.  But  as  an  introduction,  the 
graph-theoretic  elegance  might  be  illustrated  with 
consideration  of  the  case  with  just  nearest-neigh¬ 
bor  symmetry-equivalent  pair  excitations.  For  this 
circumstance,  then,  the  overlap  may  be  repre¬ 
sented  as  a  sum  over  terms  associated  to  different 
subgraphs  of  G,  each  subgraph  corresponding  to 


suitable  products  of  the  corresponding  to  the 
edges  of  the  subgraph.  Each  site-excitation  in  the 
ket  needs  to  be  matched  by  one  in  the  bra,  so  that 
two  types  of  connected  components  can  arise  in 
these  subgraphs:  even  cycles  with  pair  excitations 
from  ket  and  bra  alternating  around  the  cycle  and 
isolated  edges  where  a  pair  excitation  in  the  ket 
and  one  in  the  bra  exactly  match.  Hence,  the  over¬ 
lap  is  a  weighted  sum  over  such  sesquivalent  sub¬ 
graphs  S,  and  the  weights  are  a  product  over 
weights  for  each  component:  2x^  for  an  even 
M-cycle  of  S,  with  the  2  arising  because  there  are 
two  ways  to  have  alternation  of  the  pair  excita¬ 
tions  around  the  cycle  between  ket  and  bra,  and 
for  an  isolated  edge  of  S.  Then, 

G 

(ipiqr)  =  Y,r{S)^eiS)+KS)^  (3  5  2) 

s 

where  the  sum  is  over  sesquivalent  subgraphs  of 
G,  c(S)  is  the  number  of  cycles  in  S,  e{S)  is  the 
total  number  of  edges  in  S,  and  i(S)  is  the  number 
of  isolated  edges  in  S.  This  may  be  expressed  [40] 
in  terms  of  the  permanent  of  a  weighted  molecular 
graph  G: 

<qr|qr>  =  per{l  -h  xA},  (3.5.3) 

where  A  is  the  adjacency  matrix  of  G.  Graph 
theoretically,  this  is  a  type  of  subgraph-generating 
function  associated  to  the  parent  graph  G — the 
parameter  x  appears  as  the  dummy  variable  in 
this  generating  function.  In  a  statistical  mechanical 
context,  the  mathematical  form  of  is  the 

same  as  that  of  a  partition  function  for  allowed 
subgraph  conformations,  with  x  corresponding  to 
a  statistical  mechanical  activity. 

In  fact,  the  graph-theoretic  and  statistical  forms 
of  appearing  like  that  of  a  generating  func¬ 

tion  or  a  partition  function  persists  to  higher-order 
or  alternative  cluster-expansion  wave-function 
Ansatze.  The  simple  resonating  VB  Ansatz  which 
involves  just  Kekule  structures  leads  [36,  37]  to  an 
overlap  given  as  a  different  sum  over  spanning 
sesquivalent  graphs.  The  simple  Ansatz  for  the 
conjugated-circuit  case  yields  an  overlap  which  is 
just  a  count  of  the  Kekule  structures.  For  the  near¬ 
est-neighbor  Hulthen-Kastelyen  Ansatz,  the  over¬ 
lap  becomes  [35]  essentially  the  Ising  model,  and 
higher-order  such  Ansatze  are  just  elaborated 
"lattice-gas"  models. 
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3.6.  HAMILTONIAN  MATRIX-ELEMENT 
COMBINATORICS 

To  obtain  the  energy,  one  considers  the  Hamil¬ 
tonian  matrix  element.  For  the  Heisenberg  spin 
Hamiltonian,  it  is  a  sum  over  matrix  elements  of 
pair  interactions  for  neighbor  sites  i  ^  j. 

The  Hamiltonian  matrix  element  may  then  be  de¬ 
veloped  in  a  diagrammatic  form  with  a  special 
type  of  edge  (perhaps  represented  by  a  wiggly  line 
segment)  for  each  pair  interaction.  For  the  nearest- 
neighbor  Heisenberg  model  utilizing  the  simplest 
Neel-state  based  Ansatz,  this,  then,  involves  just 
replacing  one  of  the  x-parameters  by  an  exchange 
parameter  from  the  Hamiltonian;  hence,  one  ulti¬ 
mately  finds  [40] 

(3.6.1) 

But  now  if  we  hark  back  to  the  permenental  for¬ 
mula  for  one  can  obtain  a  formula  in  terms 

of  a  sum  of  ratios  of  permanental  minors  to  the 
full  permanent.  In  statistical  mechanical  contexts, 
just  such  ratios  appear  as  logarithmic  derivatives 
(with  respect  to  activities).  In  a  graph  theoretic 
context,  these  ratios  appear  in  average  (weighted) 
subgraph  counts.  Amusingly,  also,  such  a  ratio  is 
much  reminiscent  of  those  that  appear  in  bosonic 
SCF  problems. 

But  such  graph-theoretic  and  statistical  mechan¬ 
ical  analogies  apply  for  both  higher-order  Ansdtze 
and  for  the  other  cluster-expansion  Ansdtze  earlier 
discussed.  Of  course,  the  higher-order  Ansdtze  be¬ 
come  much  more  elaborate  with  many  addi¬ 
tional  '"activities."  For  the  nearest-neighbor 
Hulthen-Kastelyen  Ansatz,  the  Hamiltonian  ma¬ 
trix  appears  as  a  type  of  a  derivative  of  the  parti¬ 
tion  function  (or  overlap)  of  the  Ising  model,  and 
higher-order  such  Ansdtze  similarly  involve  just 


elaborated  "lattice-gas"  models.  For  the  nearest- 
neighbor  resonating  VB  Ansatz  "derivatives"  of 
the  overlap  subgraph,  generating  polynomial  arise 
[36].  In  general  [39],  the  analogies  arising  in  these 
problems  for  the  overlap  matrix  elements  and  for 
the  Hamiltonian  expectation  values  may  be  sum¬ 
marized  as  in  Figure  3.  Rather,  generally,  both 
overlaps  and  partition  functions  represent  sums 
over  relative  probabilities  which  tend  to  correlate 
only  nearer  sites,  and  the  Hamiltonian  expecta¬ 
tions  should  involve  much  the  same  probability 
structure  modified  in  some  manner  to  give  a  size- 
extensive  result,  as  is  done  by  a  logarithmic 
derivative. 

The  analogies  of  Figure  3,  of  course,  mean  that 
methods  used  to  solve  one  problem  can  be  utilized 
in  solving  another.  Thus,  e.g.,  the  statistical  me¬ 
chanical  transfer-matrix  method  [41,  42]  to  deal 
with  partition  functions  and  associated  expectation 
values  can  be  utilized  for  the  wave-function  prob¬ 
lems,  too — quite  generally  in  a  computationally 
amenable  fashion  for  quasi-l-dimensional  prob¬ 
lems  [43].  Or  the  Metropolis  Monte  Carlo  method 
[44]  can  be  neatly  translated  [29,  33]  to  the  varia¬ 
tional  wave-function-based  sort  of  problem  indi¬ 
cated  here.  But,  besides  computational  methods, 
there  are  conceptual  many-body  features,  such  as 
long-range  order  and  phase  transitions. 


4.  Qualitative  Results 

Results  for  the  various  cluster  expansion  Ansdtze 
of  the  preceding  sections  might  be  compared  to 
one  another  and  several  other  approximation 
schemes  in  some  global  way.  But,  too,  the  Neel- 
based  and  resonating  VB~theoretic  approaches  may 
be  understood  to  apply  for  different  structural 


{overlaps} 


f  subgraph-generatingl 
\  polynomial  J 


(partition  functions}  =  Z 


<T|H1Y) 


(wave-  1  r  subgraph  "j 

function  V  < — ^  counting  >  ■<— > 

e5q)ectations  j  t  averages  ) 


statistical 

mechanical 

expectations 


Z 


FIGURE  3.  A  correspondence  among  quantities:  first,  for  ciuster-expanded  wave  functions:  second,  for  graph-theoretic 
generating  functions;  and,  third,  for  statistical  mechanics. 
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circumstances.  This  was  perhaps  first  suggested  in 
the  context  of  a  particular  case  (namely,  the  trian¬ 
gular  lattice)  by  Anderson  and  Fazekas  [45],  but 
the  argument  can  be  made  [46]  much  more  gener¬ 
ally,  i.e.,  one  may  compare  the  (0-order)  energies 
for  a  simple  Neel-state  and  a  single  Kekule  struc¬ 
ture.  The  energy  per  site  for  a  Neel  state  on  a 
bipartite  graph  is  readily  obtained  as 

e^^KNeel)  =  -Jz/^,  (4.1) 

where  z  is  the  mean  number  of  neighbor  sites  (i.e., 
the  mean  coordination  number).  The  energy  per 
site  for  a  single  Kekule  structure  (on  a  graph 
which  admits  at  least  one  Kekule  structure)  is 
readily  obtained  as 

£r(o>(Kekule)  =  -/3/4.  (4.2) 

Hence,  one  can  summarize  the  situation  in  the  plot 
of  Figure  4,  where  also  is  shown  the  results  of 
further  refinement  of  the  resonance-theoretic  ap¬ 
proach.  For  the  Kekule-structure-based  approach, 
a  resonating  VB  result  based  on  an  equally 
weighted  combination  of  all  Kekule  structures  as 
applied  [36]  to  a  special  sequence  of  benzenoid 
polymer  strips  (of  varying  mean  coordination 


energy  per  site 


FIGURE  4.  Plots  of  Neel-state  energy  per  site  and 
Kekule-structure  energy  per  site  for  the  nearest-neighbor 
Heisenberg  (or  VB)  model  as  a  function  of  the  mean 
coordination  number  z.  The  Neel-state  result  is  for 
bipartite  graphs,  while  that  for  a  Kekule  structure 
presumes  that  the  graph  admits  at  least  one  Kekule 
structure. 


number)  is  included  in  the  plot.  As  a  result,  one 
sees  that 

■  Resonance-theoretic  descriptions  are  most 
appropriate  for  (benzenoid  or  aromatic)  sys¬ 
tems  with  low  mean  coordination  number 
(z  <  3)  and  with  many  Kekule  structures 
per  site. 

■  Neel-state-based  descriptions  are  most  ap¬ 
propriate  for  (3-dimensional)  structures  with 
high  mean  coordination  numbers  and  low 
extents  of  frustration  (i.e.,  with  few  small 
odd  cycles  of  antiferromagnetically-signed 
interactions). 

Indeed,  these  conclusions  are  quite  naturally  sup¬ 
ported  by  the  experimental  successes  of 
resonance-theoretic  ideas  in  organic  chemistry  in 
application  to  the  types  of  systems  of  the  first 
proposition,  and  of  Neel-state-based  ideas  in 
solid-state  magnetic  materials  science  in  applica¬ 
tion  to  the  types  of  systems  of  the  second  proposi¬ 
tion. 

Another  distinction  between  resonance-theoretic 
and  Neel-state-based  approaches  is  that  each  seem 
to  exhibit  a  qualitative  difference  as  regards  long- 
range  ordering.  Indeed,  the  Neel-state-based 
Ansatz  involves  the  '"classical"  type  of  Neel  anti¬ 
ferromagnetic  ordering  with  order  parameter 

ntU,j)  =  (  +  )({s?  -  <s?>}{s?  -  <s/>)) 

as  dii,  j)  00,  (4.3) 

where  the  ±  sign  applies  as  the  sites  are  or  are  not 
of  the  same  type  (starred  or  unstarred),  d(/,  /)  is 
the  distance  between  sites  i  and  and  the  expec¬ 
tations  <sf  >  and  <sp  can  be  anticipated  to  be  0  (in 
the  absence  of  a  magnetic  field).  Many  of  the 
conventional  many-body  techniques  (Green's  func¬ 
tions,  spin  waves,  Neel-state-based  many-body 
perturbation  theory,  etc.)  lead  to  this  type  of  long- 
range  ordering,  although  each  different  scheme 
can  lead  to  different  nonzero  values.  But  the  res¬ 
onating  VB-theoretic  approaches  turn  out  to  lead 
[29,  34,  47]  to  another  type  of  long-range  spin-pair¬ 
ing  order  involving  a  spin-pairing  correlation  func¬ 
tion  for  pairs  of  bonds,  say  e  and  /, 

Me,  f)  =(  {s,(e,  •  Sb,e)  -  p(e)}{Sa(f)  '  Sb(f)  -p(/)})  / 

(4.4) 
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where  a(e)  and  b(e)  are  the  sites  at  the  two  ends  of 
a  bond  e,  and  p(e)  is  the  pairing  expectation 
(s^Cg)  •  Then,  this  correlation  function  can 

approach  different  limits  for  distant  bonds,  de¬ 
pending  on  the  equivalence  classes  of  the  two 
bonds  e  and  /.  Indeed,  this  type  of  long-range 
order  can  be  readily  appreciated  from  considera¬ 
tion  of  classical  neighbor  VB-bonding  diagrams  or 
Kekule  structures.  For  example,  consider  the  two 
Kekule  structures  shown  in  Figure  5  for  a  polyan- 
thracenoid  polymer.  In  either  case,  there  are  many 
other  related  Kekule  structures  that  can  be  drawn 
by  rearranging  a  few  double  bonds  independently 
within  each  anthracene  unit,  while  the  bond  inter¬ 
connecting  anthracene  units  must  remain  always 
single  or  always  double,  i.e.,  if  for  one  Kekule 
structure  the  bond  order  for  one  of  these  intercon¬ 
necting  bonds  is  known,  then  it  is  known  for  all 
such  interconnecting  bonds  (even  those  in  the 
long-range  limit).  Also,  of  course,  the  Kekule 
structures  in  different  bond-order  classes  are  dif¬ 
ferent  in  every  monomer  unit  so  that  in  the  long- 
polymer  limit  they  are  noninteracting  (via  any 
finite-particle  interaction);  hence,  each  class  gives 
rise  to  its  own  ground  state.  Finally,  this  long-range 
order  persists  with  the  inclusion  of  VB  structures 
beyond  Kekule  structures  so  long  as  the  range  of 
the  spin-pairings  is  limited  (or  even  if  it  is  not 
limited  but  the  contributions  of  longer  spin-paired 
structures  falls  off  sufficiently  fast,  as  one  might 
expect  if  the  resonance-theoretic  description  is  to 


FIGURES.  Two  Kekule  structures  for  a  polymeric  string 
of  adjoined  anthracene  units.  Note  that  with  the  first 
bond  between  adjoining  anthracenes  being  single  all  the 
remaining  interunit  bonds  are  single,  whereas  with  the 
first  bond  between  adjoining  anthracenes  being  double, 
all  the  remaining  adjoining  bonds  are  double. 


be  reasonable).  For  polymers  with  multiple  inter¬ 
connections  between  monomer  units,  the  bond  lo¬ 
calization  often  is  not  so  severe  but  still  occurs  to 
some  degree  differently  for  different  classes  of 
Kekule  structures.  For  example,  consider  the  two 
Kekule  structures  of  the  polynaphthalenoid  poly¬ 
mer  of  Figure  6,  where  the  Kekule  structures  are 
representatives  from  two  classes  with  either  0  or  1 
of  the  inter-connecting  bonds  between  naphthalene 
units  always  being  double.  (A  third  class  with  two 
such  bonds  always  double  contains  but  a  single 
Kekule  structure.)  Further,  these  different  classes 
with  associated  ground  states  lead  to  a  considera¬ 
tion  of  the  possibility  of  solitonic  excitations,  as 
discussed  elsewhere  [48]. 

Finally,  it  may  be  mentioned  that  there  are  a 
number  of  rigorously  proved  [49]  theorems  con¬ 
cerning  ground-state  symmetries  for  the  nearest- 
neighbor  Heisenberg  for  a  bipartite  graph.  Also, 
for  this  case,  there  are  theorems  [50]  concerning 
long-range  antiferromagnetic  ordering  at  finite 
(nonzero)  temperatures. 


5.  Example  Quantitative  Results 

Results  for  the  various  cluster  expansion  Ansatze 
of  earlier  sections  might  be  illustrated  for  a  few 
representative  cases.  When  possible,  comparison  of 
different  approximation  schemes  might  be  made. 
The  first  example  cases  here  though  will  be  for  the 
conjugated-circuits  model  which  is  believed  to  be 
quite  accurately  solved  by  the  uniformly  weighted 
RVB  cluster  Ansatz,  The  second  and  third  example 
cases  return  to  the  consideration  of  the  nearest- 
neighbor  Heisenberg  model. 


FIGURE  6.  Two  Kekule  structures  for  a  doubly  adjoined 
ladderlike  chain  of  naphthalene  units.  Note  the  requisite 
long-range  ordering  manifested  In  the  number  of  double 
bonds  that  occur  between  successive  naphthalene  units. 
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A  general  class  of  interesting  molecules  is  that 
of  the  fullerenes  of  icosahedral  symmetry,  the 
member  being  the  "uniquely  elegant"  truncated 
icosahedron  structure  [51],  for  whose  discovery  the 
1996  Nobel  prize  in  chemistry  was  awarded.  This 
high-symmetry  class  of  fullerenes  has  now  been 
studied  for  almost  a  decade  [52,  53],  and  a  great 
deal  of  effort  has  been  directed  [54]  to  solutions  of 
the  Hiickel  model,  although  more  complete  quan¬ 
tum  chemical  models  have  also  been  utilized  [55]. 
The  study  via  resonance-theoretic  models  is  rather 
meager,  in  part,  because  the  phases  of  the  Kekule 
structures  are  not  so  readily  specified  as  for  the 
bipartite  case  (with  starred  and  unstarred  sites). 
But  the  conjugated-circuits  model  can  be  applied 
in  a  rather  empirical  mode  (where  such  phase 
problems  are  simply  presumed  not  to  be  explicitly 
manifested),  and  the  six  structures  with  up  through 
240  sites  have  been  treated  [52]  by  the  conjugated- 
circuits  model.  But,  in  fact,  the  powerful  computa¬ 
tional  procedure  of  [56]  is  quite  readily  applicable 
to  much  larger  systems  (so  long  as  one  remains 
within  the  conjugated-circuit  framework,  with  or¬ 
thogonal  Kekule  structures).  Hence,  in  Table  I,  we 
here  report  the  results  for  all  the  20  icosahedral- 
symmetry  fullerene  structures  of  up  to  980  sites 
(so  long  as  they  have  isolated  pentagons,  which 
seems  to  be  a  reasonable  criterion  for  stability  of 
fullerenes  [57]).  Such  icosahedral-symmetry 
fullerene  cages  are  uniquely  identified  [52-54,  58] 
by  a  pair  of  numbers  h  and  k,  which  characterize 
an  equilateral-triangular-network  fragment,  20 
copies  of  which  when  appropriately  joined  to¬ 
gether  may  be  viewed  to  compose  the  whole  cage. 
Such  a  triangular  fragment  may  be  obtained  by 
cutting  from  the  graphite  honeycomb  network  an 
equiliateral  triangle  with  apices  in  the  centers  of 
three  hexagons — the  numbers  h  and  k  represent 
the  coordinates  of  one  apex  relative  to  another  in  a 
skew  coordinate  system  as  indicated  in  Figure  7, 
where  also  the  unit  of  distance  is  the  (shortest) 
distance  across  a  hexagon.  Once  one  has  20  copies 
of  one  of  these  equilateral  triangular  fragments, 
"gluing"  each  to  one  of  the  same-sized  faces  of  a 
regular  icosahedron  yields  the  fullerene  cage,  with 
one  5-membered  ring  appearing  at  each  of  the 
apices  of  the  underlying  icosahedron.  The  cages 
are  characterized  by  0  <  fc  <  /z  >  0;  they  have  N 
=  20(h^  hk  k^)  sites,  and,  evidently,  the  sym- 


FIGURE  7.  An  illustration  for  (h,  k)  =  (3, 1 )  of  the 
associated  equilateral  triangle  to  be  cut  fronn  the  graphite 
lattice  graph,  whereafter  20  copies  of  such  triangles  may 
be  fused  together  on  the  surface  of  an  icosahedron  with 
faces  of  the  same  size  so  as  to  yield  an  Icosahedral- 
symmetry  fullerene. 


metry  of  the  resulting  graph  is  if  h  ^  k  or 
k  =  0,  while,  otherwise,  it  is  of  symmetry  One 
may  observe  that  the  results  of  the  Table  I  reveal 
that  the  total  Kekule-structure  counts  increase 
rather  dramatically  with  cage  size,  but  a  suitable 
graph-theoretical  computational  method  [57]  is  still 
able  to  handle  the  results  for  the  simple  wave- 
function  cluster-expansion.  Although  the  total 
Kekule-structure  count  evidently  diverges  expo¬ 
nentially  fast  with  cage  size,  the  Kekule  structure 
counts  per  site  (  defined  as  where  K  is  the 

total  count)  rather  rapidly  approach  a  finite  limit, 
which  must  be  that  for  graphite.  A  similar  slightly 
more  slowly  converging  behavior  is  seen  in  Table  I 
to  apply  for  the  conjugated-circuit  resonance  en- 
ergy  per  site.  Notably,  though,  the  cages  with  h~k 
not  divisible  by  3  exhibit  [53,  54,  59]  a 
HOMO-LUMO  gap  of  0,  so  that  they  should  be 
reactive  (perhaps  near  the  12  apices  of  the  underly¬ 
ing  icosahedron,  where  the  structures  are  least  like 
graphite).  Notably,  this  partitioning  of  the  cages  in 
accordance  with  the  divisibility  of  h-k  by  3  is 
manifested  in  terms  of  the  per-site  conjugated-cir¬ 
cuit  resonance  energies  and  even  (to  a  lesser  extent 
though)  in  terms  of  the  per-site  Kekule-structure 
counts,  i.e.,  if  h-k  is  divisible  by  3,  then  the  reso¬ 
nance  energy  (per  site)  is  somewhat  less  stabilizing 
than  usual  otherwise  (and  a  related  behavior  can 
be  discerned  for  Finally,  it  may  be  noted 

that  the  N  =  240  result  in  this  table  corrects  slightly 
an  earlier  reported  result  [52]. 
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TABLE  I 


Kekul^-structure  counts  and  conjugated-circuit  resonance  energies  for  icosahedral  fullerenes. 


N 

{h,  k) 

K 

KVN 

Erss/NR, 

60 

(1,1) 

12,500 

1,17026 

0.14271 

80 

(2,0) 

140,625 

1.15971 

0.09717 

140 

(2,1) 

2,178,836,352 

1.16601 

0.15262 

180 

(3,0) 

1,389,029,765,625 

1.16804 

0.17977 

240 

(2,2) 

21,587,074,966,666,816 

1.16966 

0.18490 

260 

(3,1) 

249,276,672,312,858,144 

1.16656 

0.16632 

320 

(4,0) 

4,742,708,922,01 9,21 6,41 6,528 

1.16879 

0.17816 

380 

(3,2) 

63,052,896,815,923,948,794,000,000 

1.16921 

0.17976 

420 

(4,1) 

55,853,91 0,21 3,1 01 ,1 70,303,889,31 0,1 04 

1.17070 

0.19122 

500 

(5,0) 

1 .0598455748691 71 55901  X  1 0®“* 

1.16964 

0.18198 

540 

(3,3) 

1.31669745812355346381  X  10®^ 

1.17149 

0.19307 

560 

(4,2) 

1.65980752921912098393  X  10®® 

1.17017 

0.18432 

620 

(5,1) 

3.05480054527886621321  X  10'*® 

1.17091 

0.18766 

720 

(6,0) 

4.24881220361973614246  X  10'*® 

1.17200 

0.19477 

740 

(4,3) 

6.53828524614458400211  X  10®° 

1.17130 

0.18886 

780 

(5,2) 

6.64523410371069994773  X  10®® 

1.17220 

0.19513 

860 

(6,1) 

1.30985649576953680262  X  10®° 

1.17150 

0.18940 

960 

(4,4) 

2.49942403779921924432  X  10®° 

1.17264 

0.19596 

980 

(5,3) 

3.29588295422837584208  X  10°^ 

1.17191 

0.19093 

980 

(7,0) 

3.84734704509710763917  X  10®’’ 

1.17210 

0.19188 

00 

(oo,  k) 

— 

1.17531 

0.19962 

5.2.  THE  HEISEfVBERG  MODEL  FOR  AN 
INFINITE  CHAIN 

One  natural  (and  oft  initially  considered)  sys¬ 
tem  on  which  to  make  a  comparison  of  various 
strong  correlation  approximation  methods  is  the 
simple  case  of  the  1-dimensional  chain  of  sites. 
Also,  such  a  comparison  for  the  infinite-length 
chain  is  made  in  Table  II,  where  energies  per  site 
are  reported  for  the  three  types  of  cluster- 
expanded  wave  functions  discussed  in  Sections 
4“6  along  with  results  reported  via  other  many- 
body  methods  and  the  exact  result  [31],  known  for 
cyclic  chain  systems,  via  a  (Bethe-Ansatz)  method 
which  does  not  seem  to  be  very  readily  extendable 
to  other  structures.  It  may  be  noted  that  the  so- 
called  Jordan-Wigner-transformed  RHF  and  UHF 
results  are  restricted  [60]  and  unrestricted  [61] 
Hartree-Fock  solutions  for  the  Hamiltonian  after  a 
transformation  of  the  Pauli  operators  in  Sj  •  to 
a  Fermionic  form,  and,  again,  this  approach  seems 
not  too  readily  extendable  to  other  structures.  The 
spin-wave  result  involves  [62]  a  Hartree-Fock  so¬ 
lution  after  transformation  to  Bosonic  operators, 
and  is  applicable  to  all  kinds  of  structures,  but  the 
bosonic  operators  are  defined  on  an  enlarged  space 


of  which  the  physically  realizable  space  is  only  a 
small  part,  while,  however,  the  approximate 
Hartree-Fock  solution  admixes  in  this  nonphysical 
space,  so  that  it  is  not  evident  that  one  obtains  a 
variational  bound  to  the  physical  ground  state. 
This  Bosonic  method  seems  not  to  be  ordinarily 
applied  to  other  than  Heisenberg  models,  although 
there  is  some  possibility  [63].  These  various  ap¬ 
proximation  schemes  which  seem  to  us  not  so 
amenable  to  other  structures  or  to  other  models 
are  indicated  in  Table  II  by  placing  their  energy 
values  in  parentheses.  Now,  in  light  of  the  discus¬ 
sion  in  connection  with  Figure  4,  this  system  is  not 
expected  to  be  exceptionally  well  described  via 
either  the  Neel-state-based  or  resonance- VB-theo- 
retic  approaches.  But,  nevertheless,  rather  reason¬ 
able  results  are  obtained  at  least  when  any  of  these 
types  of  cluster-expanded  wave  functions  are  taken 
beyond  the  lowest  order.  Indeed,  the  results  are 
quite  good  in  comparison  with  a  variety  of  stan¬ 
dard  many-body  methods,  e.g.,  as  compared  to  the 
unrestricted  Hartree-Fock  limit  (at  —  0.5000)  which 
is  one  of  the  poorer  results  in  the  table,  i.e.,  sizable 
portions  of  the  so-called  "correlation  energy,"  are 
obtained.  The  Monte  Carlo  methods  of  which  there 
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TABLE  II _ 

Energies  per  site  for  various  methods  as  applied  to 
the  nearest-neighbor  Heisenberg  Model  for  an 
infinite  ladder. 


Method 

{Energy} /JW 

Ref. 

Neel-state  energy 

Ordinary  UHF  limit  of  the 

-0.5000 

Hubbard  model 

-0.5000 

Single  Kekule  structure 

3  ^  1  first-order  real- 

-0.7500 

space  renormalization 

-0.7826 

[64] 

Hulthen-Kasteleyn  Ansatz 
Jordan-Wigner- 

-0.8156 

[31,32] 

transformed  RHF 

9  1  first-order  real- 

(-0.8383) 

[60] 

space  renormalization 
Cluster-expansion  of 

-0.8423 

[65] 

cluster  expansion 

Two-site  Neel-state-based 

-0.8469 

To  appear 

cluster  expansion  Ansatz 
(Non-variational)  antiferro- 

-0.8558 

[26] 

magnetic  spin-wave 
Jordan-Wigner- 

(-0.8634) 

[62] 

transformed  UHF 

19-^1  first-order  real- 

(-0.8646) 

[61] 

space  renormalization 
Two-bond-range 

-0.8650 

[66] 

resonating  VB  Ansatz 

First  sequential  cluster 

-0.8716 

[35] 

expansion  of  Eq.  (3.4.2) 
Four-site  Neel-state-based 

-0.8737 

To  appear 

cluster-expansion  Ansatz 
Three-bond-range 

-0.8758 

[27] 

resonating  VB  Ansatz 

-0.8790 

Present 

Exact 

(-0.886294) 

[31] 

Monte  Carlo  N  =  48  ring 

3  ^  1  additive-cluster 

-0.887 

[67] 

renormalization 
Neel-state-based  second- 

-0.8969 

[68] 

order  perturbation  theory 
Limit  to  Hubbard  Green’s 

-1.0000 

[69] 

function  approximant 

-1.2188 

[70] 

are  different  sorts  exhibit  what  seems  the  highest 
accuracy  for  a  variety  of  systems  and  models. 

5.3.  THE  HEISENBERG  MODEL  FOR  THE 

INFINITE  LADDER 

Another  example  system  is  that  of  a  ladder,  the 
structure  of  which  pictured  in  Figure  8  can  be 
viewed  as  a  strip  cut  from  the  square-planar  lat¬ 
tice.  Here,  various  results  are  reported  in  Table  III. 
This  system  satisfies  conditions  enunciated  in  con- 


FIGURE  8-  A  square-planar  lattice  strip. 

nection  with  Figure  4  so  as  to  make  a  resonating 
VB  picture  more  favorable:  This  system  is  of  a 
fairly  low  mean  coordination  number  z  =  3,  and  it 
has  many  Kekule  structures.  Indeed,  for  a  chain  of 
n-fused  squares,  the  number  of  Kekule  structures 
is  known  to  be  given  by  the  (n  +  l)th  Fibonacci 
number  f„+i,  which,  in  turn,  can  be  expressed  in 
very  many  ways,  e.g., 

F,  =  {(2  +  +  (2  +  (^_)0!^}/5,  (5.3.1) 

where  </>+=(!  +  y/5)/2  ^  1.61803399  is  the 
''golden  mean"  and  =  1  -  0+  is  some  sort  of 
complement.  In  concert  with  the  expectations,  the 
resonating  VB-theoretic  results  seem  to  be  quite 
good. 

But  perhaps  a  few  words  should  be  said  as  to 
just  what  systems  these  results  might  apply.  Basi¬ 
cally,  in  application  to  conjugated  7r-networks  with 
4-membered  rings,  the  Heisenberg  model  should 
have  important  higher-order  terms,  which,  if  not 
included,  predict  too  large  a  resonance  stabiliza¬ 
tion,  e.g.,  for  cyclobutadiene.  Thus,  the  quoted 
results  for  the  current  nearest-neighbor  model  do 
not  apply  to  such  conjugated  77-networks. 

But  the  current  nearest-neighbor  model  is  be¬ 
lieved  to  be  well  applicable  to  ladderlike  struc¬ 
tures  of  Cu^  ■^-containing  perovskites  (of  the  gen¬ 
eral  structural  class  of  high-temperature  supercon- 

TABLE  III _ 

Energies  per  site  for  various  methods  as  applied  to 
the  nearest-neighbor  Heisenberg  Model  for  an 
infinite  ladder. 


Method 

(Energy) /JW 

Ref. 

Neel-state  energy 

-0.7500 

Single  Kekule  structure 

-0.7500 

Two-site  Neel-state-based 

Ansatz 

-1.1014 

Present 

One-bond-range  RVB 

Ansatz 

-1.1138 

[68] 

Neel-state  2nc/-order 

perturbation  theory 

-1.1250 

[69] 

Two-bond-range  RVB  Ansatz 

-1.1462 

Present 

Numerical  extrapolation 

(from  exact  diagonalizatlons) 

-1.156 

[68] 
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ductors),  and,  indeed,  these  ladder  structures  have 
been  much  studied  [71-73]  in  this  context  in  the 
last  few  years.  Interestingly,  there  is  an  important 
difference  in  relative  applicability  of  Neehstate- 
based  and  resonating- VB~theoretic  wave  functions 
for  the  current  case  of  ladders  with  two  legs  in 
comparison  with  a  ladder  with  three  legs.  This 
difference  can  be  readily  understood  in  terms  of 
the  relative  abundances  of  Kekule  structures  as 
mediated  by  the  long-range  spin-pairing  orders 
discussed  in  Section  4  here,  although  these  earlier 
understood  [68]  consequences  were  termed  [71] 
''surprising''  when  first  studied  in  connection  with 
the  experimentally  realized  perovskites.  Basically, 
if  the  long-range  spin-pairing  order  deviates  much 
from  the  optimal  resonance-theoretic  circumstance 
for  which  spin-pairing  from  one  site  to  every 
neighbor  occurs  with  equal  probability,  then  reso¬ 
nance  makes  a  lesser  contribution  and  the  alterna¬ 
tive  Neel-state  description  is  relatively  more  pre¬ 
ferred. 


6.  Prospects 

Many-body  VB  theory  is  seen  to  be  imple- 
mentable  in  several  different  manners,  even  upon 
focus  on  the  circumstance  of  variationally  treated 
cluster-expanded  wave  functions.  Much  has  been 
done  with  the  simplest  Ansatze,  but  a  number  of 
computations  with  higher-order  Ansatze  indicate 
even  further  promise.  Such  results  at  least  in  some 
cases  seem  to  be  of  quite  high  accuracy.  Within  a 
variational  format,  it  is  a  challenge  to  apply  the 
methods  to  extended  systems  of  2-  or  3-dimen¬ 
sional  systems,  although  for  the  Heisenberg  model 
on  the  square-planar  lattice,  there  has  been  a  cer¬ 
tain  degree  of  work  (in  connection  with  high-tem- 
perature  superconductivity,  and  some  further  indi¬ 
cation  of  this  work  is  given  in  the  reviews  of  [12]). 

There  appear  some  qualitative  differences  be¬ 
tween  many-body  MO-based  and  VB-based  wave 
functions.  Also,  it  may  be  emphasized  that,  in 
practice,  these  differences  can  persist  to  somewhat 
high  orders,  most  especially  for  extended  many- 
body  systems.  Such  a  point  has  occasionally  been 
made  previously  [74].  But,  here,  with  the  recogni¬ 
tion  of  the  correspondence  to  statistical-mechanical 
quantities,  as  in  Figure  3,  two  different  types  of 
wave  functions  might  be  imagined  to  fall  into 
what  in  statistical  mechanics  are  termed  different 
"universality  classes,"  e.g.,  exhibiting  different 


types  of  long-range  order  and  exhibiting  different 
critical  point  behavior  (independent  of  perturba¬ 
tive  corrections  to  a  zero-picture).  Thus,  many- 
body  VB-theoretic  cluster-expanded  wave  func¬ 
tions  can  be  imagined  to  be  much  more  than  just 
another  way  to  do  a  computation  equivalent  to 
that  for  an  MO-based  many-body  wave 
function — the  two  descriptions  need  not  in  any 
achievable  high  order  be  equivalent.  The  question 
then  devolves  to  just  which  systems  each  of  the 
two  types  of  descriptions  apply. 

Thus,  there  appears  a  certain  need  for  VB-theo- 
retic  descriptions.  Although  the  computations  de¬ 
scribed  here  have  focused  on  Heisenberg  and  con- 
jugated-circuit  models,  there  is  also  some  (lesser 
amount  of)  work  on  Hubbard  or  PPP  semiempiri- 
cal  models.  It  is  hoped  that  the  methods  will  be 
applied  in  an  ab  initio  framework. 
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ABSTRACT:  We  outline  the  CASVB  strategy,  which  may  be  used  either  to  generate 
very  compact  modern  valence  bond  representations  of  CASSCF  wave  functions  or  to 
optimize  general  types  of  modern  VB  wave  function.  A  simple  algorithm  is  presented  for 
the  elimination  of  redundant  or  constrained  parameters  from  the  appropriate  second-order 
optimization  problems.  Selected  results  for  three  systems — benzene,  the  allyl  radical,  and 
LiH — are  used  to  illustrate  various  ways  in  which  CASVB  procedures  can  be  used  to 
perform  fully  variational  optimizations  of  nonorthogonal  orbitals  and  structure 
coefficients  in  modern  VB  wave  functions.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem 
65;  439-451,  1997 

Key  words:  CASVB;  modern  VB;  nonorthogonal  orbital  optimization; 
multiconfiguration  spin-coupled;  CASSCF 


Introduction 

The  initial  aim  of  developing  our  CASVB  pro¬ 
cedures  [1-4]  was  to  exploit  the  invariance  of 
wave  functions  of  "full  configuration  interaction 
(Cl)"  form  to  general  nonunitary  linear  transfor- 
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mation  of  the  active  orbitals,  so  as  to  generate 
alternative  representations  of  CASSCF  wave  func¬ 
tions  in  which  a  dominant  component  adopts 
modern  VB  form.  By  a  wave  function  of  "modern 
VB"  form,  we  mean  here  one  or  more  spatial 
configurations  constructed  from  general,  nonortho¬ 
gonal  orbitals,  and  combined  with  all  allowed  ways 
of  coupling  together  the  electron  spins  so  as  to 
obtain  the  required  resultant.  The  prototype  sin- 
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gle-configuration  wave  function  of  this  type  is  the 
spin-coupled  wave  function  [5]: 

^SC  =  •••<?,?  ©pp  <^2  •  •  •  ©sm  )  (1) 

in  which  the  singly  occupied  nonorthogonal  or¬ 
bitals  accommodate  the  active  electrons,  and 
the  doubly  occupied  orthogonal  orbitals  (p^  accom¬ 
modate  the  inactive  ones.  The  spin  eigenfunction 
&SM  is  fully  optimized  in  the  full  spin  space  for  N 
electrons  with  total  spin  S  and  projection  M,  and 
0pp  represents  perfect  pairing  of  the  spins  of  the 
2n  inactive  electrons. 

It  was  anticipated  from  the  outset  that  it  would 
be  desirable  to  incorporate  our  CASVB  procedures 
into  molecular  orbital  (MO)  Cl  packages  that  fea¬ 
ture  powerful  CASSCF  codes.  This  could  make  it 
possible  to  use  such  "standard"  packages  to  carry 
out  the  fully  variational  optimization  of  quite  gen¬ 
eral  types  of  modern  VB  wave  function,  by  alter¬ 
nating  optimization  of  the  active  and  external  pa¬ 
rameter  sets  in  an  iterative  two-step  procedure. 
The  CASVB  procedures  have  already  been  made 
available  as  part  of  the  molpro  package  [6],  in 
which  they  are  interfaced  to  a  very  efficient 
CASSCF  program  [7],  and  we  hope  to  see  them 
incorporated  into  many  others  in  due  course. 

Since  almost  all  of  the  technical  details  under¬ 
pinning  our  approach  are  available  elsewhere 
[1-4],  we  present  in  the  next  section  only  a  brief 
overview  of  the  methodology.  An  important  as¬ 
pect  of  actual  calculations  is  our  scheme  for  elimi¬ 
nating  redundant  or  constrained  parameters  from 
the  second-order  optimization  procedures;  this  is 
described  here  for  the  first  time.  We  then  use 
selected  results  for  three  systems — benzene,  the 
allyl  radical,  and  LiH — to  illustrate  some  of  the 
ways  in  which  the  CASVB  procedures  can  be  uti¬ 
lized  to  perform  fully  variational  optimizations  of 
nonorthogonal  orbitals  and  structure  coefficients  in 
modern  VB  wave  functions. 


CASVB  Method 

Consider  a  normalized  CASSCF  wave  function 
^CAS  consisting  of  structures  built  from  m 
active  orbitals,  A  general,  nonunitary,  nonsin¬ 
gular  m  X  m  linear  transformation  of  the  orbitals 

m  =  {ct>}0  (2) 


induces  a  corresponding  X  transformation 
of  the  structure  space 

m  =  (omo).  (3) 

Without  explicitly  constructing  T(0),  its  effect  on 
the  complete  structure  space  can  be  realized  in  a 
very  efficient  manner  by  writing  O  as  a  product  of 
simple  updates  and  utilizing  the  general  prop¬ 
erty  of  full  Cl  spaces  that 

O  -  O1O2  ^  T(0)  -  T(0,)T{0^),  (4) 

The  simple  updates  take  the  form 

0^.(A):  +  (5) 

for  which  the  corresponding  transformation  of  the 
structure  space  can  be  achieved  by  application  of 
the  operator 

f+A^V  +  A^E®,  (6) 

in  which  and  generate  spin-averaged 
single  and  double  excitations.  This  highly  efficient 
strategy  [1-4,  8]  for  realizing  exactly  the  effects  of 
the  structure  transformation  in  Eq.  (3)  is  central  to 
the  viability  of  the  CASVB  approach.  We  have  also 
presented  expressions  for  the  first  and  second 
derivatives  of  T(0)  with  respect  to  the  A  parame¬ 
ters  [2]. 

Consider  now  a  modern  VB  wave  function 
consisting  of  a  linear  combination  of  a  subset  of 
the  Nqi  structures  in  the  CASSCF  expansion,  but 
constructed  instead  from  m  nonorthogonal  orbitals. 
An  appealing  criterion  for  choosing  the  orbital 
transformation  O  in  Eq.  (2)  is  to  maximize  the 
overlap  Syg  between  and  ^cas/ 

_  ^"^casI^vb)  , 

An  efficient  second-order  Newton-Raphson  strat¬ 
egy  for  maximizing  Syg  was  presented  in  Ref.  [2]. 
This  approach  provides  a  relatively  inexpensive 
scheme  for  representing  CASSCF  wave  functions 
in  modern  VB  form.  Very  high  values  of  Syg,  on 
the  order  of  0.99,  may  typically  be  achieved  with  a 
single  spin-coupled-like  configuration  [Eq.  (1)]. 

An  alternative  to  maximization  of  Syg  is  to 
choose  O  by  minimizing  the  expectation  of  the 
energy,  Eyg,  defined  according  to 

p  _  ^^vbI^I^vb)  ,  . 

<^VBmB>  ‘  ^ 
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The  nonorthogonality  problem  implicit  in  evaluat¬ 
ing  the  matrix  elements  in  this  expression  may  be 
circumvented  by  transforming  back  to  the  original 
basis  of  orthogonal  molecular  orbitals,  using  the 
identities 


H'  =  T^(0)HT(0), 

(9) 

S'  =  T^(0)ST(0), 

(10) 

I 

o 

H 

II 

u 

o 

I 

H 

II 

(11) 

in  which  H,  S,  and  c  are  the  Hamiltonian  matrix, 
overlap  matrix,  and  Cl  vectors  appropriate  to  {O}, 
whereas  the  primed  quantities  relate  to  {O'}.  An 
efficient  second-order  Newton-Raphson  strategy 
for  minimizing  Eyg  was  presented  in  Ref.  [2].  The 
resulting  is  optimal  for  the  given  choice  of 
active  space  and  corresponds  to  a  representation  of 
the  full  CASSCF  wave  function  in  which  a  small 
number  of  modern  VB  structures  dominate  the 
total  energy.  It  is  our  experience  that  analogous 
overlap-  and  energy-optimized  wave  functions  are 
always  in  very  good  qualitative  agreement.  The 
viability  of  the  energy  optimization  method  de¬ 
pends  on  the  availability  of  highly  efficient  algo¬ 
rithms  for  realizing  the  action  of  the  Hamiltonian 
operator  on  a  full-CI  vector  [9,  10], 

Furthermore,  optimization  of  the  energy  expec¬ 
tation  value  Eyg  with  respect  to  core-active,  core- 
virtual,  and  active-virtual  orbital  rotations  can  be 
achieved  using  standard  CASSCF  procedures.  As  a 
consequence,  fully  variational  modern  VB  calcula¬ 
tions  can  be  performed  by  alternating  optimization 
of  the  active  and  external  parameter  sets  in  an 
iterative  two-step  procedure.  This  has  been  imple¬ 
mented  as  part  of  the  molpro  package  [6].  We 
have  also  presented  practical  approaches  [3]  for 
the  proper  utilization  of  molecular  point  group 
symmetry  in  modern  VB  approaches  that  involve 
the  optimization  of  nonorthogonal  orbitals. 


Elimination  of  Redundant  or 
Constrained  Parameters 

We  consider  in  this  section  the  optimization  of  a 
general  multiconfiguration  modern  VB  wave  func¬ 
tion  of  the  form 

^struc 

^act  ^  ^  Q2) 


in  which  the  structures  ipi  are  defined  in  terms  of 
a  set  of  m  orbitals,  {(^^},  with  varying  occupations 
and  spin  functions.  We  have  used  the  notation 
to  remind  ourselves  that  the  total  wave  function 
can,  without  loss  of  generality,  also  include  or¬ 
bitals  that  are  doubly  occupied  in  all  structures. 
The  active  orbitals  are  allowed  to  be  nonorthogo¬ 
nal,  and  are  expanded  in  a  set  of  basis  functions 
{ Xi)  according  to 

^bas 

JlOi.Xr  (13) 

It  is  convenient  to  collect  the  "'primitive"  varia¬ 
tional  parameters  from  Eqs.  (12)  and  (13)  into  a 
vector  X,  such  that  contains  the  expansion 

coefficients  for  orbital  con¬ 

tains  the  expansion  coefficients  for  orbital  (^>2,  etc., 
and  x(m  -f  l:m  X  +  ^struc)  holds  the 
structure  coefficients.  Our  procedure  seeks  to  min¬ 
imize  the  energy  Eyg(x)  based  on  a  second-order 
model  of  the  form 

Eyg(x  +  8x)  -  Eyg(x)  +  6x  +  1/2  6x^G  5x, 

(14) 

in  which  g  is  the  (column)  vector  of  first  deriva¬ 
tives  of  E  with  respect  to  the  variational  parame¬ 
ters,  and  G  is  the  corresponding  matrix  of  second 
derivatives  (Hessian).  There  are  many  variants  of 
such  procedures,  such  as,  for  example,  stabilized 
Newton- Raphson  schemes  or  augmented  Hessian 
approaches  [11-13]. 

Methods  for  eliminating  redundant  or  con¬ 
strained  parameters  from  such  second-order  opti¬ 
mization  problems  are  well  known  in  the  literature 
(see,  e.g..  Ref.  [11]).  A  common  way  of  taking 
constraints  into  account  is  to  include  Lagrange 
multipliers  in  the  objective  function.  It  is  well 
known,  however,  that  such  approaches  can  ad¬ 
versely  affect  the  convergence  characteristics.  An 
attractive  alternative  is  the  use  of  nonlinear  elimi¬ 
nation  in  a  constrained  minimization  scheme  that 
involves  first  and  second  derivatives  of  the  con¬ 
straints  with  respect  to  the  variational  parameters 
[13].  The  second-order  nonlinear  elimination  con¬ 
strained  minimization  algorithm  described  in  Ref. 
[13]  has  proved  invaluable  in  the  direct  calculation 
of  core-optimized  spin-coupled  wave  functions. 

We  have  strived  here  for  a  computationally 
somewhat  simpler  approach,  bearing  in  mind  that 
the  number  of  variational  parameters  can  be  quite 
considerable  in  practical  applications.  Some  of  the 
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methodology  described  here  has  now  also  been 
implemented  in  our  determinant-based  spin-cou¬ 
pled  code  [14].  Direct  comparisons  with  the  non¬ 
linear  elimination  approach  show  no  appreciable 
degradation  in  convergence  characteristics,  but  a 
significant  reduction  in  the  computational  over¬ 
head.  Analogous  considerations  to  those  described 
here  apply  also  to  CASVB  calculations  that  involve 
maximization  of  Sy^Cx)  rather  than  minimization 
of  Evb(x). 

One  can  of  course  always  define  nonsingular 
linear  transformations  of  the  parameters 


>< 

11 

(15) 

and  rewrite  the  second-order  model  in  the  form 

EvB(y  +  8y)  =  £vB(y)  +  (s^)^  sy 

+  l/2  8yTG!'8y, 

(16) 

with 

gy  =  (L‘*)^g 

(17) 

and 

Gy  =  (l-O^gl''^ 

(18) 

It  is  useful  to  point  out  here  the  practical  advan¬ 
tage  of  choosing  L  to  be  an  orthogonal  transforma¬ 
tion,  so  that  =  L^. 

At  the  start  of  each  iteration,  x  is  modified  (if 
necessary)  so  that  ^^yg  obeys  all  of  the  constraints. 
Our  primary  concern  in  the  optimization  proce¬ 
dure  is  then  to  define  a  rectangular  transformation 
matrix  L  such  that  our  constraints  are  necessarily 
satisfied  in  the  updated  vector,  x  +  8x.  In  the 
CASVB  strategy,  we  can  distinguish  three  classes 
of  constraint: 

1.  Simple  constraints  of  the  form  c^x  =  0 

2.  Normalization  conditions  for  orbital  and 
structure  coefficients 

3.  Orthogonality  constraints 

These  are  in  order  of  increasing  complexity  but,  in 
practice,  the  elimination  of  parameters  associated 
with  normalizations  must  always  be  carried  out 
last. 

SIMPLE  CONSTRAINTS 

Constraints  that  may  be  expressed  in  the  form 

Y,c(i)x(i)  =  0  (19) 


include  various  simple  cases  such  as,  for  example, 
forcing  a  single  expansion  coefficient  to  be  zero  (in 
which  case  c  has  only  one  nonzero  coefficient)  or 
setting  pairs  of  coefficients  equal  or  opposite. 
However,  much  more  complicated  conditions,  such 
as  symmetry  requirements  for  orbitals,  are  also 
covered  here.  The  symmetry  condition  on  a  single 
orbital 

6%  =  (20) 

can  be  satisfied  by  diagonalizing  the  matrix  repre¬ 
sentation  of  the  symmetry  operation  in  the 
basis  of  { then  expanding  (f)^  in  the  (right 
hand)  eigenvectors  corresponding  to  the  eigen¬ 
value  A  [3].  If  such  eigenvectors  exist,  then  the 
symmetry  condition  embodied  in  Eq.  (20)  gener¬ 
ates  simple  constraints,  where  the  con¬ 

straints  vectors  will  span  the  orthogonal  comple¬ 
ment  to  the  eigenspace  associated  with  A,  When 
two  orbitals  are  related  by  symmetry,  as  in 

=  (21) 

then  the  set  of  allowed  parameter  vectors  can  be 
found  by  the  matrix  multiply 

c(/  +  (v-  DWbas) 

=  i;(0%c(y  +  (ju.  - 

i 

z  =  l,...,OTb,s/  (22) 

for  each  of  the  linearly  independent  vectors 
describing  updates  to  (f)^.  As  before,  the  con¬ 
straints  may  be  defined  as  the  parameter 

vectors  spanning  the  orthogonal  complement.  Cal¬ 
culations  employing  symmetry  conditions  on  the 
orbitals  are  described  later  for  the  allyl  radical  and 
for  LiH. 

NORMALIZATION  CONDITIONS 

In  order  for  an  orbital  normalization  to  be  sta¬ 
tionary,  i.e., 

+  =  (23) 

we  require  (to  first  order)  that  =  0.  This 

condition  can  be  expressed  as 

(0<'‘>)^S50^'‘^  =  0  (24) 
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in  which  is  the  fith  column  of  the  orbital 
transformation  matrix,  and  S  is  the  matrix  of  over- 
laps  between  basis  functions.  In  terms  of  the  con¬ 
straints  vector  c,  this  then  becomes 

c(i  +  =  ((0('^>fs),., 

/  =  (25) 

The  elimination  of  the  normalization  parameters 
may  thus  be  carried  out  in  a  very  similar  fashion 
to  the  elimination  of  the  simple  constraints  de¬ 
scribed  above.  The  slight  difference  in  this  case  is 
that  the  condition  is  now  on  the  update  vector 
only,  i.e.,  of  the  form  6  x  =  0.  The  normalization 
condition  associated  with  the  structure  coefficients 
can  be  eliminated  in  an  exactly  analogous  manner. 

ORTHOGOIVALITY  COEVSTRAIIXTS 

Although  the  CASVB  procedures  are  primarily 
intended  for  the  determination  of  intrinsically 
nonorthogonal  orbitals,  there  are  various  situations 
in  modern  VB  calculations  in  which  it  may  be 
advantageous  to  enforce  orthogonality  between 
some  of  the  orbitals.  Such  an  example  is  given 
later  for  LiH. 

For  the  elimination  of  the  constraint  associated 
with  (cjyjcf)^)  =  0  we  write  the  update  of  orbital 
<f)^  in  the  form  ==  where 

>  =  0.  Similarly,  =  5(c(/>^  +  d<l)^).  The 
condition 


tion.  This  must  be  achieved  without  affecting  the 
simple  orbital  constraints  outlined  above  and  so 
we  have  adopted  what  might  best  be  termed  a 
pairwise  symmetric  orthogonalization.  For  each  or- 
thogonalization  pair  in  turn,  we  solve 

<</)^  + + A(A,>  =  0  (28) 

for  the  two  update  vectors  and  A(^^.  We  have 
used  the  notation  A  cf)^  to  avoid  confusion  with  the 
update  84>^  predicted  by  the  second-order  model. 
The  updates  must  be  "'allowed."  For  example,  A(/)^ 
must  be  orthogonal  to  constraints  vectors  involv¬ 
ing  as  well  as  to  any  orbitals  (p^  for  which 
=  0  has  previously  been  enforced.  Fur¬ 
thermore,  it  is  useful  to  require  that  A  is  orthog¬ 
onal  to  (p^.  The  natural  choice  for  this  update 
vector  is  clearly  the  projection  of  cj)^  onto  the 
allowed  space,  because  this  will  require  a  minimal 
change  to  both  orbitals  ((p^  and  (p^,)  to  achieve 
orthogonality.  If  both  orbitals  have  nonzero  projec¬ 
tions  on  the  opposite  allowed  space,  we  can  solve 
for  the  value  of  c  in 


A(/>, 


=  0 


(29) 


(A</)^  and  A<^^  are  here  the  projections  of  the  other 
orbital).  The  special  cases  where  the  projection  of 
either  of  cf)^  or  <j)^  is  null  are  treated  separately  in  a 
similar  manner. 


+  +  (26) 

requires,  to  first  order  in  the  orbital  changes,  that 
a  =  -c.  In  practical  terms,  this  leads  to  a  con¬ 
straints  vector  that  is  very  similar  to  the  normal¬ 
ization  case: 

c(f  +  (^-  Dm^as)  =  ((OW)^S),., 

c(i  +  (v-  Dm^as)  =  ((0^'"^)^s),., 

f  =  l,...,mi,as.  (27) 

While  this  is  not  exact,  because  (p^  and  </>/  will 
not  in  general  be  orthogonal,  it  does  become  a  very 
good  approximation  close  to  convergence.  The 
main  effects  of  the  approximation  are  likely  to  be 
on  the  global  convergence  characteristics,  but  in 
our  applications  so  far  this  has  not  been  a  problem. 

Since  the  elimination  of  orthogonality  con¬ 
straints  is  only  approximate,  the  orthogonality 
must  be  reimposed  after  each  update  in  an  itera¬ 


IMPLEMENTATIOIN 

We  can  see  that  all  three  types  of  constraint  can 
be  expressed  as  conditions  on  the  update,  taking 
the  form 

cyax  =  o,  Z-1,...,N,,,.  (30) 

The  set  of  free  variables  of  the  problem  can  there¬ 
fore  be  found  as  the  orthogonal  complement  of 
this  set  of  Neon  vectors.  We  find  it  by  a  series  of 
standard  Gram-Schmidt  orthogonalizations,  as 
follows: 

1.  The  Neon  constraints  vectors  cj  are  first  or- 
thogonalized,  discarding  any  with  a  norm 
below  an  appropriate  threshold. 

2.  The  full  set  of  unit  vectors  is  orthogonalized 
to  the  surviving  constraints. 

3.  The  resulting  vectors  are  finally  orthogonal¬ 
ized,  discarding  any  with  a  norm  below  the 
threshold. 
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The  first  step  not  only  facilitates  the  evaluation  of 
the  orthogonal  complement,  but  it  also  has  the 
effect  of  eliminating  any  linear  dependencies 
among  the  constraints  vectors.  The  final  orthogo- 
nalization  not  only  establishes  the  minimum  lin¬ 
early  independent  set  of  free  parameters,  but  it 
also  ensures  that  the  transformation  matrix  L  be¬ 
comes  orthogonal. 

The  orthogonal  complement  could  of  course  be 
found  in  various  other  ways,  but  we  have  found 
that  the  above  procedure,  starting  from  a  full  set  of 
unit  vectors,  tends  to  give  a  transformation  matrix 
L  of  maximum  sparseness.  This  can  significantly 
influence  the  efficiency  of  the  transformation  of  the 
Hessian  embodied  in  Eq.  (18). 


Results 

We  have  chosen  three  sets  of  examples  to  indi¬ 
cate  some  of  the  various  ways  in  which  the  CASVB 
code  can  be  used  to  carry  out  fully  variational 
optimizations  of  nonorthogonal  orbitals  in  modern 
VB  wave  functions:  benzene,  the  allyl  radical,  and 
LiH.  All  of  the  calculations  described  here  were 
carried  out  using  the  appropriate  modules  now 
available  in  the  MOLPRO  [6]  suite  of  programs. 

BENZENE 

For  benzene  (D^^)  we  adopted  bond  lengths 

=  139.64  pm  and  =  108.31  pm.  The  s/p 
basis  functions  for  carbon  and  the  s  basis  functions 
for  hydrogen  were  taken  from  correlation  consis¬ 
tent  pVTZ  basis  sets  [15].  These  were  augmented 
with  polarization  functions  with  exponents  dQ  = 
0.8  and  p^  =  1.0,  so  that  the  basis  sets  for  C/H 
consist  of  (10  s5  pi  d/5slp)  Cartesian  GTOs  con¬ 
tracted  to  [4:s3pld/3slp].  A  6-in-6  CASSCF  calcu¬ 
lation  was  performed  for  the  tt  electrons,  keeping 
the  36  cr  electrons  in  an  optimized  closed-shell 
core.  The  nondynamical  correlation  energy  incor¬ 
porated  in  this  wave  function  is  73  millihartrees 
(see  Table  I).  By  definition,  the  RHF  wave  function 
[core](fl2u)^(^i^)^  accounts  for  none  of  this  corre¬ 
lation  energy.  The  weight  of  the  configuration 
[core](fl2w)^(^i^)^  the  full  CASSCF  wave  func¬ 
tion  is  88.4%. 

Spin-coupled  calculations,  with  an  active  space 
corresponding  to  the  six  tt  electrons,  were  carried 
out  (using  the  CASVB  code)  with  a  frozen  core, 
taken  directly  from  the  CASSCF  calculation.  This 


TABLE  I _ 

Calculated  energies  for  benzene. 


Calculation 

E, 

hartrees 

^  ^CAS’ 

millihartrees 

Proportion  of 

^CAS  ”  ^RHF 

CASSCF 

-230.8368216 

0 

100% 

SC 

-230.8293315 

7.49 

89.7% 

SC  (f.c.) 

-230.8293314 

7.49 

89.7% 

RHF 

-230.7640556 

72.77 

0% 

calculation,  labeled  SC  (f.c.)  in  Table  I,  recovers 
nearly  90%  of  the  nondynamical  energy  and  the 
overlap  with  the  CASSCF  wave  function  exceeds 
0.995.  The  further  energy  lowering  and  change  in 
SyB  on  optimizing  the  a  ''core"  is  negligible 
(calculation  SC  in  Table  I). 

Although  no  such  constraints  were  imposed, 
and  the  calculations  were  actually  performed  in 
C2^  symmetry,  we  find  six  symmetry-equivalent 
spin-coupled  tt  orbitals  that  may  be  transformed 
into  one  another  by  successive  C^  rotations.  Num¬ 
bering  the  orbitals  sequentially  around  the  ring, 
the  symmetry-unique  overlap  integrals  are 
<(/>il</>2>  -  0.524,  <(AiIc/>3>  =  0.029,  and  = 

—  0.157.  Each  orbital  consists  of  a  deformed  C(2p^) 
function  associated  with  a  given  carbon  atom,  but 
it  exhibits  some  deformation  toward  the  neighbor¬ 
ing  atoms  in  the  ring  (see  Fig.  1).  This  outcome  is, 
of  course,  entirely  consistent  with  all  of  the  previ¬ 
ous  spin-coupled  studies  of  this  molecule  [3, 
16-22].  The  deformations  of  the  orbitals  play  a 
crucial  role;  without  them,  it  would  be  essential  to 
incorporate  also  the  plethora  of  ionic  structures 
with  double  occupancy  of  one  or  more  orbitals. 

Expressed  in  the  Rumer  basis  [23],  often  em¬ 
ployed  in  classical  VB  calculations,  the  total  spin 
function  corresponds  to  weights  of  40.6%  each  for 
the  two  Kekule  structures  and  of  6.3%  each  for  the 
three  para-bonded  C'Dewar")  structures.  Using  the 
same  set  of  orbitals,  the  energy  change  on  restrict¬ 
ing  the  spin  space  to  just  one  of  the  Kekule  struc¬ 
tures  corresponds  to  a  resonance  energy  of  83.5  kj 
mol“\ 

Hirao  et  al.  [24]  have  adopted  the  same  name, 
i.e.,  CASVB,  for  calculations  in  which  they  trans¬ 
formed  the  CASSCF  wave  function  to  a  representa¬ 
tion  based  on  orthogonal  localized  molecular  or¬ 
bitals.  In  the  case  of  benzene,  they  report  weights 
of  7.8%  each  for  the  Kekule  structures  and  of  2.6% 
each  for  the  para-bonded  structures.  In  their  model, 
76.6%  of  the  benzene  ground-state  wave  function 
is  ionic.  Analogous  conclusions,  but  by  no  means 
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FIGURE  1.  Various  representations  of  a  symmetry-unique  spin-coupled  tt  orbital  for  benzene.  (Left)  Contours  in  the 
horizontal  plane  1  bohr  above  the  molecular  plane.  (Center)  Contours  in  a  vertical  mirror  plane.  (Right)  A  representative 
three-dimensional  contour  (isosurface). 


quite  so  extreme,  arise  in  classical  VB  descriptions 
based  on  strictly  localized  orbitals  [25].  It  should 
be  clear  why  the  present  authors  prefer  modern 
VB  calculations  with  fully-optimized  nonorthogo- 
nal  orbitals. 

The  Chirgwin-Coulson  weight  [26]  of 
^CAs  is  99.1%,  so  that  spin-coupled  ionic  configu¬ 
rations  amount  to  just  0.9%  of  the  total  wave 
function.  Nonetheless,  this  very  small  component, 
with  1  ”  SvB  <  0.005,  accounts  for  some  10%  of 
the  nondynamical  correlation  energy,  and  so  it  is 
natural  to  analyze  it  further.  We  find  that  doubly 
ionic  configurations  are  the  more  important  con¬ 
tributors,  with  an  accumulated  Chirgwin-Coulson 
weight  of  0.55%.  This  is  in  line  with  the  expected 
diminished  importance  of  singly  ionic  configura¬ 
tions  in  wave  functions  based  on  fully  optimized 
nonorthogonal  orbitals.  Furthermore,  from  the 
dominance  in  the  total  wave  function  of  the  two 
Kekule-like  configurations,  we  can  expect  singly 
ionic  configurations  with  charges  in  ortho  posi¬ 
tions  to  be  the  least  important.  This  is  borne  out  by 
the  total  weights  of  the  various  singly  ionic  config¬ 
urations,  which  decrease  in  the  order  meta  >  para 
>  ortho.  We  find  that  charges  in  meta  positions 
give  a  total  weight  of  0.32%,  whereas  para  and 
ortho  contributions  are  orders  of  magnitude 
smaller. 

ALLYL  RADICAL 

As  a  test  of  the  convergence  characteristics  of 
the  CASVB  code,  we  have  reinvestigated  the  ^A2 
ground  state  of  the  allyl  radical,  C3H5,  treating  as 
active  the  three  77  electrons.  The  spin-coupled  VB 
approach  (see  also  the  discussion  of  LiH)  has  re¬ 


cently  been  employed  by  Oliva  et  al.  [27]  to  study 
the  doublet  valence  states  and  the  n  =  3  Rydberg 
excited  states  of  the  allyl  radical  below  the  first 
ionization  potential  (8.13  eV).  In  general,  the  exci¬ 
tation  energies  compare  favorably  with  those  ob¬ 
tained  in  analogous  three-electron  full  Cl  calcula¬ 
tions. 

The  calculations  in  the  present  work  were  per¬ 
formed  in  C2J;  symmetry,  using  the  same  geome¬ 
try  as  Oliva  et  al.  [27]  and  the  basis  set  they 
denoted  basis  II,  except  that  the  most  diffuse  d 
function  (exponent  0.015)  was  placed  on  all  three 
carbon  atoms,  rather  than  just  on  the  central  one. 
The  total  number  of  contracted  Cartesian  GTOs  is 
120  (52Ai,39Bi,  I7B2, 12A2).  A  3-in-3  CASSCF  cal¬ 
culation  was  performed  for  the  tt  electrons,  with 
the  20  cr  electrons  accommodated  in  an  optimized 
closed-shell  core.  Our  various  total  energies  are 
collected  in  Table  II. 

As  was  also  found  in  previous  work  [28],  a  fully 
variational  spin-coupled  calculation  with  the  same 
choice  of  active  and  inactive  spaces  generates  a 
total  wave  function  that  is  identical  to  that  from 
the  CASSCF,  albeit  in  much  more  compact  form. 
One  of  the  spin-coupled  orbitals,  cf)^,  takes  the 


TABLE  II _ 

Calculated  energies  for  the  allyl  radical  (^^2). 


Calculation 

E,  hartrees 

^  ^CAS 

millihartrees 

3-in-3  CASSCF 

-116.522760 

0 

SC:  antipair  orbitals 

-116.522760 

0.00 

SC:  localized  orbitals 

-116.521438 

1.32 

RHF 

-116.483356 

39.40 

INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


445 


COOPER,  THORSTEINSSON,  AND  GERRATT 


form  of  a  deformed  C(2p^)  function  associated 
with  the  central  C  atom,  but  with  deformations 
toward  the  terminal  C  atoms.  The  remaining  spin- 
coupled  orbitals,  (f)^  and  (^2/  adopt  antipair  form, 
as  is  shown  in  the  top  row  of  Figure  2:  they 
resemble  in-  and  out-of-phase  combinations  of  de¬ 
formed  C(2p^)  fimctions  associated  with  the  ter¬ 
minal  C  atoms.  The  corresponding  electron  spins 
are  predominantly,  but  not  exclusively,  triplet  cou¬ 
pled.  Expressed  in  the  Kotani  or  Serber  bases  [23], 
the  mode  with  triplet  coupling  of  these  two  spins 
accounts  for  99.1%  of  the  total  spin  function.  The 
only  nonzero  overlap  is  =  0.744. 

It  proved  possible  in  a  previous  study  [28]  to 
locate  also  a  rival  spin-coupled  solution  in  which 
and  </)2  adopt  more  localized  forms,  in  which 
they  are  equivalent  with  respect  to  reflection  in  the 
mirror  plane  perpendicular  to  the  molecular 
plane.  In  a  somewhat  smaller  basis  set,  and  with¬ 
out  full  optimization  of  the  a  core,  the  energy  of  a 
solution  based  on  localized  orbitals  was  found  to 
be  inferior  to  that  of  the  antipair  solution  by  1.5 
millihartrees  [28]. 

In  the  present  work,  we  used  the  CASVB  code 
to  optimize  a  constrained  spin-coupled  wave  func¬ 
tion  in  which  cj)-^  and  <^2  permuted  by  reflec¬ 
tion  in  the  mirror  plane,  but  (f)^  is  invariant  to 


this  operation.  The  doubly  occupied  core  orbitals 
were  fully  optimized  simultaneously  with  the 
spin-coupled  tt  orbitals.  The  convergence  charac¬ 
teristics  of  the  constrained  optimization  procedure 
were  found  to  be  excellent.  The  resulting  spin-cou¬ 
pled  TT  orbitals  are  shown  in  the  bottom  row  of 
Figure  2.  No  constraints  were  applied  to  the  spin 
space,  but  it  turns  out  that  in  order  to  achieve  the 
correct  overall  ^A2  symmetry  the  spins  associated 
with  orbitals  <pi  and  02  required  to  be  strictly 
triplet  coupled.  The  nonzero  overlaps  are  <0il03> 
=  <02l^3>  =  0-501  and  <0il02>  =  “0.184.  The 
difference  in  energy  from  the  antipair  solution 
(1.32  millihartrees,  see  Table  II)  is  similar  to  the 
value  found  previously  [28]  and  arises  from  the 
removal  of  all  the  spin  degrees  of  variational  free¬ 
dom.  Convergence  to  this  localized  orbital  solution 
is  only  possible  with  the  appropriate  symmetry 
constraints;  without  them,  this  solution  corre¬ 
sponds  only  to  a  saddle  point  in  the  full  spin-cou¬ 
pled  variational  space. 

LiH 

It  is  useful  to  describe  first  the  key  features  of  a 
previous  spin-coupled  VB  study  of  the 
ground  state  of  LiH  [29].  The  spin-coupled  wave 


FIGURE  2.  Spin-coupled  tt  orbitals  for  the  ^^2  ground  state  of  the  allyl  radical,  shown  as  contours  in  the  horizontal 
plane  1  bohr  above  the  molecular  plane. 
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function  may  be  written  in  the  form 

(31) 

with  four  singly  occupied  nonorthogonal  orbitals. 
For  four  electrons  with  a  net  spin  of  zero,  the  full 
spin  space  [23]  consists  of  just  two  linearly  inde¬ 
pendent  modes  of  spin  coupling.  The  two  spin¬ 
coupling  coefficients  (cqj^)  were  optimized  simulta¬ 
neously  with  the  orbitals,  without  any  constraints 
that  would  alter  the  total  wave  function. 

The  converged  spin-coupled  orbitals  all  possess 
a  symmetry.  At  large  nuclear  separations,  they 
correspond  to  Li(lsls'2s)  and  H(ls).  We  found 
that  the  two  "core"  orbitals,  and  the  overlap  be¬ 
tween  them  0.93),  remain  essentially  un¬ 
changed  as  the  atoms  approach.  The  H(ls)  func¬ 
tion  also  changes  relatively  little.  On  the  other 
hand,  dramatic  changes  occur  in  the  orbital  ini¬ 
tially  associated  with  Li(2s).  As  the  atoms  ap¬ 
proach,  electron  density  is  transferred  from  lithium 
to  the  hydrogen  atom,  in  the  form  of  a  rather 
diffuse  deformed  H(ls)  function.  For  nuclear  sepa¬ 
rations  around  the  form  of  the  two  valence 
orbitals  implies  considerable  Li'^Fi"  character  [29]. 
Perfect  pairing  remains  the  dominant  mode  of  spin 
coupling  for  all  values  of  R. 

At  convergence,  each  occupied  orbital  gives  rise 
to  a  "stack"  of  orthonormal  virtual  orbitals,  deter¬ 
mined  in  an  effective  field  which  excludes  the 
occupied  orbital.  Virtual  orbitals  in  different  stacks 
are  not  in  general  orthogonal  to  one  another.  Fur¬ 
ther  spatial  configurations  were  generated  by 
means  of  excitations  from  valence  spin-coupled 
orbitals  and  0-4)  into  corresponding  virtual 
orbitals,  and  the  corresponding  wave  functions. 


total  energies  and  dipole  moments  were  obtained 
in  very  compact  nonorthogonal  Cl  calculations.  A 
suitably  constructed  series  of  such  spin-coupled 
VB  expansions  [29]  exhibited  very  rapid  conver¬ 
gence  onto  a  dipole  moment  function  previously 
calculated  by  means  of  accurate  MO  Cl  calcula¬ 
tions. 

The  calculations  in  the  present  work  were  per¬ 
formed  in  €2^  symmetry  using  the  density-matrix 
averaged  atomic  natural  orbital  basis  sets  of  Wid- 
mark  et  al.  [30].  For  Li/H,  these  sets  consist  of 
(14s9p4d3//8s4p3d)  spherical  GTOs  contracted  to 
[6s5p3d2//4s3p2rf],  generating  a  total  of  73  GTOs 
(32Ai,17Bi,17B2,7A2).  A  full-valence  CASSCF 
calculation  for  this  molecule  involves  just  two  ac¬ 
tive  electrons,  with  the  orbitals  partitioned: 

■  Optimized  core  (double  occupancy):  (i.e., 

Icr) 

■  Active  (variable  occupancy):  2ay-4:a-^,  Ib-^, 
and  1^2  (be.,  2o— 4o-  and  Itt) 

■  Virtual  (unoccupied):  5a-y-32a-^,  2b^~Y7b^, 
2b2~17b2,  \a2-7a2 

A  two-electron  full-CI  calculation,  with  the  doubly 
occupied  la^  orbital  taken  from  the  full-valence 
CASSCF  wave  function,  requires  fairly  trivial  com¬ 
puting  resources,  and  gives  a  clear  indication  of 
the  best  that  we  can  expect  to  achieve  in  frozen-core 
calculations  with  this  basis  set. 

Computed  equilibrium  geometries,  together 
with  the  corresponding  total  energies  and  dipole 
moments,  are  collected  in  Table  III.  Starting  from 
the  RFiF  wave  function,  the  single  largest  energy 
improvement  corresponds  to  the  radial  correlation 
introduced  by  opening  up  the  3^^  orbital  (i.e.,  3a), 
Almost  as  large  is  the  angular  correlation  intro¬ 
duced  by  opening  up  the  degenerate  Ib^  and  1^72 


TABLE  III _ 

RHF,  CASSCF,  and  full-CI  equilibrium  geometries  for  LIH. 


Active  orbitals 

Inactive  space 

ftg,  A 

E,  hartrees 

jLt,  Debyes 

(RHF) 

— 

1.606 

-7.987257 

6.02 

2a-|  j  3a-j 

(1a,)2 

1.634 

-8.003784 

5.74 

2ai,3ai,4ai 

da^)^ 

1.633 

-8.009548 

5.88 

2a^,  ^b^,^\b2 

{^ar 

1.589 

-8.001320 

5.98 

2ai,3a^,1bi,1b2 

(1a/ 

1.611 

-8.015914 

5.78 

2a-i,  3d-|,  4a-4|  ^b-t,  "ibo 

(la/ 

1.611 

-8.021112 

5.89 

(full  CD® 

(la,)2 

1.608 

-8.023861 

— 

®  Doubly  occupied  la^  core  orbital  taken  from  the  corresponding  full-valence  CASSCF. 
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orbitals  (i.e.,  I'tt).  The  full- valence  CASSCF  calcu¬ 
lation  gives  a  value  of  that  is  very  close  to  that 
from  the  two-electron  full  Cl,  and  the  minimum 
energies  are  within  3  millihartrees  of  one  another. 
This  full-CI  calculation  gives  an  equilibrium  bond 
length  of  1.6076  A,  and  all  subsequent  calculations 
were  carried  out  at  this  geometry.  The  experimen¬ 
tally  determined  value  of  is  1.595  A  [31]. 

Energies  from  various  four-electron  CASSCF 
calculations  are  collected  in  Table  IV,  for  a  nuclear 

o 

separation  of  1.6076  A.  Comparing  with  the  corre¬ 
sponding  two-electron  CASSCF  calculations  at  the 
same  geometry,  it  is  clear  that  opening  up  the  la^ 
core  orbital  results  in  a  fairly  dramatic  energy 
lowering.  There  are  of  course  further  energy  lower¬ 
ings  on  opening  up  also  the  5a-^,  and  2^2 

orbitals,  and  it  is  clear  that  the  all-electron  full-va¬ 
lence  CASSCF  wave  function  is  still  far  from  the 
corresponding  full  CL  Note  that  three  of  the 
CASSCF  calculations  have  been  labeled  A,  B,  or  C 
in  Table  IV  in  order  to  facilitate  the  subsequent 
discussion. 

As  we  have  indicated,  computationally  the  most 
inexpensive  use  of  the  CASVB  code  is  to  generate 
modern  VB  representations  of  CASSCF  wave  func¬ 
tions  by  means  of  the  overlap  criterion  [Eq.  (7)]. 
Applying  this  criterion  to  CASSCF  calculation  A 
leads  to  a  representation  in  which  the  total  wave 
function  is  overwhelmingly  dominated  by  a  spin- 


coupled-like  component  The  overlap 

between  this  dominant  component  and  the  full 
wave  function  (Syg,  see  Table  V)  is  very  close  to 
unity.  The  energy  Ey^  corresponding  to  the  spin- 
coupled-like  component  accounts  for  all  but  7.3 
microhartrees  of  the  total  CASSCF  energy.  This 
small  difference  could  of  course  be  reduced  if  the 
orbital  transformation  were  chosen  instead  using 
the  energy  criterion  [Eq.  (8)].  A  fully  variational 
calculation  for  the  spin-coupled  configuration 
provides  an  energy  within  7.3  micro¬ 
hartrees  of  that  provided  by  the  CASSCF  calcula¬ 
tion.  All  four  spin-coupled  orbitals  were  found  to 
be  of  pure  a  symmetry  (i.e.,  a^). 

For  calculation  B,  we  considered  the  three-con- 
figuration  modern  VB  wave  function 

<^3  04  X)  ^0jt®00;A:| 

+  C2cQ^(<^1<^2^6^6®pp) 

in  which  0pp  is  the  four-electron  perfect-pairing 
spin  function.  There  can,  of  course,  only  be  one 
mode  of  spin  coupling  for  the  configurations  with 
a  doubly  occupied  orbital  (^5  or  (p^X  on  account  of 
the  Pauli  principle.  Orbitals  were  con¬ 

strained  to  be  of  (7  symmetry  (i.e.,  and  orbitals 


TABLE  IV _ 

RHF  CASSCF,  and  full-CI  results  for  LiH  at  1 .6076  A. 


Active  orbitals 

Inactive  space 

E,  hartrees 

fji,  Debyes 

(RHF) 

— 

-7.987257 

6.03 

Two  active  electrons: 

2a^,3a^ 

(1ai)" 

-8.003710 

5.70 

2ai,3ai,4ai 

da/ 

-8.009480 

5.84 

2a^,  ^b^,  1^2 

(1ai)2 

-8.001275 

6.02 

2a^,3a-|,  ^b^,  1b2 

da/ 

-8.015913 

5.77 

2a-^,3a^  4a^  1^2 

da/ 

-8.021111 

5.88 

(full  CD® 

da/ 

-8.023861 

— 

Four  active  electrons: 

A  (1-4)ai 

— 

-8.016846 

5.71 

B  (^-4)a„^b„^b^ 

— 

-8.029067 

5.78 

(1-4)ai,1ibi,2bi,1b2.2t>2 

— 

-8.035753 

5.76 

(1-5)ai,1£)i,1£)2 

— 

-8.034309 

5.88 

(1-5)ai,1£)i,2t»i,1£»2.2ib2 

— 

-8.035003 

5.88 

C  (1-6)81,161,162 

— 

-8.038302 

5.88 

(1-6)81,161,261,162,262 

— 

-8.044885 

5.87 

®  Doubly  occupied  ^a^  core  orbital  taken  from  the  corresponding  full-valence  CASSCF. 
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TABLE  V _ 

Various  CASSCF  and  CASVB  results  for  LIH,  as  described  in  the  text,® 


CASSCF 

Overlap  criterion 

Variational  MCSC 

Ecas.  hartrees 

I 

CO 

< 

CD 

Eye,  hartrees 

AE,  /.thartrees 

^Mcsc  hartrees 

Mmcsc  Debyes 

A 

-8.016846 

5.3  X  10“^ 

-8.016839 

7.3 

-8.016839 

5.71 

B 

-8.029067 

2.3  X  10“® 

-8.029048 

19.2 

-8.029048 

5.78 

C 

-8.038302 

-8.033347 

5.86 

®The  labels  in  the  first  column  correspond  to  those  in  Table  IV. 


(^5  and  (f)^  to  be  a  TT  pair  (i.e.,  and  i»2)-  Using 
the  overlap  criterion  [Eq.  (7)],  we  may  generate  an 
alternative  representation  of  the  CASSCF  wave 
function  in  which  a  component  of  the  form  of  Eq. 
(32)  is  overwhelmingly  dominant.  The  optimal 
value  of  Syg  (see  Table  V)  is  remarkably  close  to 
unity,  and  the  corresponding  expectation  value  of 
the  energy  is  only  19.2  microhartrees  inferior  to 
that  from  the  full  CASSCF  calculation.  A  fully 
variational  optimization  of  the  multiconfiguration 
spin-coupled  (MCSC)  wave  function  leads  to 
an  energy  that  is  19.0  microhartrees  inferior  to 
CASSCF  B. 

In  the  case  of  CASSCF  calculation  C,  we  pro¬ 
ceeded  directly  to  the  fully  variational  optimiza¬ 
tion  of  a  four-configuration  MCSC  wave  function 
of  the  form 


^1^2  ^3^4  H  ^Qk®00‘,kj 

T  ^1^2  07^7®pp) 

+  C3J/(^(/>i(^2^8^8®pp)* 


Orbitals  were  constrained  to  be  of  sym¬ 

metry,  and  orbitals  (f)y  and  to  be  the  b-^  and  ^2 
pair.  In  order  to  maintain  some  distinction  be¬ 
tween  {(^102^3^4)  con¬ 

strained  (^5  and  05,  to  be  orthogonal  to  both  of  0^ 
and  02-  Such  a  calculation,  which  utilizes  numer¬ 
ous  features  of  our  constrained  optimization  pro¬ 
cedure,  converges  rapidly  to  a  total  energy  within 
about  5  millihartrees  (see  Table  V)  of  that  from  the 
full  CASSCF  wave  function. 

We  find  that  the  converged  orbitals  0i-04,  and 
the  overlaps  between  them  (see  Table  VI),  are  very 
similar  to  those  obtained  from  the  single-config¬ 
uration  spin-coupled  calculation  [Eq.  (31)]  and  the 
three-configuration  MCSC  calculation  [Eq.  (32)]. 
The  overlaps  of  03  and  04  with  05  and  05,  turn 
out  to  be  rather  small  (see  Table  VI),  and  this 
suggests  that  it  could  be  a  reasonable  approxima¬ 
tion  in  future  work  to  restrict  orbitals  05  and  05  to 
lie  in  the  orthogonal  complement  of  orbitals  0i-04. 
As  might  be  anticipated,  the  dominant  component 
of  all  three  modern  VB  wave  functions, 
and  is  the  combination  of  the  perfect-pairing 
spin  function  with  the  orbital  string  0;^  02  03  04. 

Multiconfiguration  spin-coupled  calculations  for 
LiH  were  first  performed  by  Pyper  and  Gerratt 
[32].  The  Pyper-Gerratt  (PG)  wave  function  for 


TABLE  VI _ 

Overlaps  between  the  nonorthogonal  orbitals  in  the  MCSC  wave  function  [Eq,  (33)], 


01  02  03  04  05  ^6  07 
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this  molecule  takes  the  form 

%G  =  Ci{  0-10-2  0-304}  +  C2  X)  (34) 

a  =  A:,  y 

The  orbitals  were  fully  optimized  according  to 
their  designated  symmetry  properties  simultane¬ 
ously  with  the  Cl  coefficients,  but  the  mode  of  spin 
coupling  was  restricted  to  perfect  pairing.  It  should 
be  clear  that  the  appropriate  all-electron  CASSCF 
wave  function  with  which  to  compare  is  the  one 
with  an  active  space  consisting  of  (l-2)b^, 

and  (l-2)i'2-  According  to  the  energies  listed  in 
Table  IV,  the  energy  improvement  of  the  PG  wave 
function  relative  to  the  spin-coupled  configuration 
should  be  considerable.  Penotti  [33]  has  recently 
developed  a  full  generalization  of  the  PG  method 
in  which  the  orbitals,  spin-coupling  coefficients, 
and  structure  coefficients  are  fully  optimized  si¬ 
multaneously.  The  computational  demands  turn 
out  to  be  very  much  larger  than  those  of  the 
approach  we  have  presented  here.  Penotti  has  also 
considered  simultaneous  optimization  of  the  basis 
set  [33]. 

An  attractive  alternative  to  the  exact  optimiza¬ 
tion  of  MCSC  wave  functions  has  recently  been 
explored  by  Clarke  et  al.  [34],  who  examined  a 
four-configuration  description  of  the  form 

^SCVB*  “ 

+  C3  E  (35) 

a==JC,  y 

In  the  spirit  of  a  spin-coupled  VB  expansion  [29], 
orbitals  were  taken  from  a  standard  spin- 

coupled  calculation,  without  further  optimization. 
In  order  to  overcome  convergence  problems,  spin- 
coupled  orbitals  a'2-cr4  were  projected  out  of  the 

basis  functions,  and  Gram-Schmidt  orthogo- 
nalization  was  used  to  extract  a  set  of  -  4 
linearly  independent  functions.  Orbital  0-5  was 
then  expressed  as  a  linear  combination  of  cr^,  0-2, 
0-4,  and  the  linearly  independent  projected  func¬ 
tions.  Similarly,  orbital  cr^  was  expressed  as  a 
linear  combination  of  cr^,  0*2,  0-3,  and  the  linearly 
independent  projected  functions.  In  this  form,  or¬ 
bitals  0-5  and  cr^  were  optimized  simultaneously 
with  77^  and  (a  =  x,  y)  and  the  coefficients, 
but  the  spin-coupling  coefficients  in  each  configu¬ 
ration  were  fixed  to  be  the  same  as  those  in  the 
spin-coupled  wave  function  {(7^0-2(73  0-4).  The  main 
simplifications,  which  should  make  the  approach 
viable  also  for  systems  with  more  active  electrons. 


are  the  use  of  a  second-order  perturbation  theory 
expression  for  the  energy  and  an  approximation  to 
the  required  elements  of  the  Hessian.  Of  course, 
once  the  orbitals  have  been  (approximately)  opti¬ 
mized  in  this  way,  they  may  be  used  to  construct  a 
variational  spin-coupled  VB  wave  function,  relax¬ 
ing  also  the  constraints  on  the  spin-coupling  coeffi¬ 
cients,  but  without  further  orbital  optimization. 
The  various  CASSCF  and  fully  variational  MCSC 
calculations  described  in  the  present  work  should 
prove  to  be  useful  benchmarks  for  assessing  the 
accuracy  and  computational  demands  of  this  type 
of  approach,  and  further  results  will  be  published 
in  due  course.  The  most  appropriate  all-electron 
CASSCF  wave  function  with  which  to  compare 
^scvB^  is,  of  course,  the  one  with  an  active  space 
consisting  of  and  (l-2)b2- 


Summary  and  Conclusions 

We  have  outlined  the  CASVB  approach  for  the 
fully  variational  optimization  of  general  types  of 
modern  valence  bond  wave  function,  and  we  have 
presented  a  simple  algorithm  for  the  elimination  of 
redundant  or  constrained  parameters  from  the  ap¬ 
propriate  second-order  optimization  problems.  The 
CASVB  code  may  also  be  used  to  generate  very 
compact  modern  VB  representations  of  CASSCF 
wave  functions,  with  the  overlap  criterion  [Eq.  (7)] 
being  relatively  inexpensive. 

Various  ways  in  which  the  CASVB  procedures 
may  be  used  were  illustrated  by  means  of  selected 
results  for  benzene,  the  allyl  radical,  and  LiH, 
including  the  incorporation  of  symmetry  condi¬ 
tions  and  the  generation  of  fully  variational  multi¬ 
configuration  spin-coupled  (MCSC)  wave  func¬ 
tions.  We  have  also  established  useful  benchmarks 
for  assessing  the  accuracy  and  computational  de¬ 
mands  of  various  other  MCSC  approaches  that  are 
currently  being  pursued  [33,  34]. 

Since  the  modern  VB  wave  function  in  the 
CASVB  approach  can  be  viewed  as  a  constrained 
form  of  CASSCF  wave  function,  a  multitude  of 
techniques  immediately  become  available  for 
modern  VB  methods,  including  the  incorporation 
of  dynamical  correlation  and  the  direct,  on-the-fly 
evaluation  of  integrals  over  basis  functions.  In  the 
case  of  the  implementation  of  CASVB  in  molpro 
[6],  the  CASSCF  part  of  the  code  provides  for  the 
efficient  evaluation  of  energy  gradients,  enabling, 
for  example,  geometry  optimizations  with  fully 
variational,  core-optimized  MCSC  wave  functions. 
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ABSTRACT:  We  have  implemented  a  linear  scaling,  fully  self-consistent  density- 
functional  method  for  performing  first-principles  calculations  on  systems  with  a  large 
number  of  atoms,  using  standard  norm-conserving  pseudopotentials  and  flexible  linear 
combinations  of  atomic  orbitals  (LCAO)  basis  sets.  Exchange  and  correlation  are  treated 
within  the  local-spin-density  or  gradient-corrected  approximations.  The  basis  functions 
and  the  electron  density  are  projected  on  a  real-space  grid  in  order  to  calculate  the 
Hartree  and  exchange-correlation  potentials  and  matrix  elements.  We  substitute  the 
customary  diagonalization  procedure  by  the  minimization  of  a  modified  energy 
functional,  which  gives  orthogonal  wave  functions  and  the  same  energy  and  density  as 
the  Kohn-Sham  energy  functional,  without  the  need  of  an  explicit  orthogonalization.  The 
additional  restriction  to  a  finite  range  for  the  electron  wave  functions  allows  the 
computational  effort  (time  and  memory)  to  increase  only  linearly  with  the  size  of  the 
system.  Forces  and  stresses  are  also  calculated  efficiently  and  accurately,  allowing 
structural  relaxation  and  molecular  dynamics  simulations.  We  present  test  calculations 
beginning  with  small  molecules  and  ending  with  a  piece  of  DNA.  Using  double-z, 
polarized  bases,  geometries  within  1%  of  experiments  are  obtained.  ©  1997  John  Wiley  & 
Sons,  Inc.  Int  J  Quant  Chem  65:  453-461,  1997 


Introduction 

The  ability  to  perform  reliable  simulations  of 
molecules  and  solid  systems  with  thousands 
of  atoms  has  enormous  payoffs  in  many  fields, 
especially  for  the  materials  and  pharmaceutical 
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industries.  With  the  rapid  increase  of  computa¬ 
tional  power,  the  emphasis  is  changing  from 
semiempirical  potentials  and  force  fields  to  ab 
initio  quantum  mechanical  methods  [1].  However, 
a  major  problem  of  these  methods  is  that,  in  gen¬ 
eral,  their  computational  cost  scales  as  N^,  with 
the  number  of  atoms  N.  In  the  last  few  years, 
much  effort  has  been  devoted  to  overcome  this 
problem,  and  a  number  of  methods  have  been 
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developed  with  "order-N"'  [0(N)]  scaling,  i.e., 
whose  computational  cost  scales  only  linearly  with 
the  number  of  atoms  [2-7].  Most  of  these  efforts 
and  methods  are  based  on  simplified,  non-self- 
consistent,  tight-binding  or  Harris-functional  mod¬ 
els,  although  the  first  steps  toward  self-consistent 
methods  have  also  been  made,  using  Gaussian 
orbitals  [8]  or  a  real-space  grid  to  define  the  elec¬ 
tron  wave  functions  [9].  These  last  methods  are 
very  demanding  in  computer  time  and  memory, 
requiring  large  computers  and  becoming  competi¬ 
tive  only  for  extremely  large  systems. 

We  have  developed  a  fully  self-consistent  den¬ 
sity-functional  method,  based  on  flexible  linear 
combination  of  atomic  orbitals  (LCAO)  basis  sets, 
with  actual  and  efficient  0(N)  scaling.  It  allows 
from  extremely  fast  simulations,  on  the  one  hand, 
using  minimal  basis  sets,  to  very  accurate  calcula¬ 
tions,  on  the  other  hand,  using  expanded  bases 
(multiple-z,  polarization,  off-site  orbitals),  the 
choice  depending  on  the  size  of  the  system,  re¬ 
quired  accuracy,  and  available  computational 
power.  In  previous  studies  [10,  11]  we  have  de¬ 
scribed  preliminary  versions  of  this  method,  where 
the  essentials  of  the  linear  scaling  were  already  put 
forward  (see  Fig.  2  in  Ref.  [10]).  The  main  im¬ 
provements  that  we  describe  in  this  work  are:  (i) 
the  flexibilization  of  the  basis  sets  to  include  essen¬ 
tially  any  desired  functions  and  (ii)  the  imple¬ 
mentation  of  new  features,  like  spin  polarization, 
generalized  gradient  corrections  (GGA)  ex¬ 
change-correlation,  or  the  calculation  of  the  stress 
tensor. 


Basic  Approximations 

Apart  from  that  of  Born  and  Oppenheimer,  the 
most  basic  approximations  used  here  concern  the 
treatment  of  exchange  and  correlation,  and  the 
elimination  of  core  electrons,  which  are  replaced 
by  pseudopotentials.  Exchange-correlation  is 
treated  within  the  framework  of  the  Kohn-Sham 
[12]  version  of  density-functional  theory.  We  allow 
for  both  the  local  (spin)  density  approximation 
(LDA/  LSD)  [13],  with  several  possible  parametri- 
zations,  or  the  generalized-gradient-corrections 
(GGA)  approximation,  in  the  fully  ab  initio  (non- 
empirical)  version  of  Perdew,  Burke,  and  Ernzer- 
hof  [14]. 

Since  we  calculate  on  a  real-space  grid  (see 
below),  we  use  standard  norm-conserving  pseu¬ 


dopotentials  [15]  to  avoid  the  computations  for 
core  electrons  and  to  smooth  the  valence  charge 
density.  These  pseudopotentials  are  transformed 
to  their  Kleinman-Bylander  form  [16]  for  im¬ 
proved  simplicity  and  efficiency.  This  implies  that 
only  two-center  integrals  are  required  to  calculate 
the  nonlocal  part  of  the  pseudopotential  matrix 
elements.  The  nonlinear  partial-core  correction,  for 
the  treatment  of  exchange  and  correlation  in  the 
core  region  [17],  is  also  allowed  to  treat  special 
cases,  like  alkaline  atoms. 


Basis  Sets 

In  order  to  have  a  finite  range  for  the  matrix 
elements,  one  can  either  neglect  them  beyond  a 
given  distance  or  use  confined  basis  orbitals,  i.e., 
orbitals  that  are  strictly  zero  beyond  a  certain 
radius  [18].  We  have  adopted  this  last  approach 
because  it  keeps  the  energy  strictly  variational, 
thus  facilitating  the  test  of  the  convergence  with 
respect  to  the  radius  confinement. 

The  algorithms  used  in  our  method  (explained 
in  the  following)  perform  all  the  required  calcula¬ 
tions  numerically.  No  analytical  expression  of  any 
kind  is  needed.  We  are,  therefore,  not  bound  to 
Gaussians,  and  adjusting  the  radial  wave  function 
to  the  core  region  (pseudopotential)  or  to  other 
features  is  easy  and  inexpensive.  Multiple-z  bases 
can  be  introduced,  and  also  polarization  and  off¬ 
site  orbitals  (orbitals  centered  anywhere).  Some 
procedures  are  implemented  for  the  generation  of 
the  basis  starting  from  the  solution  of  the  separate 
atomic  problems  with  the  corresponding  pseu¬ 
dopotentials,  but  the  user  can  introduce  his/her 
favorite  basis,  too. 

In  the  case  of  a  minimal  basis  set,  we  have 
found  convenient  and  efficient  the  method  of 
Sankey  and  Niklewski  [18].  Their  basis  functions 
are  the  (angular-momentum-dependent)  numerical 
eigenfunctions  of  the  atomic  pseudopotential,  for 
an  energy  chosen  so  that  the  first  node  occurs  at 
the  desired  cutoff  radius  (which  may  depend  on 
the  atomic  species  and  angular  momentum).  Be¬ 
yond  that  radius,  the  basis  functions  are  zero.  The 
effect  of  the  confinement  is  shown  below. 

One  obvious  possibility  for  multiple-z  bases  is 
obtained  by  using  the  set  of  pseudopotential 
eigenfunctions  with  an  increasing  number  of  nodes. 
This  set  has  the  virtue  of  being  asymptotically 
complete.  The  efficiency,  however,  of  this  kind  of 
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bases  depends  on  the  radii  of  confinement  of  the 
different  orbitals,  since  the  excited  states  of  the 
pseudopotential  problem  are  usually  unbound.  In 
the  following  some  examples  are  shown  of  the 
results  on  molecular  geometries  when  using  this 
kind  of  bases,  changing  the  confinement  radii  vari- 
ationally. 

Split  valence  bases  are  also  easily  generated  by 
incorporating  suited  Gaussian  orbitals  [19]  as  extra 
basis  functions,  on  top  of  the  minimal  basis  de¬ 
fined  above.  In  general,  we  find  that  double-z  plus 
single-z-polarization  bases  reproduce  the  molecu¬ 
lar  geometries  obtained  with  a  planewave  basis  set 
within  1%  in  the  cases  that  we  have  studied. 
Nevertheless,  it  is  well  known  that  the  election  of 
good  basis  sets  is  a  delicate  task,  and  lots  of  efforts 
have  been  devoted  to  it.  It  must  be  considered, 
however,  that  the  use  of  pseudopotentials  and 
numerical  (but  confined)  functions  makes  previous 
experience  on  basis  sets  not  directly  applicable. 


Matrix  Elements 

Within  the  nonlocal-pseudopotential  approxi¬ 
mation,  the  standard  Kohn-Sham  Hamiltonian 
may  be  written  as 

H=T  +  E  +  y,"']  +  y"(r)  + 

I 

(1) 

where  I  is  an  atom  index,  Im  are  angular  momen¬ 
tum  quantum  numbers,  T  =  p^/2  m  is  the  kinetic 
energy  operator,  y^(r)  and  are  the  Hartree 

and  exchange-correlation  potentials,  and 

=  E\4>u>n^<&  (2) 

Im 

is  the  nonlocal  (nl)  part  of  the  pseudopotential,  in 
its  Kleinman-Bylander  (KB)  form.  are  the 

so-called  Kleinman-Bylander  projectors,  which 
obey 

=  [ V;,(r)  -  (3) 

(4) 

where  iA//w(r)  are  the  atomic  pseudoorbitals,  i.e., 
the  solutions  to  the  radial  Schrodinger  equation 
with  potentials  V^i(r)  (without  any  cutoff  radius). 
Since  ~  is  nonzero  only  inside  a  small 


pseudopotential  core  radius  the  functions 

are  also  zero  outside  this  radius. 

In  order  to  eliminate  the  long  range  of 
we  screen  it  with  a  sum  p^^®”^(r)  of  atomic  electron 
densities  pf^'^dr  -  r^l),  constructed  by  populating 
the  basis  functions  with  appropriate  valence  atomic 
charges.  Notice  that,  since  the  atomic  basis  orbitals 
are  zero  beyond  r^,  the  screened  "neutral-atom" 
potentials  are  also  zero  beyond  this  radius 
[18].  Let  now  8p(r)  be  the  difference  between  the 
self-consistent  electron  density  p(r)  and  the  sum  of 
atomic  densities  p^*'^^(r),  and  let  8V^(r)  be  the 
Hartree  potential  generated  by  5p(r).  Then  the 
total  Hamiltonian  may  be  rewritten  as 

H  =  T  +  +  ^V^(r)  +  y  ^Hr), 

I  I 

(5) 

The  matrix  elements  of  the  first  two  terms  involve 
only  two-center  integrals  which  are  easily  calcu¬ 
lated  in  reciprocal  space  and  tabulated  as  a  func¬ 
tion  of  interatomic  distance  [18].  This  is  done  at 
the  beginning  of  the  calculation  with  great  accu¬ 
racy,  representing  a  very  small  computational  ef¬ 
fort. 

The  remaining  terms  involve  potentials  which 
are  calculated  in  a  real-space  grid.  The  third  term 
is  a  sum  of  short-range  screened  pseudopotentials 
which  are  tabulated  as  a  function  of  the  distance  to 
the  atoms  and  easily  interpolated  at  any  desired 
grid  point.  The  last  two  terms  require  the  calcula¬ 
tion  of  the  electron  density  at  the  grid.  To  do  this, 
we  first  find  the  atomic  basis  orbitals  at  the  grid 
points,  also  by  interpolation  from  numerical  tables. 
Then  the  electron  density  is  given  by 

p(r)  =  (6) 

where  p,  v  run  over  all  the  atomic  basis  orbitals 
(f)^,  and  p^^  is  the  one-electron  density  matrix  in 
this  basis,  obtained  as  explained  below.  Since  the 
method  is  aimed  at  molecules  and  large  solid 
supercells,  Brillouin  zone  integrals  for  repeated 
structures  are  approximated  by  the  T  point  (/c  =  0), 
so  that  all  the  wave  functions  can  be  made  real. 
Notice  that  only  overlapping  orbital  pairs  con¬ 
tribute  to  the  sum  at  their  intersection,  so  that  the 
calculation  of  the  density  can  be  performed  in 
0(N)  operations  using  sparse-matrix  multiplica¬ 
tion  techniques.  Once  the  density  is  available  in 
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the  grid,  we  solve  Poisson's  equation  with  a  fast 
Fourier  transform  to  obtain  8V^(t),  adding  it  to 
y^^(r)  and  The  matrix  elements  of  these 

three  terms  are  then  computed  by  integration  in 
the  same  real  space  grid,  also  in  0(N)  operations. 


Total  Energy  and  Forces 


overlap  [20],  the  last  term  in  (9)  may  be  written  as 

I  E  r-^  =  ^  E  f  d^T 

I  ij  |rj  tjl  ^ 

(11) 

I 

where 


Let  us  forget  for  the  moment  that  we  want  an 
order-N  method.  Then,  once  the  Kohn-Sham 
Hamiltonian  has  been  obtained,  we  can  solve  the 
eigenvalue  problem  to  find  the  electron  eigenfunc¬ 
tions  in  the  LCAO  basis 


Uj'ocal  =  |v/ocaI(^)plocal(^)4^^2^^  Q2) 

Taking  into  account  that  -I-  Eq. 

(9)  can  be  transformed,  after  some  manipulations, 
into 


\4>i)  =  Ec,>l0^>  (7) 

and  the  density  matrix 

(8) 

i 


\ 

+  T  E  f  vr  (r) pr(r)  dh-Y.  (13) 

^  1]  I 

+  Spit)  dh 


where  Wi  are  occupation  weights  which,  at  T  =  0, 
will  be  either  zero  or  two  (spin  polarization  is  not 
considered  in  this  discussion  for  simplicity).  The 
electron  density  can  be  obtained  from  Eq.  (6)  and 
the  Kohn-Sham  total  energy  can  then  be  written 
as 


-  2  / V^(r)p(x)  dh 

fXV 

+  J(€“(r)  -  y’“^(r))p(r)  d^t 


1 

2  p  lr;-r;l 


(9) 


where  Zj  are  the  valence  pseudoatom  charges  and 
e^^(r)p(r)  is  the  exchange-correlation  energy  den¬ 
sity.  In  order  to  avoid  the  long-term  interactions  of 
the  last  term,  it  is  convenient  to  define  the  'Tocal- 
pseudopotential  charge  density"  as  that 

which  produces  an  electrostatic  potential  equal  to 

local. 


1 

plocal(j)  ^ - y2vlocal(j.)  Qq) 

4:776 

This  positive  charge  density  is  confined  within 
and  produces  a  potential  eZj/r  outside. 
Therefore,  assuming  that  the  core  radian  do  not 


1  /• 

+  —  j  5y^(r)  8p{t)  (Px  +  j  e^^(r)p(r)  d^x. 

(14) 

The  first  two  terms  involve  two-center  integrals 
which  are  interpolated  from  initially  calculated 
tables,  as  described  previously,  and  the  third  term 
is  found  trivially  from  (12).  The  last  three  terms 
are  calculated  using  the  real  space  grid. 

Apart  from  getting  rid  of  all  long-range  poten¬ 
tials  [except  that  implicit  in  51/^(r)],  the  advan¬ 
tage  of  (13)  is  that,  except  for  the  relatively  slowly 
varying  exchange-correlation  energy  density,  the 
grid  integrals  involve  5p(r),  which  is  generally 
much  smaller  than  p(r).  Thus,  the  errors  associated 
with  the  finite  grid  spacing  are  drastically  re¬ 
duced.  Also  critical  is  that  the  kinetic  energy  ma¬ 
trix  elements  can  be  calculated  almost  exactly, 
without  any  grid  integrations.  Figure  1  presents 
the  convergence  of  the  total  energy  for  a  diamond 
supercell  of  64  atoms  as  a  function  of  the  plane- 
wave  cutoff  associated  to  the  grid  [  =  (tt/ 

aY,  with  a  being  the  separation  between  grid  points 
in  atomic  units],  used  to  represent  the  electron 
density  [21].  This  convergence  is  considerably  faster 
than  that  typical  of  planewave  calculations,  which 
require  a  larger  cutoff  to  represent  accurately  the 
kinetic  energy. 

Atomic  forces  and  stresses  are  obtained  by  di¬ 
rect  derivation  of  (13)  with  respect  to  atomic  posi- 
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Ecut  (Ry) 

FIGURE  1.  Convergence  of  the  total  energy  versus 
grid  cutoff  for  a  supercell  of  diamond  with  64  C  atoms. 


tions  and  obtained  simultaneously  with  Com¬ 

plete  formulas  will  be  given  elsewhere  and  here 
we  provide  only  a  brief  sketch.  Thus,  dx^  can 
be  obtained  by  straightforward  derivation  of  the 
interpolating  functions  which  represent  and 

- f  €^^(r)p(r)  d^r  =  fv^^(r) - d^r 

dXj^  '  dx^ 

IXiV 

(15) 


Order-iV  Functional 

Once  the  Kohn-Sham  Hamiltonian  matrix  in 
the  LCAO  basis  has  been  computed  for  a  given 
charge  density,  we  seek  to  solve  it  in  order-N 
operations.  For  that  purpose,  we  must  avoid  the 
scaling  of  the  eigenvalue  problem,  which  ap¬ 
pears  if  either  direct  diagonalization  or  standard 
iterative  solution  are  performed  (in  the  latter  case 
because  of  the  explicit  orthogonalization  step  nec¬ 
essary  during  the  energy  minimization).  To  avoid 
this  problem,  several  methods  have  been  proposed 
in  the  last  years.  Here  we  use  the  approach  pro¬ 
posed  by  Mauri  et  al.  [3]  and  by  Ordejon  et  al.  [4, 
5],  although  any  other  linear  scaling  procedure 
could  in  principle  be  used.  The  method  of  Mauri 
and  Ordejon  contains  two  basic  ingredients.  The 
first  is  the  calculation  of  the  wave  functions 
describing  the  ground  state  of  the  system  by  the 
minimization  of  a  modified  energy  functional, 
which  does  not  require  an  explicit  orthogonaliza¬ 
tion  step,  but  for  which  the  wave  functions  be¬ 
come  automatically  orthonormal  at  the  minimum. 
More  generally,  we  use  a  modification  of  the 
Mauri-Ordejon  functional,  which  was  proposed 
by  Kim  it  et  al.  [6].  This  functional  takes  the  form: 

M 

E  =  2Z(Hii-  vSii)(28ii  -  S,p  +  tjN,  (17) 


where  index  /Xj  runs  only  through  the  basis  or¬ 
bitals  of  atom  1.  Similar  expressions  are  obtained 
for  the  derivatives  of  the  other  integrals  in  (13). 
Except  for  the  gradient  in  front  of  the  matrix 
elements  involved  in  the  contribution  to  the  forces 
of  the  last  three  terms  of  (13)  are  the  same  as  those 
of  the  Hamiltonian,  and  are  calculated  by  the  same 
techniques.  The  calculation  of  the  forces  as  deriva¬ 
tives  of  the  energy  assures  that  Pulay  corrections 
are  automatically  included.  The  stress  tensor  can 
also  be  calculated  simultaneously  with  the  forces, 
since  it  involves  the  product  of  each  force  contri¬ 
bution  times  a  distance.  For  example. 


dT 


/XjVj 


d€ 


xy 


dT 


(16) 


where  6^^  is  the  xy  component  of  the  strain  tensor 
and  Xij  and  are  the  x  and  y  components  of 
the  vector  between  atoms  I  and  /. 


where  the  two  in  front  of  the  sum  is  for  spins,  N  is 
the  number  of  electrons,  and  Hij  =  (ipilHlipj), 

=  )  ^xe  the  Hamiltonian  and  overlap  ele¬ 

ments  between  the  wave  functions.  The  sum  is 
done  over  a  set  of  functions  equal  or  larger  than 
the  number  of  occupied  states  (N/2),  and  the  total 
charge  is  controlled  by  the  Fermi  level  parameter 
7].  The  minimization  of  this  functional  with  respect 
to  variations  of  the  wave  functions  if/^  (in  terms  of 
their  expansion  in  the  LCAO  basis)  provides  the 
exact  ground  state  of  the  given  H  (in  the  space 
spanned  by  the  basis  set)  and  orthonormal  wave 
functions  [6]. 

The  second  ingredient  of  the  order-N  solution  is 
the  restriction  of  the  minimization  of  the  func¬ 
tional  in  Eq.  (17)  to  wave  functions  which  are 
localized  in  space.  These  are  Wannier-like  wave 
functions  centered  on  different  positions  of  the 
system,  which  are  truncated  beyond  a  given  cutoff 
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radius  (different  from  the  cutoff  of  the  LCAO 
basis  orbitals  r^).  This  is  a  variational  parame¬ 
ter  which  controls  the  accuracy  of  the  calculation, 
and  it  is  the  only  approximation  involved  in  the 
0(N)  solution  of  the  Hamiltonian.  A  detailed  de¬ 
scription  of  the  accuracy  of  that  truncation  can  be 
found  in  Ref.  [5]. 

Once  the  wave  functions  solution  of  H  have 
been  obtained  with  the  order-N  functional,  a  new 
Kohn-Sham  Hamiltonian  is  built  from  the  new 
charge  density,  and  the  procedure  is  iterated  until 
self-consistency  is  achieved.  At  this  point,  the  en¬ 
ergy  and  atomic  forces  are  evaluated  by  means  of 
the  expressions  given  in  the  previous  section. 


Accuracy  and  Efficiency  Tests 

SMALL  MOLECULES 

Several  tests  on  the  performance  of  the  approxi¬ 
mations  used  in  our  method  are  presented  here, 
especially  for  the  bases  and  their  confinement. 
Table  I  shows  the  results  for  the  interatomic  dis¬ 
tance  in  diatomic  molecules  as  compared  with 
experiment  and  with  other  LDA  calculations  per¬ 
formed  with  converged  augmented  plane  waves. 
They  are  all  calculated  within  LDA.  Several  bases 
are  compared.  The  single-z  basis  is  as  explained 
above,  using  a  sufficiently  long  radius  of  confine¬ 
ment.  Its  performance  is  improved  by  scaling  the 
radial  dependence  with  appropriate  scale  factors, 
which  are  obtained  variationally.  Double-z  bases 
are  used  in  the  split-valence  scheme,  and  the  effect 
of  confinement  on  them  is  also  shown. 


Table  II  displays  results  of  polarized  bases.  The 
accuracy  obtained  is  within  1%  of  the  experimen¬ 
tal  numbers  (except  for  hydrogen,  since  its  small 
size  makes  the  percentual  error  larger).  This  shows 
the  high  quality  attainable  with  short-range  basis 
functions.  The  transferability  of  the  previous  bases 
is  tried  on  CO  and  NO  (within  Table  I)  and  also 
for  HCN  (Table  III),  which  also  performs  very  well 
with  double-z  bases  (both  split-valence  and  short- 
range). 

More  delicate  is  the  description  of  subtle  bond¬ 
ings  like  the  hydrogen  bonds.  A  couple  of  water 
molecules  is  studied,  and  the  results  for  the  geom¬ 
etry  are  compared  with  experiments  and  with  other 
calculations.  Table  IV  displays  the  results  using 
LDA  exchange-correlation.  Table  V  does  the  same 
for  the  results  of  GGA  calculations.  It  can  be  con¬ 
cluded  that:  (i)  with  a  double-z,  polarized  basis  the 
results  obtained  for  the  geometry  are  perfectly 
comparable  in  accuracy  with  converged  plane 
waves  results,  (ii)  a  good  comparison  with  experi¬ 
ment  requires  the  use  of  GGA,  and  (hi)  the  bind¬ 
ing  energies  are  in  general  difficult  to  describe 
with  our  methods,  requiring  both  long  and 
ample  bases. 

DNA 

We  present  here  first  ab  initio  results  for  the 
structure  of  deoxyribonucleic  acid  (DNA).  It  is 
performed  on  a  turn  of  the  double  helix  with  ten 
guanine-cytosine  base  pairs,  which  is  repeated  pe¬ 
riodically  making  an  infinite  structure  with  a  unit 
cell  containing  650  atoms.  It  is  relaxed  starting 


TABLE  I  ^ _ 

Bond  lengths  (in  A)  calculated  for  some  diatomic  molecules  using  different  bases.^ 


Experiment  [22] 

APW  [22] 

SZ 

SZ(scaled) 

SZ(SR) 

DZ(SV) 

DZ(SR) 

Ha 

0.741 

0.767 

0.928 

0.772 

0.789 

0.773 

0.727 

Ca 

1.244 

1.249 

1.425 

1.348 

1.332 

1.279 

1.262 

Na 

1.098 

1.098 

1.283 

1.255 

1.194 

1.123 

1.125 

02(LDA) 

1.207 

1.212 

1.487 

1.445 

1.296 

1.366 

1.259 

OjdSD) 

1.207 

— 

1.406 

1.388 

1.284 

1.293 

1.241 

CO 

1.127 

1.132 

1.309 

1.276 

1.206 

1.172 

1.156 

NO 

1.148 

1.148 

1.409 

1.376 

1.252 

1.233 

1.234 

^  SZ:  single-z  pseudoatomic  orbitals  with  large  cutoff  radius  (r^  =  6.2  a.u.);  SZ(SR)  same  with  short  (3.3  a.u.  for  H,  4.1  a.u.  for  C, 
3.6  a.u.  for  N,  and  3.2  a.u.  for  O);  SZ  (scaled)  scales  the  SZ  bases  with  scale  factors  determined  variationally:  0.950  for  C,  0.975  for 
N,  0.980  for  0,  and  0.800  for  H.  For  CO  and  NO  the  scaling  factors  are  found  to  be  the  same  as  for  the  dimer  of  each  element; 
DZ(SV):  split-valence  double-z  for  long  orbitals;  DZ(SR):  short  double-z  bases. 
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TABLE  II 


Calculated  bond  lengths  (in  A)  for  some 
diatomic  molecules  adding  polarization  orbitals 
to  double-z  (DZP)  short  range  (SR)  basis  sets.^ 


Exp. 

100Ry 

DZP(SR) 

200  Ry 
DZP(SR) 

H2 

0.741 

0.729  (1.49%) 

0.726  (2.01%) 

C2 

1.244 

1 .253  (0.74%) 

1 .250  (0.46%) 

N2 

1.098 

1.103  (0.45%) 

1 .092  (0.55%) 

OadDA) 

1.207 

1.215  (0.66%) 

1 .206  (0.08%) 

02(LSD) 

1.207 

1 .203  (0.32%) 

1.197  (0.83%) 

CO 

1.127 

1.117  (0.89%) 

1.118  (0.75%) 

NO 

1.148 

1.144  (0.31%) 

1.142  (0.55%) 

for  the  polarization  orbitals  minimize  the  total  energy.  In 
parenthesis,  the  relative  error  with  respect  to  experiment. 


from  the  experimental  geometry  [29],  which  was 
obtained  from  X-ray  diffraction  experiments  on 
microcrystalline  fibers,  refined  with  a  least-squares 
method  using  '"standard"  bond  lengths. 


TABLE  III _ 

Bond  lengths  (in  A)  for  HCN  using  different 
basis  sets. 


Exp. 

SZ(SR) 

DZ(SR) 

DZ(SV) 

d(HC) 

1.066 

1.136 

1.070 

1.083 

d(CN) 

1.153 

1.211 

1.159 

1.174 

This  calculation  is,  to  our  knowledge,  the  first 
self-consistent  ab  initio  calculation  of  such  a  struc¬ 
ture  [30]  and  represents  quite  a  challenge  because 
of  both  the  complexity  and  the  importance  of  the 
system.  In  a  first  step  presented  here,  the  simplest 
scheme  has  been  used,  namely,  LDA  for  exchange 
and  correlation,  minimal  basis  set,  and  quite  short 
confinement  radii  [30]  (3.3  a.u.  for  H,  4.1  a.u.  for  C, 
3.6  a.u.  for  N,  3.2  a.u.  for  O,  and  4.7  a.u.  for  P).  The 
grid  cutoff  was  113  Ry.  The  approximations  are 
substantial  but  constitute  a  beginning  toward  finer 
calculations  (higher  cutoff,  double-z,  somewhat 
longer  r/s,  GGA  instead  of  LDA). 


TABLE  IV _ 

LDA  results  for  the  H2O  dimer  using  different  basis  sets.^ 


SZ(SR) 

DZ(SR) 

DZP(SR) 

SZ 

DZ 

DZP 

PLW-VWN  [23] 

LCAO-VWN  [24] 

Exp.  [23] 

f(O-O) 

2.628 

2.575 

2.687 

2.723 

2.590 

2.747 

2.70 

2.71 

2.98  ±  0.01 

f(03-H) 

1.012 

0.971 

0.976 

1.049 

0.999 

0.986 

0.961 

0.980 

— 

«^a 

105.6 

110.1 

106.1 

108.7 

112.2 

106.7 

106.2 

104.7 

— 

'•(0<,-H,) 

1.042 

0.999 

0.988 

1.069 

1.026 

1.002 

0.980 

0.997 

— 

f(0^-H2) 

1.007 

0.972 

0.971 

1.041 

1.000 

0.981 

0.961 

0.977 

— 

<l>d 

103.0 

106.9 

105.6 

107.5 

113.3 

107.3 

106.1 

105.4 

Od 

1.2 

0.4 

6.3 

6.7 

7.1 

7.5 

4.84 

9.0 

6  +  20 

BE 

19.39 

20.39 

19.37 

14.21 

15.54 

11.01 

9.06 

9.16 

5.44  ±  0.7 

®For  notation  see  Ref.  [23].  Lengths  in  A  and  angles  in  degrees.  PLW:  plane-wave  calculation.  VWN:  Vosko,  Wilk,  and  Nusair  [25] 
functional  for  correlation.  BE:  Binding  energy  (kcal/mol). 


TABLE  V _ 

GGA^"^  results  for  the  H2O  dimer  using  different  basis  sets. 


SZ(SR) 

DZ(SR) 

DZP(SR) 

SZ 

DZ 

DZP 

PLW-B  [23] 

LCAO-B  [24] 

LCAO-PW  [24]  MP2  [28] 

r(O-O) 

2.663 

2.641 

2.715 

2.837 

2.752 

2.902 

2.98 

2.886 

2.877 

2.911 

r(Oa-H) 

1.013 

0.967 

0.971 

1.051 

0.996 

0.981 

— 

0.979 

0.981 

0.957 

105.2 

109.2 

105.8 

107.2 

111.2 

106.2 

— 

106.2 

104.4 

— 

r(0^-Hi) 

1.039 

0.989 

0.987 

1.066 

1.015 

0.988 

— 

0.990 

0.990 

0.964 

r(0^-H2) 

1.008 

0.967 

0.967 

1.045 

0.997 

0.980 

— 

0.977 

0.979 

— 

•t>d 

103.2 

106.6 

105.3 

107.1 

110.8 

104.7 

106.2 

106.0 

— 

^d 

-0.8 

0.1 

4.9 

6.9 

2.6 

4.7 

— 

7.0 

15.1 

4.5 

BE 

15.17 

15.92 

15.57 

10.16 

11.76 

7.36 

4.90 

4.51 

5.993 

5.44 

®  For  notation  see  Ref.  [23].  Lengths  in  A  and  angles  in  degrees.  BE:  Binding  energy  (kcal/mol).  B:  Becke  [26]  functional  for  GGA. 
PW:  Perdew-Wang91  [27]  functional  for  GGA.  MP2:  Second-order  Moller-Plesset  with  a  6-311  +  +  G(2d,  2p)  basis  [28]. 
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Let  us  show  first  results  for  an  isolated  gua¬ 
nine-cytosine  pair.  Figure  2  shows  the  structure  of 
the  pair,  with  its  three  hydrogen  bonds.  Table  VI 
shows  the  distances  obtained  for  them.  The  bridge 
distances  (N-N  or  O-N)  are  10-15%  shorter  than 
that  obtained  from  other  more  elaborate  calcula¬ 
tions  [31,  32].  This  may  be  due  to  the  short 
employed  in  the  calculations.  The  position  of  the 
hydrogen  atoms  in  between  tends  too  much  to¬ 
ward  the  middle  of  the  bridge.  It  has  been  argued 
that  this  is  a  pathology  of  LDA  that  can  be  cor¬ 
rected  using  GGA  instead,  but  it  could  also  be  due 
(or  enhanced)  by  the  fact  of  using  nonscaled  sin- 
gle-z  basis  for  H,  which  gives  too  long  O-H  and 
N-H  bonds. 

The  DNA  calculation  required  about  500  Mb  of 
computer  memory.  For  the  order-N  solution  of  the 
electronic  problem,  a  cutoff  of  4.0  A  was  used  for 
the  electronic  wave  functions,  which  were  reduced 
to  only  occupied  states  (i.e.,  M  =  N/2).  The  en¬ 
ergy  functional  [Eq.  (17)]  therefore  reduces  to  the 
original  Mauri-Ordejon  functional  [3,  4].  The  full 
relaxation  of  the  structure  needed  over  200  mini¬ 


mization  steps,  which  amounted  to  around  5  days 
of  CPU  time  in  a  HP  Cl  10  workstation.  The  devia¬ 
tion  from  the  experimental  geometry  of  Ref.  [29]  is 

/  E(qh  “  ^^exo)^  o  ,  , 

^  =  V  -  ^  =  0.23  A.  (18) 

V  N 

The  hydrogen  bonds  between  guanine  and  cyto¬ 
sine  display  similar  pathologies  as  the  ones  showed 
by  the  isolated  pair  of  bases,  as  can  be  seen  in 
Table  VII.  It  is  interesting  to  note,  however,  that 
the  skeleton  shows  much  smaller  deviations,  as. 


TABLE  VII  _ _ 

Distances  (in  A)  for  the  H  bonds  in  a  guanine 
(Gu)  -  cytosine  (Cy)  pair  within  the  DNA  structure. 


Gu-Cy 

d(Gu-Cy),h 

d(Gu-Cy)exp 

N-N 

2.58 

2.85 

N~0 

2.63 

2.78 

O-N 

2.50 

2.80 

TABLE  VI  _ _ 

Distances  (in  A)  for  the  H-bonds  in  the  guanine  (Gu)  -  cytosine  (Cy)  isolated  pair.® 


Gu-Cy 

d{Gu-H) 

d(H-Cy) 

c/(Gu-Cy) 

d(Gu  Cy)[\/]p2 

Exp. 

N-H-N 

1.24 

1.37 

2.61 

3.05 

2.95 

N-H-0 

1.17 

1.36 

2.53 

3.01 

2.86 

0-H-N 

1.23 

1.28 

2.51 

2.93 

2.91 

®  MP2  refers  to  the  Moller-Plesset  calculation  of  Ref.  [32]  using  a  6-31 G*  basis.  Experimental  result  are  taken  from  Saenger  [33]. 
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for  example,  can  be  seen  in  a  characteristic  dis¬ 
tance  between  the  C  atoms  in  the  sugar  molecules 
supporting  a  basis  pair  at  opposite  sides,  R,  char¬ 
acteristic  of  the  interhelix  distance: 

R,,p  -  10.70  A,  Rth  =  10.61  A.  (19) 

The  structure  is  not  relaxed  along  the  helix  axis. 
However,  the  calculated  stress  along  that  direction 
is  quite  small  (11  kbars)  allowing  us  to  expect  a 
small  relaxation  in  that  direction.  A  more  detailed 
account  of  the  results  will  be  presented  elsewhere. 


Conclusions 

An  efficient  method  for  ab  initio  calculations 
has  been  presented.  It  is  based  on  the  density-func¬ 
tional  theory  (DFT)  and  utilizes  LCAO  and  stan¬ 
dard  norm-conserving  pseudopotentials.  It  has  also 
been  shown  how  the  computational  effort  scales 
linearly  with  the  system  size.  After  a  description  of 
its  different  aspects,  calculations  have  been  pre¬ 
sented  for  testing  its  capabilities,  beginning  with 
small  molecules  and  finishing  with  DNA.  The 
quality  of  the  results  has  been  shown  to  be  mostly 
related  to  the  quality  of  the  LCAO  basis  set. 
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ABSTRACT:  Electron  propagator  methods  for  calculating  vertical  electron  detachment 
energies  of  closed-shell  anions  are  compared  with  respect  to  accuracy  and  efficiency. 
Examination  of  self-energy  terms  and  numerical  tests  on  typical  anions  show  that  the 
partial  third-order  (P3)  quasi-particle  approximation  is  superior  to  the  third-order  and 
outer  valence  Green's  function  methods  in  both  respects.  ©  1997  John  Wiley  &  Sons,  Inc.  Int 
J  Quant  Chem  65:  463-469,  1997 


Introduction 

The  chemistry  and  physics  of  gas-phase  anions 
exhibit  several  lively  trends  where  theory  and 
experiment  interact.  Double  Rydberg  anions 
demonstrate  the  importance  of  diffuse  basis  func¬ 
tions  and  electron  correlation  in  describing 
electron  pairs  bound  to  a  molecular  cation  [1], 
Thorough  calculations  continue  to  identify  super¬ 
halogens:  molecules  with  electron  affinities  ex¬ 
ceeding  those  of  the  halogen  atoms  [2].  Doubly 
charged  and  multiply  charged  metal  fluoride  com¬ 
plexes  have  been  studied  computationally  [3].  Car- 
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bon  cluster  dianions  [4]  have  been  examined  with 
several  theoretical  techniques  [5].  Advances  in 
preparative  and  computational  capabilities  por¬ 
tend  an  expansion  of  interest  in  larger  species  such 
as  fullerenes  [6]  and  organic  polyanions  [7].  A 
need  therefore  exists  for  correlated  methods  that 
are  accurate  and  feasible  for  systems  that  require 
hundreds  of  basis  functions. 

Among  the  most  important  properties  of  an 
anion  are  its  electron  detachment  energies,  both 
vertical  and  adiabatic.  Because  of  the  variety  and 
unprecedented  character  of  bonding  patterns  found 
in  anions,  a  theory  of  electron  detachment  energies 
should  be  free  of  model  potentials,  should  be  sys¬ 
tematically  improvable  with  respect  to  electron 
correlation,  and  should  include  no  arbitrary  pa¬ 
rameters.  Some  ab  initio  techniques  satisfy  these 
requirements.  Accuracy  and  feasibility  criteria,  es¬ 
pecially  the  scaling  of  arithmetic  operations  and 
storage  requirements,  often  determine  which  of 
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these  techniques  will  be  most  advantageous  for  a 
given  system.  Electron  propagator  methods  [8-15] 
excel  in  calculations  of  many  types  of  electron 
binding  energies  and  have  proven  to  be  useful 
tools  in  the  study  of  anions  [16]. 

A  recent  propagator  approximation  is  compared 
here  with  methods  that  have  been  extensively  ap¬ 
plied.  Closed-shell  anions  with  subtle  correlation 
effects  and  extensive  orbital  relaxation  in  the  corre¬ 
sponding  neutral  radicals  are  considered.  Calcula¬ 
tions  are  performed  with  various  basis  sets  in 
order  to  estimate  the  behavior  of  the  propagator 
approximations  in  the  limit  of  a  complete  basis  set. 


Theory 

The  r,  s  element  of  the  electron  propagator  [8]  is 
GJE)  ^ 

(N\a\\N-l,nXN-l,n\a,\N} 
\  E  +  E„iN-l)  -Eq(N)  -i-n 
^  ^  (N\aJN+l,m)(N+l,m\al\N)\ 
E-E,„(N+l)+£o(N)+f7,  /• 

(1) 

Through  introduction  of  superoperators  and  a  cor¬ 
responding  metric,  the  propagator  may  be  repre¬ 
sented  more  compactly  [9,  17].  Superoperators  act 
on  field  operator  products,  X,  where  the  number 
of  annihilators  exceeds  the  number  of  creators  by 
one.  The  identity  superoperator,  I,  and  the  Hamil¬ 
tonian  superoperator,  H,  are  defined  by 

IX  =  X  (2) 

and 

HX  =  [X,  H]-,  (3) 

respectively.  The  superoperator  metric,  defined  by 

ifji\p)  =  (N\[iJ,p]^\N},  (4) 

depends  on  the  choice  of  the  N-electron  reference 
state,  IN).  In  this  notation, 

=  G,,(E)  =  (aMEI  -  (5) 

A  matrix  generalization  of  Eq.  (5)  is 

G(E)  =  (a|(Ef-H)'’a).  (6) 


After  the  inner  projection, 

G(£)  =  (a|u)(u|(Ef-  H)u)”\u|a),  (7) 

where  u  is  the  vector  of  all  X  field  operator 
products.  If  u  is  partitioned  into  the  primary  space, 
a,  and  an  orthogonal  space,  f,  then  the  partitioned 
form  of  the  propagator  matrix  reduces  to 


£l-(a|Ha)  -(a|Hf) 

-1 

1 

-(f|Ha)  El-(fiHf) 

,0. 

(8) 

After  a  few  elementary  matrix  manipulations,  a 
convenient  form  of  the  inverse  propagator  matrix 
emerges,  where 

G-^E)  =  El  -  (a|Ha) 

-(a|Hf)[El-(f|Hf)]‘'(f|Ha).  (9) 

Elements  of  the  zeroth-order  inverse  propagator 
matrix  are 

[Go-HE)]^^  =  (E-e,)5„.  (10) 

(The  poles  equal  Koopmans'  theorem  results.)  The 
inverse  propagator  matrix  and  its  zeroth-order 
counterpart  therefore  are  related  through 

G-'CE)  =  GoHE)  -  X(^)  -  2'(E),  (11) 

where 

2(co),..  =  =  E('-sllfM)pA,  (12) 

tu 

and 

S'(E)  =  (a|Hf)[  El  -  (f|Hf)]“\f  iHa).  (13) 

Corrections  to  the  zeroth-order  propagator  in  Eq. 
(11)  are  gathered  together  in  a  term  known  as  the 
self-energy  matrix  S(E).  The  Dyson  equation  may 
be  written  as 

G-HE)  =  Go'(E)  -  2(E).  (14) 

In  the  self-energy  matrix,  there  are  energy-inde¬ 
pendent  terms  and  energy-dependent  terms: 

2(E)  =  X(oo)  +  2'(E).  (15) 

In  the  limit  of  |E| oo,  2(E)  approaches  its 
energy-independent  component,  X(^).  At  a  pole 
energy, 

det{G(E)}  ->  00.  (16) 


=  lim 
0 
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Poles  may  be  found  by  requiring  that 

det{G-HE)}  =  0.  (17) 

Previous  Approximations 

The  second-order  self-energy  matrix, 
=(a|H£3f’[El-(f3|Hf3)‘"^] 

(18) 

follows  from  zeroth-  and  first-order  choices  for 
superoperator  blocks  defined  in  terms  of  the  a  and 
£3  {2h-p  and  2p-h)  operator  manifolds  [9].  (Super¬ 
scripts  in  parentheses  denote  the  order  through 
which  a  superoperator  matrix  element  is  evalu¬ 
ated^)  Upon  retention  of  first-order  terms  in 
(f3|Hf3),  third-  and  higher-order  contributions  to 
the  self-energy  matrix  are  generated.  Third-order 
terms  also  are  created  by  perturbative  improve¬ 
ments  in  the  ground-state  averages  [18],  The 
third-order  self-energy  matrix  contains  contribu¬ 
tions  of  both  types  and  may  be  expressed  as 

=  X^^KE)  +  (alVa)® 

+  (a| Hf  3)® [  El  -  (f 3I Hof 3r]  ' (f 3I Ha)^“ 

+  (a|Hf3f^|El  -  (fjlHofsrl  \f3lHaf’ 


+  (a|Hf3f^[El-  (f3|Hof3f^]  \f3\Vhf 

x[El-(f3|Hof3)®]  '(fjiHa)"  (19) 

where  V  is  the  superoperator  for  the  fluctuation 
potential. 

Results  on  valence  ionization  energies  and  elec¬ 
tron  affinities  of  closed-shell  molecules  generally 
indicate  that  off-diagonal  elements  of  the  self¬ 
energy  matrix  in  the  canonical  basis  are  small  and 
have  a  negligible  effect  on  poles.  Quasi-particle 
approximations  explicitly  neglect  these  matrix  ele¬ 
ments.  The  associated  pole  search  becomes  espe¬ 
cially  easy,  for  the  zeroes  of  the  diagonal  elements 
of  the  Dyson  equation  can  be  found  by  solving 

+  (20) 

Simple  measures  for  estimating  fourth-  and 
higher-order  terms  have  been  applied  widely.  Once 
a  third-order  pole  is  found,  various  terms  are  rear¬ 
ranged  to  make  these  estimates.  The  A  version  of 
the  outer  valence  Green's  function  (OVGF)  meth¬ 
ods  [14],  e.g.,  corrects  canonical  orbital  energies 
with 

X(^,0VGF)(£)  _  2(2)(£)  +  (1  +  X,)“^Xf;(E), 

(21) 

where  E  is  chosen  to  be  the  third-order  pole  and 


((alHf3)®{£l  -  (f3|Hf3)®}'\f3lHart 
X  =  -2-^^ _ - _ - _ (22) 

{(alHf3f  (e1  - 

The  B  approximation  is  similar,  but  separates  the  energy-dependent  2  p-ft  and  2h~p  contributions  through 
X(b,ovgf)(£)  ^  ;5(2)(£)  +  (alVa)'?  +  (1  +  X2"-'’)''Xf/^"-'’>(£)  +  (1  +  X,2p-")"'Xf/2F-'»(E),  (23) 
where 
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and 


{(a|Hf2p-/,r{El  - 

|(a|Hf2p_,f '{eI  -  (f2p-.lHf2p-„r}  '(f2p-.lHaf 


(25) 


Note  that  the  constant  terms  are  not  scaled  in  this 
approximation.  These  schemes  function  best  when 
the  third-order  numerators  in  Eqs.  (22),  (24),  and 
(25)  are  small  compared  to  the  second-order  de¬ 
nominators.  For  other  cases,  the  C  recipe  often  is 
preferable.  The  self-energy  correction  is 

=  2®(£)  +  (1  +  x^y^xf^iE), 

(26) 

where 

xy-f’xfy'-f’KE)  +  xy-''xfyp-'’KE) 

-  xfy’'-’’KE)  +  2f/2p-''>(E)  ■ 

(27) 

The  utility  of  quasi-particle  approximations  was 
tested  recently  through  a  comparison  of  third-order 
and  OVGF  results  for  25  ionization  energies  of  10 
closed-shell  molecules  with  H,  C,  N,  O,  and  F 
nuclei  [19].  Calculations  were  performed  with  a 
correlation-consistent,  triple  ^  basis  [20].  Root 
mean-square  errors  with  respect  to  experiment  in 
eV  were  1.25  for  Koopmans's  theorem,  0.67  for 
third  order,  0.27  for  OVGF-A,  0.33  for  OVGF-B, 
and  0.32  for  OVGF-C.  After  employment  of  numer¬ 
ical  criteria  [19]  for  choosing  a  recommended 
OVGF  value  (i.e.,  a  selection  from  one  of  the  three 
versions),  the  root  mean-square  error  is  0.25  eV. 
Once  third-order  results  are  available,  OVGF  esti¬ 
mates  are  very  efficient  tools  for  making  assign¬ 
ments  and  predicting  ionization  energies. 


Partial  Third-order  Theory 

The  usual  choice  of  superoperator  metric  starts 
from  a  Hartree-Fock  wavefunction  plus  perturba¬ 
tive  corrections: 

(Y|Z)  =  (HF\(1  +  T■^)[y^Z]+  (1  +  T)|HF>, 

(28) 


where 

T  =  TP  +  Tf )  +  TP  +  Tf)  jp  ^  ^  (29) 

The  level  of  excitation  in  is  indicated  by  the 
subscript,  e,  and  the  order  is  defined  by  the  super¬ 
script,  /.  For  example,  second-order  triple  excita¬ 
tions  are  represented  by  ip.  Coupled-cluster 
parametrizations  of  this  metric  suggest  an  alterna¬ 
tive  form: 

(Y|Z)  =  <HFk-^[Y^Z]  +  e^|HF>.  (30) 

This  choice  produces  asymmetric  superoperator 
matrices.  A  simplified  final  form  for  the  self-energy 
matrix  that  does  not  require  optimization  of  clus¬ 
ter  amplitudes  is  sought  presently;  the  approxima¬ 
tion 

«  1  +  TP  (31) 

therefore  is  made. 

With  this  choice,  several  third-order  terms  that 
appeared  with  the  usual  metric  are  eliminated. 
The  new  self-energy  matrix  in  third  order  is  asym¬ 
metric  and  is  expressed  by 

2(F) 

+  (a|Hf3f'{El-(f3lHf3)®}  \y\Hif 

+  (a|Hf3/”{£l-(f3lHf3fy'(f3lVf3f' 

x(El-(f3|Hf3)®’}  \f3\Haf\  (32) 

Note  that  energy-independent  terms  in  the  third- 
order  self-energy  matrix  are  not  retained. 

Two  observations  suggest  additional  economies: 
First,  numerical  results  for  ionization  energies 
show  that  third-order,  2p-h  terms  in  Eq.  (32)  are 
small  relative  to  their  2h-p  counterparts  [21]. 
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Terms  arising  from  these  operators  are  important 
in  second  order,  however.  Second,  evaluation  of 
one  of  the  third-order  2  p-h  terms  requires  electron 
repulsion  integrals  with  four  virtual  indices,  ac¬ 
cording  to 


Tp,c(E)  =  -  E 


1  ^  (pi\\ab)(ab\\cd) 


2  ^  E  +  e,- 


(33) 


where  i,  ;, fc, . . .  are  occupied  indices,  a,h,c,... 
are  virtual  indices,  and  p,(\,r,...  are  general  in¬ 
dices.  Storage  of  these  integrals  is  often  avoided 
through  semidirect  algorithms  [22].  Contractions 
involving  integrals  with  four  virtual  indices  re¬ 
main  the  arithmetic  bottleneck  in  third-order 
quasi-particle  theory,  for  the  scaling  factor  for  a 
given  final  state  with  index  p  is  OV^,  where  O  is 
the  number  of  occupied  orbitals  and  V  is  the 
number  of  virtual  orbitals. 

Neglect  of  third-order,  Ip-h  terms  produces 
this  self-energy  matrix: 


S(E)p^  - 


<  pi\\ab)(ab\\qO 


2  E  +  -  e,-  -  fy 

2  aij  E  +  Cfl  “ 


(34) 


where 


W,«/y  =  UaWij)  +  o  E 


(qa\\bc}(bc\\ij) 


be 


+  a-Eyy)E 


(cik\\bi)(ba\\jk} 


bk  "E 


(35) 


and 


tJp„,y(E)  = 


1„  {pa\\kl){kl\\ij) 

2  w  E  +  -  e, 


-(1- VE 

bk 


{pb\\jk){ak\\bi} 

E  +  Cfc  -  ey  -  e/ 


(36) 


This  partial  third-order  expression  is  designated 
by  the  abbreviation  P3. 

Comparison  of  the  self-energy  matrix  elements 
of  Eq.  (34)  with  older,  related  methods  [14,  23] 
reveals  the  advantages  of  the  P3  approximation. 
OVGF  results  [14]  and  the  third-order  quasi-par¬ 


ticle  calculations  on  which  they  are  based  require 
the  evaluation  of  the  diagonal  elements  of  the 
self-energy  matrices  given  in  Eq.  (18)  (second-order 
terms)  and  Eq.  (19)  (third-order  terms).  The  second 
term  (the  third-order,  energy-independent  dia¬ 
grams)  and  the  third  term  in  Eq.  (19)  are  not 
present  in  the  P3  self-energy.  Furthermore,  the 
2  p-h  operators  included  in  the  fourth  and  fifth 
terms  of  Eq.  (19)  are  dropped  in  P3  theory  as  well 
Another  difference  between  P3  and  OVGF  is  the 
absence  of  scaling  factors,  such  as  those  in  Eqs, 
(21)”(27),  in  P3. 

In  the  P3  quasi-particle  propagator,  only  diago¬ 
nal  elements  of  the  self-energy  matrix  are  retained. 
The  first  contraction  in  Eq.  (35)  is  the  most  de¬ 
manding,  for  it  has  an  arithmetic  scaling  factor  of 
for  a  given  q.  This  step  also  requires  electron 
repulsion  integrals  with  one  occupied  and  three 
virtual  indices.  The  intermediate  is  energy- 
independent  and  must  be  evaluated  once  only  for 
each  final  state. 

A  quasi-particle  implementation  of  the  P3 
method  was  applied  to  19  ionization  energies  of 
six  closed-shell  molecules  with  a  correlation-con¬ 
sistent,  triple  ^  basis  [21].  Comparisons  with  re¬ 
sults  of  more  costly  propagator  techniques  were 
made  by  applying  the  same  basis  sets.  The  average 
absolute  errors  in  eV  are  1.34  for  Koopmans's 
theorem,  0.25  for  OVGF-B,  and  0.19  for  P3.  The  P3 
procedure  exhibits  accuracy  at  least  as  good  as 
that  of  other  methods,  superior  arithmetic  scaling, 
and  no  need  for  electron  repulsion  integrals  with 
four  virtual  indices. 


Results 

Electron  propagator  calculations  were  per¬ 
formed  on  F“,  OH“,  NH2,  Cl",  SH“,  and  PH2 
with  a  modified  version  of  Gaussian  94  [24].  The 
geometries  of  the  anions  were  optimized  with 
MBPT(2)  [25]  and  the  6-31  +  G(rf,p)  basis  [26]. 
Propagator  calculations  employ  three  basis  sets: 
6-311  +  +G(2d/,2pd)  [27],  aug-cc-pVDZ,  and  aug- 
cc-pVTZ  [20].  Core  orbitals  were  dropped  from  all 
summations  pertaining  to  the  propagator  self-en¬ 
ergies.  Several  diagonal  self-energy  approxima¬ 
tions  were  examined:  second  order,  third  order, 
OVGF-A,  OVGF-B,  OVGF-C,  the  recommended 
OVGF  (OVGF-R)  value  arising  from  a  selection 
procedure  [19],  and  P3. 
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TABLE  I _ 

Electron  detachment  energies  (eV). 


Anion 

Basis 

KT 

2 

3 

A 

B 

c 

R 

P3 

Expt.  [28] 

F- 

6-311  ++G(2c/f,2pd) 

4.83 

1.23 

5.69 

3.83 

4.39 

4.17 

4.39 

3.72 

aug-cc-pVDZ 

4.93 

1.03 

6.61 

4.32 

5.05 

4.62 

5.05 

3.79 

aug-cc-pVTZ 

4.92 

1.21 

6.75 

4.24 

5.00 

4.66 

5.00 

4.15 

3.399  ±  0.003 

OR- 

6-311 +-HG(2d^2pc/) 

2.88 

-0.01 

3.59 

1.92 

2.38 

2.28 

2.38 

2.12 

aug-cc-pVDZ 

2.95 

-0.17 

4.22 

2.26 

2.86 

2.58 

2.86 

2.11 

aug-cc-pVTZ 

2.97 

-0.02 

4.43 

2.24 

2.86 

2.67 

2.86 

2.48 

1.828 

NH2“ 

6-311  -(--l-G(2df,2pd) 

1.30 

-0.38 

1.36 

0.52 

0.77 

0.81 

0.77 

0.77 

aug-cc-pVDZ 

1.31 

-0.51 

1.53 

0.64 

0.97 

0.91 

0.97 

0.64 

aug-cc-pVTZ 

1.34 

-0.30 

1.65 

0.68 

1.01 

1.02 

1.01 

0.96 

0.779  ±  0.037 

GI¬ 

6-3^^++G{2df,2pd) 

4.08 

3.04 

3.68 

3.39 

3.54 

3.57 

3.54 

3.41 

aug-cc-pVDZ 

4.10 

2.91 

3.85 

3.49 

3.70 

3.68 

3.70 

3.35 

aug-cc-pVTZ 

4.09 

3.08 

3.88 

3.51 

3.73 

3.74 

3.73 

3.50 

3.615 

SH" 

6-311-i--i-G(2df,2pd) 

2.56 

1.80 

2.27 

2.03 

2.18 

2.20 

2.18 

2.08 

aug-cc-pVDZ 

2.58 

1.70 

2.39 

2.09 

2.29 

2.28 

2.29 

2.01 

aug-cc-pVTZ 

2.58 

1.88 

2.48 

2.16 

2.38 

2.39 

2.38 

2.19 

2.319  ±  0.010 

PH2" 

6-311-l--l-G(2d/,2pd) 

1.20 

0.89 

1.06 

0.95 

1.06 

1.05 

1.06 

0.98 

aug-cc-pVDZ 

1.21 

0.81 

1.10 

0.94 

1.10 

1.09 

1.09 

0.88 

aug-cc-pVTZ 

1.23 

0.99  . 

1.22 

1.06 

1.22 

1.21 

1.21 

1.09 

1.271  ±  0.010 

F“,  OH“,  and  NH2  were  challenging  tests  for 
perturbative  quasi-particle  methods.  Table  I  sum¬ 
marizes  these  calculations  and  shows  that  third- 
order  results  (see  the  column  labeled  3)  may  be 
worse  than  the  predictions  of  Koopmans's  theo¬ 
rem  (see  the  KT  column).  Such  behavior  is  rare  for 
ionization  energies  of  closed-shell  molecules. 
Second-order  results  are  relatively  unreliable  and 
may  even  produce  the  wrong  sign.  It  is  also  worth 
noting  that  third-order  pole  strengths  often  exceed 
unity  for  OH“  and  NH2 .  (The  analytic  behavior  of 
the  self-energy  matrix  elements  does  not  assure 
the  correct  sign  of  the  derivatives  with  respect  to 
E.)  OVGF  scaling  procedures  reduce  the  errors,  but 
discrepancies  remain  large.  The  A  version  of  OVGF 
behaves  best  for  F”,  OH",  and  NH2,  but  OVGF-R 
(see  the  column  labeled  R)  generally  favors  the  B 
version.  As  basis  sets  are  improved,  electron  de¬ 
tachment  energy  results  generally  increase.  The 
overestimation  of  these  quantities  by  third-order 
and  OVGF  methods  implies  that  discrepancies  will 
grow  in  calculations  with  basis  sets  of  higher  qual¬ 
ity.  It  also  is  possible  for  fortuitous  cancellations  of 
basis-set  and  correlation  errors  to  produce  accurate 
predictions. 

The  situation  is  markedly  different  for  Cl“,  SH^, 
and  PH  2-  Here,  agreement  with  experiment  is 
good  for  third-order  and  OVGF  methods,  even 
though  an  overestimate  of  the  electron  detachment 
energies  occurs  for  certains  of  pairs  of  methods 


and  basis  sets.  For  Cl"  and  SH",  better  basis  sets 
will  produce  poorer  results  with  third-order  and 
the  B  and  C  versions  of  OVGF.  A  reexamination  of 
the  OVGF  selection  procedure  and  its  parameters 
may  be  advisable  for  anions, 

P3  results  for  F",  OH",  and  NHJ  indicate  that 
substantial  errors  have  been  overcome  with  this 
approximation,  but  that  more  terms  must  be  added 
to  obtain  close  agreement  with  experiment.  Fortu¬ 
itous  agreement  with  experiment  is  clearly  a  dan¬ 
ger  with  the  6-31  +  +G(2df,2pd)  basis,  especially 
for  NH2.  Basis  sets  larger  than  aug-cc-pVTZ  are 
likely  to  produce  greater  errors.  This  method  must 
be  used  with  caution  when  orbital  relaxation  is 
important  in  the  final,  neutral  state. 

P3  behaves  better  with  respect  to  basis-set  satu¬ 
ration  for  Cl",  SH",  and  PH2 .  Use  of  aug-cc-pVQZ 
basis  sets  will  probably  produce  very  close  agree¬ 
ment  with  experiment,  for  the  aug-cc-pVTZ  results 
are  still  below  the  experimental  values.  In  these 
three  cases,  relaxation  and  correlation  corrections 
to  Koopmans  theorem  results  are  relatively  small 
and  perturbative  arguments  can  be  expected  to  be 
more  successful. 


Conclusions 

The  P3  quasi-particle  approximation  is  more 
computationally  efficient  than  are  the  third-order 
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and  OVGF  methods.  The  bottleneck  contraction  of 
P3  requires  operations,  whereas  the  most 

difficult  contraction  of  third  order  (and  therefore 
OVGF)  scales  as  OV^.  The  latter  step  must  be 
repeated  for  each  energy  iteration,  but  the 
step  present  in  P3  must  be  performed  only  once 
per  final  state.  No  electron  repulsion  integrals  with 
four  virtual  indices  are  needed  in  a  P3  calculation. 

Results  on  the  electron  detachment  energies  of 
anions  suggest  that  P3  is  a  more  reliable  method 
than  are  the  third-order  and  OVGF  approxima¬ 
tions  in  the  limit  of  a  complete  basis  set.  For  F“, 
OH“,  and  NH2,  large  discrepancies  with  experi¬ 
ment  remain  for  all  of  the  methods,  although  the 
errors  are  not  as  large  for  P3.  For  Cl",  SH“,  and 
PH  2,  where  relaxation  and  correlation  effects  are 
not  as  large,  P3  results  produce  close  agreement 
with  experiment  and  are  likely  to  be  superior  to 
third  order  and  OVGF  for  saturated  basis  sets. 
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ABSTRACT:  We  develop  the  path  integral  method  for  quantum  chemistry,  apply  the 
Monte  Carlo  method  to  an  evaluation  of  the  path  integral,  and  calculate  the  ensemble 
average  of  the  energy.  For  finite  temperature,  a  simple  Monte  Carlo  evaluation  of  the 
path  integral  brings  out  the  negative-sign  problem.  In  this  work,  to  avoid  this  problem  in 
the  numerical  evaluation,  we  apply  the  new  reweighting  method  to  the  Monte  Carlo 
integration,  and  calculate  effectively  the  ensemble  average  of  the  energy  for  finite 
temperature.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  471-476,  1997 


Introduction 

The  path  integral  introduced  by  Feynman  [1] 
has  been  applied  to  many  problems  in  quantum 
mechanics.  In  the  path  integral,  the  physical  quan¬ 
tity  can  be  handled  as  a  commuting  variable  like  a 
classical  one.  Thus  the  physical  processes  can  be 
understood  more  clearly  than  in  ordinary  quan¬ 
tum  mechanics.  When  we  apply  the  path  integral 
method  directly  to  the  electron  field  described  by 
the  anticommuting  field  operator,  we  must  deal 
with  the  corresponding  anticommuting  c-num- 
bers,  the  Grassmann  numbers.  However,  the  prop¬ 
erties  of  the  Grassmann  numbers  are,  in  general, 
not  so  well  known  [2,  3],  so  that,  in  the  field  of 
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quantum  chemistry,  the  path  integral  method  has 
not  been  widely  applied  to  the  many-electron 
problems. 

Recently  the  path  integral  method  has  been  ap¬ 
plied  to  the  Hiickel  model  [4]  and  the  many-elec¬ 
tron  problem  [5].  We  have  also  developed  the  path 
integral  method  without  using  the  Grassmann 
numbers  to  estimate  the  electron  correlation  in 
molecules  [6].  In  that  work,  we  derived  the  new 
closure  relation  in  terms  of  the  Linear  Combina¬ 
tion  of  Atomic  Orbital  (LCAO)  coefficients  of  the 
molecular  orbital,  and  have  constructed  a  path 
integral  form  of  some  physical  quantities  using  the 
LCAO  closure  relation.  In  the  previous  work  [7], 
we  presented  the  new  closure  relations  that  could 
be  derived  from  an  arbitrary  complete  set  of  state 
vectors,  applied  the  closure  relation  to  construct 
the  partition  function  in  the  path  integral  form. 
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and  evaluated  the  ensemble  average  of  the  energy. 
In  these  treatments,  there  appear  no  Grassmann 
numbers  corresponding  to  the  electron  field  opera¬ 
tor  but  the  c-numbers  corresponding  to  LCAO 
coefficients  or  state  amplitudes. 

The  essence  of  the  path  integral  is  a  sum  of 
histories,  and  the  path  integral  is  usually  ex¬ 
pressed  as  a  multidimensional  integral  with  a  huge 
number  of  variables.  Therefore,  we  cannot  directly 
carry  out  this  integration  by  ordinary  ways  and 
will  be  forced  to  use  statistical  methods,  such  as 
the  Monte  Carlo  method.  Therefore,  we  have  ap¬ 
plied  the  Monte  Carlo  method  to  evaluate  the  path 
integral  [7]. 

The  ordinary  quantum  Monte  Carlo  method  is  a 
very  powerful  tool  in  order  to  estimate  thermal 
and  quantal  fluctuations  of  interacting  quantum 
spin  systems  and  strongly  correlated  electron  sys¬ 
tems  [8].  This  method  is  based  on  the  generalized 
Trotter  formula,  and  the  quantum  fluctuation  is 
expressed  in  the  ordinary  path  integral  form.  In 
simulations  of  quantum  systems,  however,  some 
of  the  local  Boltzmann  factors  become  negative, 
which  leads  to  deteriorated  statistics  [9].  This  diffi¬ 
culty  is  called  the  negative-sign  problem.  The  neg¬ 
ative-sign  problem  is  particularly  serious  in  low- 
temperature  regions  of  frustrated  quantum  spin 
systems  and  fermion  systems  [10,  11]. 

To  reduce  the  difficulties  of  the  negative-sign 
problem,  the  reweighting  method  has  been  pro¬ 
posed  [12].  In  this  method,  the  Monte  Carlo  sam¬ 
pling  is  performed  under  the  virtual  Boltzmann 
weight  instead  of  the  original  one,  which  is  posi¬ 
tive  definite  and  describes  well  the  original  sys¬ 
tem,  and  a  value  of  a  physical  quantity  is 
reweighted  by  the  correction  factor,  the  ratio  of  the 
original  and  virtual  Boltzmann  factor. 

Conventionally,  one  has  been  simulating  the 
system  by  the  absolute  value  of  the  Boltzmann 
factor  of  the  original  system  as  the  virtual  Boltz¬ 
mann  factor.  However,  in  the  quantum  Monte 
Carlo  calculation,  it  has  been  difficult  to  get  a 
quick  convergence  in  numerical  integrations  using 
the  absolute  value  weight  function. 

In  this  work,  we  apply  the  new  reweighting 
method  to  the  Monte  Carlo  integration,  and  calcu¬ 
late  effectively  the  ensemble  average  of  the  energy 
for  finite  temperature.  In  the  second  section,  we 
briefly  summarize  the  path  integral  formulation 
for  the  ensemble  average  of  the  energy.  In  the 
third  section,  the  negative-sign  problem  in  the 


Monte  Carlo  method  is  discussed  and  a  new 
reweighting  function,  the  midpoint  procedure,  is 
proposed  to  avoid  this  problem  in  the  numerical 
evaluation.  Finally,  we  apply  this  method  to  the 
hydrogen  molecule,  evaluate  the  ensemble  average 
of  the  energy,  and  compare  a  quickness  of  conver¬ 
gence  in  numerical  integrations  between  the  con¬ 
ventional  weight  function  and  ours. 


Path  Integral  Formulation 

In  this  section,  we  briefly  summarize  our  clo¬ 
sure  relations  and  the  path  integral  formulation  of 
the  ensemble  average  of  energy  [7]. 

Now,  we  assume  the  orthonormal  set,  {|fc)}, 
spans  the  Hilbert  space  as 

M 

1=  ElkXki  (1) 

k^l 

(klk')  ^  V,  (2) 

where  M  is  a  dimension  of  the  vector  space 
The  state  |fc>  could  be  either  the  single-particle 
state  or  the  many-electron  configurational  one. 
Then  we  could  construct  a  generalized  state  vector 
by  the  linear  combination  of  the  above  state  vec¬ 
tors,  as 


|C>  = 


M 

E  Q|fc>, 


k=l 


(3) 


here,  the  coefficient  is  an  arbitrary  complex 
number.  If  we  choose  the  atomic  orbital  as  \k), 
then  I C  >  corresponds  to  the  molecular  orbital,  and 
if  \k)  is  the  many-electron  configuration,  it  be¬ 
comes  the  configuration  interaction  (Cl)  state. 

Here,  let  us  assume  that  Q  are  variables,  and 
change  these  parameters  arbitrarily.  Then  resulting 
|C>  is  transformed  into  an  arbitrary  state  vector.  It 
is  important  to  notice  that  the  space  spanned  by 
the  state  vector  |C),  generated  by  changing  Q 
arbitrarily,  could  include  every  state  in  the  Hilbert 
space,  namely  the  set  {|C>}  is  an  overcomplete  set. 
Thus,  we  could  construct  the  closure  relations. 


/n 

*'  k=l 


^  dCt  dCj, 


liri 


M 


exp  -  EIQI  icxa  (4) 


k=l 
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and 

.  ^  dCf  dC.  I  ^  ,  \ 

l=M\fU  '  /g  E  IQI^  -  1  ICXCI.  (5) 

A:=l  277J  j 

The  delta  function  ensures  the  normalization  con¬ 
dition  of  the  state  vectors.  Thus,  in  this  case,  each 
variable  varies  only  on  the  surface  of  the  unit 
sphere  in  the  complex  space. 

Next,  we  restrict  in  Eq.  (3)  to  be  real.  Then, 
we  could  get  the  real  number  closure  relations, 

M  dC  I  ^  \ 

l  =  2f  riT^exp  -  LcHICXCI,  (6) 

k=l  V77  \  ^=1  j 


and 

I  M  \  .  ^  dC„  I  ^  ^  \ 

i  =  2r(yj/n^s|  Ec,^-1  ICXCI.  (7) 

Consider  a  partition  function,  Z  =  Tr^“^^, 
where  Tr  implies  a  trace  operation,  and  = 
with  the  Boltzmann  constant  and  a 
temperature  T.  Let  us  divide  the  temperature  in¬ 
terval  p  into  L  infinitesimal  slices,  i.e.,  jS  =  Lr, 
and  inserting  the  closure  relation  among  segments, 
then  Z  becomes 


1  /• 

Z  -  lim  —  /  dM[C*,C]Z[C*,C],  (8) 

L^oo  a  J 


where 


A  ^  dCt(l)dC,(l) 

rf^[c*,c]  ^  n  n  ,  , 

2771 


xnw[c*(/),c(/)],  (9) 


/=1 


Z[C*,C]  ^  n<c(Z)k-^^/'^|c(z  -  1)>,  (10) 
1=1 


and 


S  (11) 

W[C*(Z),  C(Z)]  is  the  Gaussian  weight  function  or 
the  delta  function  corresponding  to  Eq.  (4)  or  (5), 
respectively,  and  yk'  is  the  normalization  factor 
appearing  in  the  denominator  of  the  closure  rela¬ 
tion.  The  trace  operation  in  the  partition  function 
imposes  a  periodic  boundary  condition  upon  |C(Z)> 
as 


|C(0)>  =  |C(L)>.  (12) 


The  ensemble  average  of  the  Hamiltonian  can 
be  calculated  as 


<H> 


Tr(c-^^H) 

Trc-^« 


lim 

L-^  “ 


fdti[C*,C]Z[C*,C]^ 

L  <C(/)|H|C(;  - 1)> 

f  diJi[C*,C]Z[C*,C] 


Note  that,  in  the  above  expression,  the  normaliza¬ 
tion  factor  H  has  disappeared.  It  is  impossible  to 
evaluate  Eq.  (13)  by  direct  numerical  integration 
because  of  its  huge  amount  of  integration  variables 
generated  by  slicing  p  and  the  presence  of  the 
two-body  interactions.  Therefore,  we  should  ap¬ 
proximate  the  above  integral  or  should  adopt  other 
integration  methods  such  as  the  Monte  Carlo 
method. 


Negative-Sign  Problem  and 
Reweighting  Method 


An  estimation  of  Eq.  (13)  requires  the  numerical 
calculation  of  two  multidimensional  integrals  over 
M  X  L  variables  with  large  enough  L,  but  we 
cannot  directly  carry  out  this  integration  by  ordi¬ 
nary  ways.  Therefore  we  apply  the  Monte  Carlo 
method  to  do  it. 

If  we  carry  out  the  Monte  Carlo  integration,  we 
would  encounter  the  negative-sign  problem.  The 
negative-sign  problem  has  its  origin  in  the  fact  that 
some  of  the  off-diagonal  elements  of  the  local 
Boltzmann  factors,  <C(Z)k"^^/^|C(Z  -  1))  become 
negative. 

If  the  factor  becomes  negative,  we  cannot  carry 
out  the  Monte  Carlo  integration.  To  tackle  the 
problem,  we  apply  the  reweighting  method  to 
Monte  Carlo  integration  [12].  In  this  method,  a 
virtual  Boltmann  factor,  B(Z,  Z  -  1)  is  introduced, 
which  is  a  positive  definite  weight  function. 


<C(/)k-^»/^|C(/-l)> 

-  D) 


=  BO,  I-  D- 


BdJ-  1) 


(14) 
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We  regard  the  correction  factor,  which  is  the  sec¬ 
ond  factor  in  the  above,  as  a  part  of  the  integrand 
and  perform  the  importance  sampling  through  the 
stochastic  process  defined  by  the  virtual  Boltz¬ 
mann  weight. 

When  the  virtual  system,  which  is  described  by 
the  virtual  Boltzmann  weight,  serves  as  a  good 
approximation  of  the  original  system,  we  could 
obtain  correct  values  for  a  smaller  number  of  the 
Monte  Carlo  steps.  One  condition  of  a  useful  vir¬ 
tual  system  is  that  the  ground-state  energy  of  the 
virtual  system  is  close  to  that  of  the  original  sys¬ 
tem.  From  this  point  of  view,  various  virtual  Boltz¬ 
mann  factors  and  effective  Hamiltonians  were  pro¬ 
posed  as  candidates  for  the  better  virtual  system 
[12].  In  the  conventional  approach  [9],  the  absolute 
value  of  the  Boltzmann  factor  is  used  as  a  weight 
function,  i.e., 

BO,  /  -  1)  =  KC(/)k-^»/^|C(/  -  1)>1.  (15) 

However,  in  the  quantum  Monte  Carlo  calculation, 
it  is  difficult  to  get  a  quick  convergence  in  numeri¬ 
cal  integrations  using  the  absolute  value  weight 
function. 

For  the  path  integral  formulation,  the  local 
Boltzmann  factor  is  approximated  as 

<c(/)k-^^/Hc(/ -  1)> 

-  <C(/)|C(/-1)> 

X  exp 

Therefore,  the  origin  of  the  problem  is  reduced  to 
the  overlap  <C(/)|C(/  —  1)>.  In  our  Monte  Carlo 
algorithm  [7],  because  we  randomly  generate  the 
state  |C(Z)>  from  |C(Z  -  1)>,  it  is  possible  that  the 
overlap  becomes  negative.  Considering  this  fact, 
we  propose  a  new  weight  function  as 

X  -  1)>,  (17) 


13  <C(Z)|H|C(Z-1)> 
L  <C(Z)|C(Z  -  1)> 


where 


|C(/))  \t) 


FIGURE  1,  Midpoint  state.  We  introduce  the  midpoint 
state,  |f>  between  |C(/  -  1))  and  |C(/)>. 

Here  we  introduced  the  midpoint  state,  |0  be¬ 
tween  |C(Z  —  1))  and  |C(/)).  (See  Fig.  1.)  In  this 
case,  B(/,  Z  -  1)  is  approximated  as 

B(Z,  Z-1) 

-<C(Z)|Z><Z|C(Z~1)> 

2l[  (com  <fic(/-i)>  J  • 

(19) 

By  using  Eq.  (18),  the  preexponential  factor  be¬ 
comes 

<C(Z)lZ><f|C(Z-l)> 

(1  +  <C(Z)|C(Z  -  1)»(1  +  <C(Z  -  1)|C(Z)» 

“  2  +  <C(Z)|C(Z  -  1)>  +  <C(Z  -  1)|C(Z)>  ' 

(20) 

Then  the  weight  function  B(Z,  Z  -  1)  is  always  pos¬ 
itive  and  the  negative-sign  problem  is  absent. 

Our  new  weight  function,  the  midpoint  proce¬ 
dure,  is  interpreted  as  follows.  Under  the  original 
local  Boltzmann  factor,  the  system  freely  evolves 
from  |C(Z  -  1)>  to  |C(Z)>,  as  in  Figure  2(a),  and  we 
must  consider  every  path  between  them.  On  the 


_ |C(Z  ^  1)>  +  |C(Z)) _ 

*  ~  V2  +  <C(/)|C(Z  -  1)>  +  (CO  -  1)|C(0>  ■ 

(18) 


FIGURE  2,  Paths  between  |C(/  -  1)>  and  |C(/)>.  (a) 
Paths  under  the  original  Boltzmann  factor.  We  must 
consider  every  path  between  them,  (b)  Paths  under  the 
midpoint  procedure.  Every  path  must  passes  |f )  at  the 
middle  of  the  time  interval  p/L. 
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other  hand,  under  the  midpoint  procedure,  the 
system  evolves  from  |C(/  —  1))  to  |C(/))  but  be¬ 
comes  U)  at  the  middle  of  the  time  interval,  j3/L, 
as  in  Figure  2(b).  For  large  enough  L  or  small 
enough  time  interval,  we  could  expect  that  the 
system  behaved  as  if  it  were  a  classical  one  and 
reached  the  midpoint  at  the  middle  of  the  time 
interval.  Thus,  our  midpoint  procedure  is  a  good 
approximation  of  the  original  Boltzmann  factor 
and  does  not  bring  the  negative-sign  problem. 

We  will  show  the  effectiveness  of  the  midpoint 
procedure  by  evaluating  the  ensemble  average  of 
the  energy  in  the  next  section. 


Numerical  Examples: 

Hydrogen  Molecule 

To  demonstrate  that  our  new  reweighting  func¬ 
tion  works,  we  calculate  here  the  average  energy 
<  H  )  of  the  hydrogen  molecule. 

The  calculation  is  carried  out  with  the  mini¬ 
mum  STO  (f  =  1.0)  and  4-31G  basis  sets  at  the 
internuclear  distance  of  1,4  a.u.  The  Monte  Carlo 
integrations  were  done  changing  the  virtual  Boltz¬ 
mann  factor,  the  absolute  value  weight,  and  the 
midpoint  procedure.  The  number  of  samplings  is 
100,000  and  the  temperature  is  10  K.  Calculations 
are  executed  on  the  S-4/1000  (FUJITSU  Co.  Ltd.). 
The  convergence  of  <  H )  are  presented  in  Figures 
3(a)  and  3(b)  corresponding  to  the  minimum  STO 
basis  set  and  the  4-31G  one,  respectively.  In  the 
figures,  the  solid  curves  mean  the  energies  calcu¬ 
lated  by  the  midpoint  procedure,  and  the  dashed 
ones  that  by  the  absolute  value  weight.  We  omit¬ 
ted  error  bars  in  the  figures,  because  the  width  of 
the  error  bar  depends  upon  the  sampling  period  in 
the  Monte  Carlo  integration.  Note  that  the  full  Cl 
energies  are  —1.821140  and  -1.867025  a.u.  for 
those  basis  sets,  respectively.  For  both  of  calcula¬ 
tions,  the  calculated  energies  monotonously  ap¬ 
proach  to  the  full  Cl  energies,  but  the  calculation 
by  the  midpoint  procedure  converges  faster  than 
that  by  the  absolute  value  weight. 


Concluding  Remarks 

We  have  introduced  the  midpoint  procedure  in 
the  reweighting  method.  We  could  imagine  that  in 


0  200  400  600  800  1 ,000 

Monte  Carlo  Steps 

a) 


Monte  Carlo  Steps 
b) 


FIGURE  3.  Convergence  of  the  ensemble  average  of 
energy  of  the  hydrogen  molecule.  The  solid  curve  means 
the  energy  calculated  by  the  midpoint  procedure  and  the 
dashed  one  that  by  the  absolute  value  weight,  (a) 
Calculations  with  the  minimum  STO  basis  set.  (b) 
Calculations  with  the  4-31 G  basis  set. 


short  enough  time  the  system  behaved  as  if  it  were 
a  classical  one  and  reached  the  midpoint  at  the 
middle  of  the  time  interval.  Therefore,  the  physical 
meaning  of  the  procedure  is  clear  and  reasonable. 
We  have  calculated  the  ensemble  average  of  the 
energy  and  have  shown  that  calculations  with  the 
midpoint  procedure  converged  faster  than  that 
with  the  conventional  absolute  value  weight. 

The  reweighting  method  is  a  useful  tool  to  eval¬ 
uate  integrals  by  the  Monte  Carlo  method,  espe¬ 
cially  those  with  the  negative-sign  problem.  The 
reweighting  method  has  a  freedom  of  the  choice  of 
the  reweighting  function,  and  it  should  be  chosen 
by  the  physical  insights.  To  choose  the  better 
reweighting  function,  it  is  necessary  that  the  func¬ 
tion  is  hard  to  be  negative  and  the  virtual  system 
specified  by  the  function  has  close  properties  to 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


475 


KAWABE  ET  AL. 


the  original  one.  If  the  properties  of  the  virtual 
system  are  very  different  from  the  original  one,  we 
could  not  expect  a  quick  convergence  of  calcula¬ 
tions.  The  fast  convergence  of  calculations  and 
clearness  of  the  physical  meaning  shows  that  our 
midpoint  procedure  is  a  good  approximation  to 
the  original  system. 

In  the  Monte  Carlo  method  for  the  Fermi  sys¬ 
tem,  the  negative-sign  problem  always  occurs 
whatever  closure  relations  one  uses;  however,  we 
can  expect  that  the  midpoint  procedure  is  effective 
on  the  Monte  Carlo  integration  with  any  closure 
relation  because  of  the  above  arguments. 

Using  the  midpoint  procedure,  we  can  easily 
and  quickly  evaluate  physical  quantities  by  the 
path  integral  Monte  Carlo  method  not  only  for  the 
zero  temperature  but  also  for  the  finite  tempera¬ 
ture.  In  this  work,  we  have  used  the  8  function 
closure  relation,  Eq.  (4),  and  have  calculated  the 
energy  with  the  midpoint  procedure  only  for  the 
hydrogen  molecule.  In  the  future,  we  will  calculate 
other  quantities  as  well  as  the  energy  at  the  finite 
temperature  for  many  species  with  various  closure 
relations. 
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ABSTRACT:  A  multigrid  method  for  numerically  solving  electrostatics  and  quantum 
chemical  problems  in  real  space  is  discussed.  Multigrid  techniques  are  used  to  solve  both 
the  linear  Poisson  equation  and  the  nonlinear  Kohn-Sham  and  Poisson-Boltzmann 
equations.  The  electrostatic  potential,  Laplacian,  charge  densities  (electrons  and  nuclei), 
Kohn-Sham  DPT  orbitals,  and  the  self-consistent  field  potential  are  all  represented 
discretely  on  the  Cartesian  grid.  High-order  finite  differences  are  utilized  to  obtain 
physically  reasonable  results  on  modestly  sized  grids.  The  method  is  summarized  and 
numerical  results  for  all-electron  atomic  and  molecular  structure  are  presented.  The 
strengths  and  weaknesses  of  the  method  are  discussed  with  suggested  directions  for 
future  developments,  including  a  new  high-order  conservative  differencing  scheme  for 
accurate  composite  grid  computations  which  preserves  the  linear  scaling  property  of  the 
multigrid  method.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  477-486,  1997 


Introduction 


umerical  grid-based  approaches  offer  an  al¬ 
ternative  to  traditional  basis-set  methods  for 
electronic  structure.  Several  groups  have  recently 
begun  to  develop  real-space  grid  techniques, 
including  Becke  [1],  Bernholc  et  al.  [2-4] 
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Contract  grant  number:  CHE-9632309. 


Chelikowsky  et  al.  [5],  Gygi  and  Galli  [6],  Kaxiras 
et  al.  [7],  and  Beck  et  al.  [8-11].  In  addition  to 
these  methods,  there  have  been  other  develop¬ 
ments  which  point  to  the  possibility  of  linear  scal¬ 
ing  electronic  structure.  The  various  linear  scaling 
ideas  (besides  multigrid)  are  not  discussed  here 
but  the  reader  is  referred  to  the  growing  literature 
[12-23]. 

The  distinct  advantages  of  the  grid  methods  are 
(1)  the  convergence  of  the  "basis  set  free"  method 
is  controlled  only  by  the  grid  spacing  and  the 
order  of  the  difference  equations  (or  accuracy  of 
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the  finite  element  approximation),  (2)  the  methods 
are  flexible  in  that  solution  effort  and  accuracy  can 
be  focused  in  specific  regions  of  space  (all  iteration 
steps  are  local),  (3)  finite  or  periodic  systems  are 
treated  with  equal  difficulty;  there  is  no  need  for 
supercells  as  with  plane  waves,  and  (4)  linear 
scaling  is  possible  with  multigrid  acceleration  or 
related  techniques.  The  principal  disadvantage  is 
that  large  grids  may  be  required  to  obtain  the 
desired  accuracy  so  the  methods  can  be  memory¬ 
intensive.  However,  recent  pseudopotential  calcu¬ 
lations  [3-5]  have  shown  that  storage  require¬ 
ments  are  comparable  to  the  corresponding 
plane-wave  methods.  In  fact,  it  is  somewhat  sur¬ 
prising  that  results  of  decent  accuracy  can  be  ob¬ 
tained  at  all  when  the  grids  (if  Cartesian)  do  not 
respect  any  physical  atomic  or  molecular  symme¬ 
try.  It  will  be  shown  below  that  reasonable  accu¬ 
racy  can  indeed  be  obtained  in  all-electron  calcula¬ 
tions  even  on  uniform  grids  where  the  core  states 
are  relatively  poorly  represented.  Grid  refinement 
techniques  will  allow  improvement  in  those  re¬ 
gions  without  a  substantial  increase  in  computa¬ 
tional  cost  while  still  maintaining  the  favorable 
linear  scaling  property  of  the  multigrid. 

In  previous  work,  we  first  developed  a  multi¬ 
grid  method  for  solving  the  three-dimensional 
Poisson  equation  [10].  The  charges  were  repre¬ 
sented  directly  on  a  Cartesian  grid.  The  technique 
was  tested  on  several  finite  and  periodic  problems. 
Using  high-order  finite  difference  representations, 
it  yielded  accurate  results  for  analytic  atomic  mod¬ 
els  and  for  periodic  lattice  problems  which  tested 
the  method's  ability  to  handle  long-ranged  interac¬ 
tions.  Systems  as  large  as  a  4096  atom  ionic  crystal 
lattice  were  examined,  and  the  known  linear  scal¬ 
ing  of  the  multigrid  method  was  observed  [11].  It 
is  interesting  to  note  that  for  the  infinite  lattice 
problem  the  (local)  iterations  in  real  space  are 
performed  within  the  calculation  box  with  neigh¬ 
boring  wraparound  interactions  only  as  far  as  the 
finite  difference  Laplacian  extends  in  space.  The 
periodicity  is  enforced  simply  by  making  the  po¬ 
tential  equal  on  the  opposite  boundaries.  The  re¬ 
sulting  solution  gives  the  same  physical  result  as 
the  higher-scaling  Ewald  method. 

The  Poisson  solver  was  then  used  in  the  multi¬ 
grid  solution  of  the  Kohn-Sham  equation  [8,  9]. 
The  entire  problem  was  represented  discretely  on 
the  grid,  including  the  nuclear  charge  density.  In 
this  way,  the  full  electrostatic  problem  for  all  elec¬ 
trons  and  nuclei  is  solved  in  a  single  linear  scaling 


step.  The  time  spent  in  solving  the  Coulomb  prob¬ 
lem  composes  only  a  small  portion  of  the  overall 
computational  cost.  The  evolving  self-consistent 
effective  potential  was  generated  by  a  combination 
of  the  electrostatic  potential  and  the  VWN  ex¬ 
change  correlation  term  [24].  The  accuracy  of  the 
method  was  tested  in  several  model  atomic  and 
molecular  systems.  While  accurate  results  were 
obtained  for  small  Z  atoms,  the  core  states  of 
atoms  such  as  Ne  are  not  accurately  represented 
on  a  uniform  grid  large  enough  to  handle  the  2  s 
and  2  p  orbitals. 

The  purpose  of  the  present  article  is  to  present 
further  numerical  tests  of  the  real-space  multigrid 
methods.  I  first  give  a  brief  review  of  our  multi¬ 
grid  method  and  then  summarize  the  equations 
which  are  solved  numerically,  namely,  the  Pois¬ 
son,  Kohn-Sham,  and  Possion-Boltzmann  equa¬ 
tions.  Then  results  are  presented  for  atomic  struc¬ 
ture  and  it  is  shown  that  the  full  approximation 
scheme  (FAS)  multigrid  method  of  Brandt  et  al. 
[25]  works  well  for  small  Z  atoms  on  uniform 
grids,  but  fails  for  larger  Z  due  to  poor  representa¬ 
tion  of  the  Is  core  and  the  2s  state.  Nonetheless, 
converged  numerical  results  can  be  obtained  by  a 
simple  nested  iteration  procedure.  Results  are  also 
presented  for  the  CO  molecule  which  provides  a 
good  test  since  the  dipole  moment  is  a  difficult 
quantity  to  obtain  accurately.  New  methods  for 
solving  the  nonlinear  Poisson-Boltzmann  equation 
in  ab  initio  colloid  simulations  (colloids  treated 
explicitly,  ions  as  a  field)  and  for  carrying  out 
high-order  conservative  electrostatic  computations 
are  also  described. 

Our  work  is  related  to  recent  work  of  the  groups 
of  Chelikowsky  et  al.  [5],  Bernholc  et  al.  [4],  and 
Gygi  and  Galli  [6]  However,  each  of  these  methods 
has  employed  pseudopotentials  for  the  core  states, 
while  in  our  work  we  have  treated  the  nucleus  and 
core  electrons  also  explicitly  on  the  numerical  grid. 
Part  of  the  advantage  of  the  real-space  grids  is  that 
varying  resolution  can  be  employed  in  different 
parts  of  space  without  huge  computational  over¬ 
heads,  and  we  seek  to  exploit  this  feature  in  the 
all-electron  calculations.  Gygi  and  Galli  [6]  utilized 
curvilinear  coordinate  calculations  for  variable  res¬ 
olution,  but  care  must  be  taken  to  ensure  that  the 
correct  order  of  approximation  is  obtained  during 
the  multigrid  coarse  grid  corrections  or  else  the 
accuracy  of  the  solution  can  be  sacrificed  [26].  Our 
high-order  conservative  equations  method  main¬ 
tains  the  correct  order  over  the  whole  domain  [27]. 
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Multigrid  Description 

The  multigrid  technique  was  developed  in  the 
1970s  to  overcome  the  critical  slowing  down  (CSD) 
problem  which  occurs  in  real-space  relaxation 
methods  for  solving  partial  differential  equations 
[28].  In  short,  multigrid  combines  iterations  on  the 
finest  scale  with  correction  cycles  which  pass  to  a 
sequence  of  coarser  scales  to  smooth  modes  of  the 
error  on  a  large  range  of  length  scales.  In  this  way, 
the  solution  is  obtained  to  within  the  errors  caused 
by  placing  the  problem  on  the  grid  in  roughly 
10-20  fine-scale  iterations.  The  methods  were  first 
developed  to  solve  linear  elliptic  problems  like  the 
Poisson  equation.  Subsequently,  more  general 
multigrid  algorithms  were  developed  to  handle 
nonlinear  problems  including  eigenvalue  problems 
[25].  Here,  I  describe  our  multigrid  method;  an¬ 
other  detailed  discussion  was  given  in  a  recent 
article  [8]. 

The  technique  employed  here  is  the  FAS  nonlin¬ 
ear  multigrid  method.  The  method  is  equivalent  to 
the  standard  full  multigrid  method  for  linear 
problems,  but  can  treat  nonlinear  problems  such 
as  the  Kohn-Sham  equations  and  the  Poisson- 
Boltzmann  equation  as  well.  Consider  the  Poisson 
equation  represented  in  real-space  matrix  notation: 

LU^f,  (1) 

where  L  is  the  finite  difference  Laplacian,  U  is  the 
potential  {u  denotes  the  current  approximation); 
and  /  is  -47rp  (where  p  is  the  charge  density). 
This  matrix  equation  can  be  solved  by  one  of  the 
standard  iterative  relaxation  techniques  such  as 
the  Jacobi  method  or  successive  overrelaxation 
(SOR,  used  here  for  the  multigrid  smoothing  steps 
on  each  scale).  However,  for  a  given  grid  spacing, 
the  solution  stalls  after  initial  rapid  reduction  of 
the  errors. 

This  is  the  point  at  which  the  multigrid  coarse 
grid  correction  cycle  is  performed  to  decimate  er¬ 
rors  at  longer  wavelengths.  The  problem  is  passed 
to  the  next  coarser  grid  (H  =  2h,  where  H  and  h 
are  the  coarse  and  fine  grid  spacings,  respectively), 
and  a  modified  equation  is  solved  (consider  a 
two-level  problem  here  where  level  one  is  the 
coarse  scale,  and  level  two,  the  fine): 

+  tT  (2) 


The  superscript  labels  indicate  the  coarser  level;  I2 
is  an  operator  which  locally  averages  (or  restricts) 
the  charge  density;  and  is  the  defect  correction 
and  is  given  by 

7-1  =  (3) 

This  function  causes  the  coarse  grid  problem  to 
"optimally  mimic"  the  fine  grid  problem;  put  an¬ 
other  way,  if  the  exact  solution  were  passed  from 
the  fine  grid,  no  correction  would  occur. 

Iterations  are  performed  on  Eq.  (2),  and  then  the 
fine  grid  solution  is  corrected  via 

-h  (4) 

Here,  is  the  interpolation  operator  which  takes 
the  correction  back  to  the  fine  scale,  level  2.  The 
FAS  approach  can  be  used  for  nonlinear  problems 
since  the  iterations  are  performed  on  the  function 
itself  on  all  scales,  and  thus  no  linearity  of  the 
equations  is  assumed.  A  clear  and  detailed  de¬ 
scription  of  the  FAS  method  is  given  in  the  eigen¬ 
value  paper  of  Brandt  et  al.  [25]. 

We  used  a  high-order  finite  difference  represen¬ 
tation  for  the  Laplacian  [8,  29].  Along  one  of  the 
three  orthogonal  axes,  the  eighth-order  Laplacian 
is  given  by 

,  1 

V^u(i)  =  5Q4q;^2  [  -  4)  +  128uii  -  3) 

-1008M(i  -  2)  +  8064M(f  -  1) 

- 14350  M(i)  +  8064(f  +  1) 

-1008M(f  +  2)  +  128M(i  +  3) 

-9u(i  +  4d]  +  O(h^).  (5) 

The  three-dimensional  version  is  just  the  sum  of 
the  three  orthogonal  components.  Hence,  there  are 
3x9  ~  2  =  25  terms  in  the  Laplacian,  as  opposed 
to  7  for  the  second-order  case.  There  is  thus  a 
three-  to  fourfold  increase  in  the  time  of  each 
iteration;  however,  a  significant  gain  in  accuracy 
results  which  allows  a  much  larger  grid  spacing  /z, 
and  this  benefit  far  outweighs  the  increase  in  time 
of  each  iteration.  The  present  calculations  on  a 
uniform  grid  would  not  be  feasible  with  just  the 
second-order  form.  The  charge  density  is  repre¬ 
sented  on  the  grid,  with  the  nuclei  occupying  a 
single  grid  point  with  a  charge  density  of  Z/h^. 
For  the  Poisson-Boltzmann  equation,  the  charge 
density  is  replaced  by  the  nonlinear  exponential 
terms  (below).  The  orbitals  in  the  Kohn-Sham 
equations  are  represented  similarly  as  a  3d  array 
on  the  grid. 
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In  solving  the  Kohn-Sham  self-consistent  eigen¬ 
value  equations,  we  closely  followed  the  FAS  pre¬ 
scription  given  by  Brandt  et  al  [25]  with  the  addi¬ 
tion  of  the  self-consistent  loop  as  the  effective 
potential  changes  during  the  solution  process.  The 
same  FAS  equations  apply,  except  the  Laplacian  L 
is  replaced  by  the  Hamiltonian  in  coordinate  space 
and  eigenvalue  and  orthogonalization  steps  must 
be  added.  In  the  Brandt  method,  they  proposed 
that  these  steps  need  only  be  performed  on  the 
coarse  scales  since  they  are  global  operations.  This 
dramatically  reduces  the  computational  cost  since 
a  Gram-Schmidt  process  scales  as  where 

q  is  the  number  of  orbitals  and  total 

number  of  grid  points.  Therefore,  this  step  scales 
as  the  third  power  of  the  number  of  electrons.  By 
carrying  out  the  orthogonalization  on  the  next 
coarser  scale,  the  overall  time  is  reduced  by  a 
factor  of  8.  Also,  it  has  recently  been  shown  that 
accurate  results  can  be  obtained  with  localized 
orbitals  [16],  so  the  orthogolization  can,  in  princi¬ 
ple,  be  reduced  effectively  to  order  N. 

We  compute  the  eigenvalues  on  the  next  coarser 
scale  to  the  finest  scale  and  perform  the  orthogo¬ 
nalization  there  also.  The  eigenvalues  from  the 
coarse  scale  are  then  used  in  the  propagation  of 
the  orbitals  on  the  fine  scale.  This  method  requires 
that  an  accurate  representation  of  the  orbitals  is 
possible  on  the  coarser  scale  to  the  point  of  yield¬ 
ing  eigenvalues  which  correctly  mimic  those  on 
the  finer  scale.  This  condition  is  not  satisfied  for 
the  higher  Z  atoms,  especially  for  the  2  s  state 
which  has  a  cusp  and  an  oscillation  not  too  distant 
from  the  nucleus.  The  method  works  well  for  mul¬ 
tiple  orbitals  of  the  hydrogen  atom  and  the  ground 
state  of  the  helium  atom  on  a  uniform  grid,  but 
not  for  the  C,  O,  and  Ne  atoms  examined  here. 
However,  a  simple  nested  procedure  where  the 
solution  just  passes  successively  from  coarse  to 
fine  scales  (without  coarse  grid  corrections)  con¬ 
verges  and  we  have  used  this  method  to  obtain  the 
numerical  results  for  those  cases.  It  is  clear  that  the 
grid  refinements  will  allow  a  faithful  representa¬ 
tion  in  the  more  strongly  varying  regions.  These 
difficulties  are  not  encountered  in  the  electrostatics 
computations.  At  the  end  of  the  self-consistency 
steps,  we  perform  one  final  Gram-Schmidt  pro¬ 
cess  on  the  fine  scale  followed  by  a  Ritz  projection 
which  generates  the  optimal  occupied  subspace 
(i.e.,  all  errors  are  orthogonal  to  the  occupied  sub¬ 
space). 


Theory 

For  the  quantum  chemical  calculations,  the 
Kohn-Sham  equations  of  the  density  functional 
theory  [30]  were  solved  using  the  VWN  [24] 
parametrization  of  the  exchange  correlation  poten¬ 
tial.  All  calculations  were  carried  out  at  the  spin- 
restricted  level.  The  Kohn-Sham  equations  are  as 
follows: 

[ lAi(r)  =  e,i/^,(r),  (6) 

with 

=  z;,„„(r)  +  f  dr'  +  p(r)).  (7) 

The  electron  density  is  obtained  from  the  orbitals: 

N/2 

p(r)  =  2  ^  (8) 

i=i 

The  evolving  electrostatic  potential  is  obtained  by 
multigrid  solution  of  the  Poisson  equation: 

V^cpir)  =  -477p^o^(r).  (9) 

As  mentioned  above,  the  nucleus  is  represented 

discretely  on  the  grid  and  the  Poisson  equation  is 
thus  solved  simultaneously  for  the  potential  due 
to  the  nuclei  and  electrons  in  a  single  linear  scaling 
step.  This  yields  the  first  two  terms  on  the  rhs  of 
Eq.  (7).  Note  that  a  self-energy  for  the  nuclear 
distribution  must  be  subtracted  in  computing  total 
energies.  This  is  a  one-time  calculation  on  a  finite 
grid  for  a  given  order  of  the  Laplacian,  since  the 
self-energy  scales  as  Z^/h. 

In  related  work,  we  have  begun  to  examine 
efficient  numerical  techniques  for  simulating  col¬ 
loid  systems  at  the  ab  initio  level  where  the  point 
ions  of  the  solution  are  represented  as  a  field  and 
the  charged  colloids  are  discrete  charges  on  the 
lattice  [31].  These  methods  stem  from  the  lattice 
field  theory  derived  by  Coalson  and  Duncan  [32, 
33].  This  is  an  exact  statistical  mechanical  theory 
of  the  ion  gas  in  the  field  of  the  colloids,  and 
the  Poisson-Boltzmann  equation  emerges  as  the 
saddle  point  (mean  field)  in  their  functional  inte¬ 
gral  representation.  Corrections  to  the  Poisson- 
Boltzmann  equation  can  be  made  but  are  very 
costly  computationally.  The  Poisson-Boltzmann 


480 


VOL.  65,  NO.  5 


REAL-SPACE  SOLUTION  OF  ELECTROSTATIC  PROBLEMS 


equation  is  known  to  be  quite  accurate  for  mono¬ 
valent  ions  at  low  to  moderate  concentrations  in 
solution.  Typically,  with  colloid  particles  of  charge, 
say  600 e,  something  on  the  order  of  10^  ions  would 
be  required,  which  is  far  beyond  the  capabilities  of 
current  simulation  methods  for  charged  systems. 
Thus,  the  Poisson-Boltzmann  equation  makes  a 
bridge  from  a  continuum  description  of  the  ions  to 
a  discrete  representation  of  the  colloids: 

V  •  [e(r)V4>ir))  =  -47r[  p,(r)  + 

(10) 

In  this  nonlinear  equation,  e(r)  is  the  (perhaps) 
spatially  varying  dielectric  constant,  p/r)  is  the 
colloid  source  charge,  is  the  equilibrium  charge 
density  at  infinity  for  the  positive  ions  (similarly 
for  the  negative  ions),  and  v(r)  is  an  excluded 
volume  potential  preventing  the  ions  from  pene¬ 
trating  into  the  colloid.  In  our  preliminary  calcula¬ 
tions,  we  have  held  efiO  constant  in  space.  The 
Helmholtz  free  energy  of  the  ion  gas  (potential  of 
mean  force  for  the  colloids)  can  be  obtained  from 
the  potential  (^(r)  [32].  The  FAS  multigrid  tech¬ 
nique  was  used  to  solve  this  equation  for  a  pair  of 
colloid  particles  over  a  range  of  separations  to 
generate  the  interaction  potential  in  aqueous  solu¬ 
tion.  The  same  FAS  method  as  used  for  solving  the 
Poisson  equation  applies  as  long  as  the  charge 
density  is  replaced  by  the  full  nonlinear  expression 
on  the  rhs  of  Eq.  (lol 


Numerical  Results 

The  hydrogen  atom  presents  a  prototype  prob¬ 
lem  with  a  fixed  potential  to  examine  the  perfor¬ 
mance  of  the  eigenvalue  solver  by  itself.  The  elec¬ 
trostatic  potential  was  generated  numerically  on 
the  grid  using  the  Poisson  solver  with  the  nuclear 
charge  at  the  center  of  the  calculation  box.  An 
eighth-order  finite  difference  representation  was 
used  for  the  Laplacian,  both  for  the  Poisson  and 
Kohn-Sham  equations.  All  the  computations  dis¬ 
cussed  here  were  carried  out  on  a  33^  grid  except 
the  Poisson-Boltzmann  calculations  (below)  which 
were  on  a  65^  grid.  The  first  five  eigenfunctions 
and  eigenvalues  were  computed,  corresponding  to 
the  Is,  2s,  and  2p  orbitals. 

The  numerical  data  are  presented  in  Table  I. 
The  eigenvalues  are  most  accurate  for  the  2  p  states, 
which  do  not  have  a  density  at  the  nucleus.  The  Is 


TABLE  I _ 

Multigrid  data  for  hydrogen  atom. 


H  atom  multigrid  data 

Exact 

MG 

CG 

A(1s) 

-0.50000 

-0.51873 

-0.51874 

A(2s) 

-0.12500 

-0.12727 

-0.12889 

A(2p.1) 

-0.12500 

-0.12477 

-0.12472 

A(2p.2) 

-0.12500 

-0.12477 

-0.12472 

A(2p.3) 

-0.12500 

-0.12477 

-0.12473 

<r>(1s) 

1.5 

1.348 

<r>(2s) 

6.0 

5.859 

<r>(2p.1) 

5.0 

5.006 

<r>(2p.2) 

5.0 

5.006 

<r>(2p.3) 

5.0 

5.013 

The  fine  grid  spacing  was  h  =  ^.0  au.  All  data  are  given  in 
au. 


and  2  s  eigenvalues  are  in  error  by  4  and  2%, 
respectively,  while  the  2p  eigenvalues  are  only 
0.02%  in  error.  The  three  2p  orbitals  are  degener¬ 
ate  to  5  decimal  places,  which  is  some  indication 
of  the  accuracy  of  the  method.  The  difference  in 
accuracy  between  Is /2s  and  2p  states  reflects  the 
impact  of  the  distributed  nucleus  and  the  grid 
representation  on  the  orbitals  with  a  cusp.  Note 
that  the  coarse  grid  (CG)  eigenvalues  are  very 
close  to  the  final  fine  grid  results,  which  shows 
that  a  relatively  good  representation  is  possible 
there.  The  first  orbital  moments  are  also  given  and 
these  also  show  that  the  Is  and  2s  states  have  the 
largest  errors.  Cuts  through  orbitals  are  shown  in 
Figure  1;  the  errors  in  the  Is  and  2s  states  concen¬ 
trate  near  the  nucleus.  The  FAS  multigrid  data 
were  obtained  with  20  fine-scale  iterations.  While 
the  multigrid  cycles  converge  to  results  very  close 
to  the  fully  converged  grid  results,  the  least  con¬ 
verged  orbital  (as  monitored  by  the  residual)  is  the 
2  s  state,  which,  again,  provides  some  indication  of 
the  difficulty  this  orbital  causes  during  the  multi¬ 
grid  corrections. 

The  Kohn-Sham  DFT  self-consistency  step  was 
then  introduced  into  the  multigrid  solver.  Here,  I 
present  numerical  results  for  the  He  and  Ne  atoms, 
since  our  previous  results  contained  a  minor  error 
in  the  VWN  potential.  This  only  slightly  changed 
the  quantitative  results  for  total  energies  but  did 
alter  the  individual  eigenvalues.  The  multigrid 
solver  located  the  ground  state  of  the  He  atom 
with  15  fine-scale  iterations;  the  computed  energy 
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r[au] 

FIGURE  1 .  One-dimensional  cuts  through  the  five  hydrogen  orbitals.  The  analytic  curves  are  also  presented  for  the  1s 
and  2s  states  (dotted  curves  with  larger  magnitude  at  the  nucleus).  One  of  the  p  orbitals  is  nearly  a  node  along  this 
direction. 


was  -2.832  au  vs.  the  exact  result  of  -2.835  au.* 
The  first  moment  of  the  ground  state  orbital  ((r)) 
was  0.926  au  as  compared  to  the  exact  Hartree- 
Fock  value  of  0.927  au  [34]. 

The  all-electron  Ne  atom  provides  a  more  se¬ 
vere  challenge,  and  the  multigrid  solver  did  not 
converge  on  the  uniform  grid  due  to  the  difficul¬ 
ties  discussed  above.  The  coarse  grid  eigenvalues 
were  substantially  different  from  the  converged 
fine  grid  results  (especially  the  2  s  orbital),  so  the 
coarse  grid  corrections  did  not  further  reduce  the 
errors.  In  this  case,  a  simple  nested  procedure  was 
used,  passing  successively  from  coarse  to  fine 
scales.  In  previous  multigrid  work  [8],  we  com¬ 
puted  eigenvalues  and  orthogonalized  on  all  scales, 
and  this  technique  converged  (at  more  computa¬ 
tional  expense).  The  computed  total  energy  of  the 
Ne  atom  was  —128.417  au,  which  compares  with 
the  exact  numerical  result  of  -128.233  au.  The 

*We  obtained  numerically  exact  VWN  results  from  Dr. 
Svetlana  Kotochigova  in  the  atomic  physics  research  group 
directed  by  Dr.  Charles  Clark  at  NIST. 


eigenvalues  were  Is  (-29.253  vs.  -30.306  exact), 
2s  (-1.690  vs.  —1.323  exact),  and  2p  (-0.438  vs. 
—0.498  exact).  The  orbital  moments  were  very  close 
to  our  previous  results.  In  agreement  with  the 
findings  for  the  hydrogen  atom,  the  2  p  orbitals  are 
better  represented  on  the  grid  than  are  the  Is  and 
2  s  states. 

An  important  question  for  molecular  systems  is 
to  what  extent  errors  near  the  nucleus  might  affect 
electron  densities  for  valence  electrons  and,  thus, 
chemical  binding.  The  CO  molecule  was  chosen 
since  both  the  atoms  have  an  even  number  of 
electrons,  and  the  molecule  has  a  relatively  simple 
"closed-shell"  electronic  structure  (as  compared 
say  to  C2).  The  results  were  obtained  by  nested 
iterations  as  discussed  above  (Table  II).  The  sepa¬ 
rated  atom  energies  obtained  in  the  spin-restricted 
approximation  were  adjusted  according  to  the  re¬ 
cent  study  of  Baerends  et  al.  [35]  to  obtain  the 
correct  spin-polarized  separated  atom  limit.  Spin 
restriction  should  have  only  a  very  minor  effect  on 
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TABLE  II _ 

Data  for  the  CO  molecule  calculation. 


CO  molecular  data 

Exact 

Grid 

A(1s(0  core)) 

-18.745(Xa) 

-16.619 

A(1s(C  core)) 

-9.91 2(X  a) 

-9.376 

Ado-) 

-1.045(Xa) 

-1.194 

\(2a) 

-0.489(Xa) 

-0.562 

A(lTr  X  2) 

-0.413(Xa) 

-0.403 

A(3o-) 

-0.304(Xa) 

-0.319 

12.8eV(LDA) 

13.3  eV 

Eb 

1 1 .2  eV(expt) 

Eb 

7.9  eV(HF) 

^tot 

-112.79(HF) 

-109.54 

^tot 

-113.377(expt) 

Dipole 

0.24  D(Xa)C"0'^ 

0.266  DC"0  + 

Dipole 

0.112  D(expt)  C“0  + 

Dipole 

0.274  D(HF)  CO- 

The  fine  grid  spacing  was  h  =  0.3560  au,  which  implies  a  grid  cutoff  of  =  39  Ry.  The  internuclear  separation  was  2.136  au.  All 
energies  are  in  au  unless  otherwise  indicated.  The  individual  eigenenergies  are  first  given,  followed  by  molecular  binding  energies 
and  dipole  moments  in  Debyes.  HF  refers  to  the  Hartree-Fock  limit. 


the  CO  molecule.  In  the  molecular  calculations,  the 
boundary  potential  was  obtained  by  a  multipole 
expansion  (to  quadrupole  order)  of  the  charge 
density. 

The  eigenvalues  for  comparison  were  obtained 
from  the  Xa  calculations  of  te  Velde  and  Baerends 
[36].  Thus,  it  is  a  close  but  not  exact  comparison. 
Generally,  except  for  the  core  states,  the  agreement 
is  reasonable.  The  binding  energy  is  quite  close  to 
the  exact  LDA  result  [37]  and  is  an  improvement 
over  the  pseudopotential  result  (11.1  eV)  of 
Chelikowsky  et  al.  [5]  The  dipole  moment  is  very 
close  to  the  numerically  exact  X  a  result  [37],  which 
should  be  quite  close  to  the  full  LDA  number. 
Pseudopotential  calculations  [5]  gave  a  result  (0.10 
D  C“0^)  which  differs  appreciably  from  the  exact 
DFT-Xa  number.  The  highest-lying  a  orbital  is 
shown  in  a  contour  plot  in  Figure  2.  This  compares 
closely  to  the  corresponding  Hartree-Fock  orbital 
given  by  Huo  [38]. 

The  accuracy  for  the  binding  energy  and  dipole 
moment  are  surprising  in  view  of  the  crudity  of 
the  treatment  near  the  two  atomic  nuclei.  The 
effective  cutoff  in  the  grid  representation  is  only  39 
Ry,  which  is  far  smaller  than  would  be  required  to 
obtain  fully  converged  all-electron  plane-wave  re¬ 
sults.  At  any  rate,  the  results  are  encouraging  and 
show  that  viable  numerical  results  for  molecular 
binding  are  possible  with  the  grid  techniques.  The 
next  step  is  to  improve  the  resolution  in  the  core 


region  via  grid  refinement  techniques  (below)  to 
test  how  this  affects  the  overall  accuracy  and 
multigrid  efficiency  of  the  method. 

Next,  results  are  presented  for  the  computation 
of  interaction  free  energies  between  colloid  parti¬ 
cles  where  the  ion  gas  is  modeled  at  the  mean-field 
level.  Here,  a  second-order  Laplacian  was  em¬ 
ployed,  and  the  Poisson-Boltzmann  equation  was 
solved  with  the  FAS  multigrid  nonlinear  tech¬ 
nique.  Twenty  iterations  were  used  on  the  fine 
scale,  and  the  free  energy  was  converged  to  a 
small  fraction  of  kT  by  this  point.  The  interaction 
energy  curve  is  presented  in  Figure  3.  Consistent 
with  previous  studies,  the  pair  interaction  is  purely 
repulsive  in  the  solution.  Recent  experiments  by 
Larson  and  Grier  [39]  suggested  that  potential 
minima  may  exist  when  the  colloid  particles  are 
confined  between  plates  or  when  in  a  crystal  ge¬ 
ometry.  We  are  beginning  to  examine  the  many- 
body  colloid  interactions  to  probe  whether  nonad¬ 
ditive  effects  occur  for  trimers  and  higher  n-mers. 
In  preliminary  results,  we  see  no  indication 
of  nonadditivity  for  the  trimer  [40].  While  the 
Poisson-Boltzmann  equation  is  inherently  nonlin¬ 
ear  and  thus  many-body,  it  neglects  ion  correla¬ 
tions,  so  the  lack  of  nonadditivity  in  our  calcula¬ 
tions  may  reflect  this  fact. 

Finally,  I  discuss  a  new  high-order  conservative 
equation  scheme  which  should  allow  accurate 
computations  on  composite  grids  [27].  As  dis- 
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FIGURE  2.  Contour  plot  of  the  highest-lying  cr  orbital  of  the  CO  molecule.  The  carbon  atom  is  on  the  bottom. 


cussed  above,  these  refinements  will  be  crucial 
both  for  accuracy  and  multigrid  convergence  rea¬ 
sons,  especially  in  the  region  of  atomic  nuclei  in 
all-electron  calculations.  To  motivate  the  need  for 
the  conservative  equations,  imagine  a  simple  two- 
dimensional  compositive  grid  with  a  patch  in  the 
center  with  grid  spacing  one-half  the  rest  of  the 
domain.  When  the  multigrid  correction  cycle  is 
carried  out  on  the  coarser  scale,  the  defect  correc¬ 
tion  r  is  defined  only  over  the  region  covered  by 
the  refinement  patch.  It  is  easy  to  show  that  the 
sum  of  r  over  the  patch  is  not  zero.  Most  of  the 
terms  actually  do  cancel  in  the  interior,  but  some 
terms  are  left  near  the  boundaries.  By  looking  at 
Eq.  (2),  we  can  see  this  implies  that  the  total  source 
strength  has  been  changed.  Nonconservation  of 
source  strength  can  pollute  the  solution  over  the 
whole  domain,  especially  in  the  case  of  the  long- 
ranged  Coulomb  force.  Bai  and  Brandt  [26] 
developed  a  second-order  scheme  to  balance  the 
fluxes  on  the  boundary  and  applied  it  to  a  two- 
dimensional  problem.  We  extended  their  tech¬ 


nique  to  three  dimensions  (finite  and  periodic  sys¬ 
tems)  and  have  now  generalized  it  for  higher-order 
equations.  The  method  also  requires  high-order 
interpolation  near  the  patch  boundary  to  ensure 
correct  behavior  over  the  whole  domain.  The  re¬ 
sulting  r  boimdary  terms  resemble  higher-order 
flux  operators,  but  do  not  correspond  exactly.  The 
net  result  is  that  the  sum  of  r  is  zero  to  machine 
precision  and  high-order  solutions  are  possible 
over  the  whole  composite  domain.  Details  of  the 
method  will  be  presented  shortly  [27].  Efforts  are 
underway  to  incorporate  the  conservative  equa¬ 
tions  into  the  Poisson  and  Kohn-Sham  solvers  to 
improve  the  resolution  near  the  nuclei. 


Summary 

A  real-space  multigrid  method  has  been  devel¬ 
oped  for  solving  the  linear  Poisson  and  nonlinear 
Kohn-Sham  and  Poisson-Boltzmann  equations. 
The  advantages  of  the  grid  techniques  include 
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Lattice  spacings 

FIGURE  3.  Interaction  potential  of  mean  force  between  two  colloid  particles  as  a  function  of  lattice  spacing.  The 
computation  was  performed  on  a  65^  grid.  The  lattice  spacing  was  1600  au  and  the  charge  on  each  colloid  was  600e 
The  bulk  Ion  concentration  yielded  a  total  number  of  48,000  positive  Ions  and  49,200  negative  ions  in  the  calculation 
box.  The  dielectric  constant  was  taken  to  be  that  of  bulk  water  at  room  temperature.  The  mean  Debye  length  under 
these  conditions  is  roughly  8000  au,  or  five  lattice  spacings.  The  colloids  occupied  one  cubical  lattice  unit.  The 
Interaction  is  given  in  units  of  kT. 


adaptivity  and  linear  scaling  capabilities.  For  the 
Kohn-Sham  case,  tests  have  been  carried  out  on 
atoms  and  small  molecules.  The  results  show 
promise  that  the  grid  techniques  can  generate  rela¬ 
tively  accurate  all-electron  results,  even  though  the 
core  region  is  treated  crudely  if  the  grid  is  uni¬ 
form.  With  the  incorporation  of  composite  grid 
techniques,  greater  accuracy  can  be  located  near 
the  nuclei  with  only  a  small  increase  in  computa¬ 
tional  cost.  Future  work  will  focus  on  incorporat¬ 
ing  our  new  high-order  conservative  equations 
scheme  [27]  into  the  quantum  chemical  grid 
method.  Other  areas  of  interest  are  the  finite  tem¬ 
perature  DFT  theory  of  Alavi  et  al.  [41]  and  real- 
space  density  matrix  methods  [13,  18,  19].  In  re¬ 
lated  work,  the  Poisson-Boltzmann  solver  will  be 
used  to  study  colloid  interactions  in  large  systems 
and  carry  out  Monte  Carlo  simulations  of  colloids 
and  polyelectrolyte  chains  in  solution. 
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ABSTRACT:  A  parabolic  potential  supports  resonances.  We  examine  how  the 
resonance  energies  are  transformed  into  Floquet  quasi-energies  when  the  particle  is 
simultaneously  subjected  to  a  monochromatic  electric  field.  ©  1997  John  Wiley  &  Sons,  Inc. 
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Introduction 

The  parabolic  (or  inverted  harmonic)  potential 
has  been  studied  extensively  for  the  same 
reason,  which  gives  a  special  importance  to  the 
harmonic  potential.  The  lowest  energy  levels  in  a 
well  of  arbitrary  shape  can  often  be  calculated 
accurately  from  a  local  quadratic  approximation  of 
the  potential  around  the  equilibrium  position.  The 
diatomic  molecule  is  the  best  example  of  this  situ¬ 
ation  [1].  For  a  polyatomic  molecule,  the  normal 
mode  approximation  reduces  complicated  poten¬ 
tial  energy  surfaces  to  a  sum  of  harmonic  poten- 

Correspondence  to:  R.  Lefebvre. 


tials  [2].  When  a  chemical  reaction  is  considered, 
the  potential  energy  surface  in  the  region  of  the 
transition  state  is  usually  written  as  a  sum  of 
harmonic  potentials  plus  a  parabolic  term  ex¬ 
pressed  with  the  reaction  coordinate  [3].  In  a  one¬ 
dimensional  treatment  of  the  reaction  the  trans¬ 
missivity  through  the  parabolic  barrier  represents 
the  cumulative  reaction  probability,  which  is  the 
basic  quantity  to  calculate  the  microcanonical  and 
canonical  reaction  rate  constants  [4].  Bell  [5]  has 
made  an  extensive  use  of  this  approximation.  An 
interesting  application  to  heavy-atom  tunneling  as 
an  explanation  for  the  automerization  of  cyclobu¬ 
tadiene  is  given  by  Carpenter  [6].  Use  of  a  parabolic 
barrier  has  also  been  considered  as  a  useful  test  for 
the  elaboration  of  the  method  based  on  autocorre- 
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lation  functions  of  the  reactive  flux  operator  to 
calculate  thermal  rate  constants  [7].  This  approach 
avoids  the  preliminary  evaluation  of  state-to-state 
reaction  probabilities. 

Because  the  scattering  of  a  particle  by  a  parabolic 
barrier  is  one  of  the  few  exactly  soluble  problems 
of  nonrelativistic  quantum  mechanics  [8-10],  this 
is  the  starting  point  of  the  mapping  method  ad¬ 
vanced  by  Miller  and  Good  [11]  to  derive  a  semi- 
classical  scattering  method  valid  close  to  the  top  of 
an  arbitrary  barrier,  both  below  and  above  the 
barrier  maximum.  It  has  also  been  possible  to 
define  solutions  of  the  wave  equation  with  outgo¬ 
ing  boundary  conditions,  which  are  the  analog  of 
the  resonant  states  (that  is  corresponding  to  poles 
of  the  scattering  matrix)  [10,  12].  The  idea  of  asso¬ 
ciating  resonances  to  barriers  has  been  further  ex¬ 
tended  and  applied  to  the  identification  of  reaction 
thresholds  by  Friedman  and  Truhlar  [13, 14].  As  an 
example  it  has  been  shown  [14]  that  the  lifetimes 
of  the  transition  states  of  the  reactive  collision 
H  +  H2  H2  +  H  are  well  accounted  for  by  the 
resonances  of  smallest  width  of  the  effective 
parabolic  barriers  associated  with  the  potential 
maxima  along  the  vibrationnally  adiabatic  poten¬ 
tial  curves. 

New  perspectives  have  been  opened  recently  in 
a  quite  different  field,  which  could  make  the 
parabolic  potential  to  correspond  to  a  realistic  situ¬ 
ation.  It  has  been  shown  by  Sundaram  et  al,  [15] 
that  by  grading  the  composition  x  in  superlattices 
of  type  GaAs/A4Gai_^  it  is  possible  to  grow  a 
sample  presenting  a  so-called  parabolic  quantum 
well  (in  fact  a  harmonic  well)  between  two  barri¬ 
ers.  This  is  useful  to  produce  a  resonant  tunneling 
transistor  with  equally  spaced  peaks  in  the  charac¬ 
teristics  [16,  17].  This  grading  technique  can,  in 
principle,  be  used  to  produce  a  potential  of  arbi¬ 
trary  shape,  among  which  the  parabolic  barrier. 

In  this  study  we  consider  the  scattering  of  a 
particle  by  a  parabolic  barrier  in  the  presence  of  a 
monochromatic  field.  The  effect  of  a  field  on  a 
particle  trapped  in  a  double-well  potential  has 
been  very  actively  studied  in  recent  years.  The 
most  spectacular  result  is  that  under  certain  condi¬ 
tions  tunneling  through  the  barrier  separating  the 
wells  can  be  completely  suppressed  [18-20].  Our 
problem  is  somewhat  different  since  the  barrier 
separates  two  regions  where  free  (rather  than 
bound)  motion  is  possible.  We  are  thus  close  to  the 
situations  met  when  a  rearrangement  takes  place, 
for  instance  electron  or  atom  exchange.  We  will 
show  that  the  Floquet  quasi-energies  and  wave 


functions  of  this  situation  can  be  exactly  deter¬ 
mined.  These  resonances  should  play  a  role  in  the 
reaction  in  the  presence  of  the  field  since  methods 
exist  to  reconstruct  the  cumulative  reaction  proba¬ 
bility  from  Siegert  eigenvalues  and  eigenfunctions 
[4,  21]  even  when  the  resonances  do  not  corre¬ 
spond  to  the  intuitive  idea  of  metastable  states 
with  long  lifetimes. 

In  the  following  section  we  recall  that  the  reso¬ 
nance  energies  and  eigenfunctions  for  a  parabolic 
barrier  can  be  derived  in  a  very  simple  way  with 
the  help  of  the  complex  rotation  method  with  a 
particular  value  given  to  the  rotation  angle.  The 
third  section  is  devoted  to  the  derivation  of  the 
Floquet  quasi-energies  and  eigenfunctions  when 
the  particle  is  interacting  with  a  monochromatic 
field.  The  interaction  is  written  in  the  length  gauge. 
We  show  in  the  fourth  section  that  the  quasi-en¬ 
ergies  can  also  be  obtained  from  a  limiting  proce¬ 
dure  applied  to  the  eigenvalues  of  a  time-indepen¬ 
dent  Hamiltonian  [22].  This  procedure  is  valid  for 
an  arbitrary  oscillator  interacting  with  a  one-mode 
field  and  yields  analytical  results  in  the  present 
case.  Finally  the  fifth  section  describes  a  numerical 
method  applicable  also  to  an  arbitrary  oscillator. 
This  is  based  on  a  propagation  plus  matching 
technique  [23]  that  yields  directly  the  Fourier  com¬ 
ponents  of  the  Floquet  wave  function.  Some  exam¬ 
ples  of  multicomponent  Floquet  eigenfunctions  for 
the  parabolic  barrier  are  presented.  Complex  rota¬ 
tion  (or  scaling)  of  the  coordinate  is  required  to 
implement  this  method.  We  study  the  transition 
from  complete  scaling  to  incomplete  (or  exterior) 
scaling.  This  allows  for  an  investigation  of  the 
shape  of  the  components  of  the  wave  function  as 
they  would  result  from  a  treatment  without  scal¬ 
ing  since  it  is  possible  to  extend  arbitrarily  far  the 
point  of  junction  between  real  and  complex  inte¬ 
gration  paths. 


Resonances  of  a  Parabolic  Barrier 

For  a  Hamiltonian  written  in  reduced  units: 

^2 

H  =  -  (1) 

the  elements  of  the  scattering  matrix  have  poles 
given  by  the  formula 

E,=  -  i(o, (2v  +  l),  (2) 
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This  result  can  also  be  obtained  in  a  very  simple 
way  with  a  complex  rotation  of  the  coordinate  [4, 
12].  Because  this  technique  will  be  useful  when  a 
time-periodic  term  is  added  to  the  Hamiltonian, 
we  repeat  here  this  derivation.  After  substitution 
of  to  X  in  the  wave  equation,  one  looks  for 
solutions  vanishing  for  p  +  oo  of  the  equation: 


^2/61  ,2  ^2 
e  (OqP 


X(p)  =  Exip). 


(3) 


With  the  choice  6  =  7r/4  the  wave  equation  is 
transformed  into: 


dp^ 


•  2  2 

1(0^  p 


x(p)  =  Exip).  (4) 


After  multiplication  by  i  this  becomes: 


+  0^1  p^ 


X(p)  =  iEx(p)- 


(5) 


This  shows  that  ;^(  p)  is  an  eigenfunction  of  the 
harmonic  oscillator  Xv^P^  ^ 
the  corresponding  eigenenergy  (OqUv  -h  1).  Equa¬ 
tion  (2)  for  is  recovered.  In  Ref.  [12]  it  has  been 
shown  in  detail  how  to  recover  the  harmonic  oscil¬ 
lator  wave  functions  by  the  substitution  x 

into  the  parabolic  cylinder  functions 
which  are  the  solutions  of  the  unsealed  wave 
equation  of  the  parabolic  barrier. 


Floquet  Eigenfunctions  and 
Ouasi-energies  for  a  Particle 
Scattered  by  a  Parabolic  Barrier  and 
Interacting  with  an  External  Field 

The  Hamiltonian  is  now: 

H,  = - r  -  (O^X^  -h  \XCOs((i)t).  (6) 


where  Xv^  is  oscillator  wave  function  of 

quantum  number  v  and  x^it)  is  a  solution  of  the 
classical  equation  of  motion: 

-y(0  -h  2o)lx^(t)  -f-  Acos(a)f)  =  0.  (8) 

L(t)  is  the  Lagrangian: 
x^(t) 

- ^2^2(^)  _  Ax^(t)cos(o;t).  (9) 

In  the  spirit  of  the  method  giving  the  reso¬ 
nances  of  the  field-free  case,  we  scale  x  -  xj,t)  in 
the  factor,  which  gives  the  spatial  behavior  of  the 
wave  function.  This  variable  becomes  u(t)  =  r]X  — 
r]X^(t),  We  also  make  the  appropriate  sign  changes 
in  the  classical  equation  of  motion  and  the  La¬ 
grangian.  The  time-dependent  wave  function  is 
written: 


0  =  ^(w(f))exp(-z£0 


X  exp 


.  (10) 


After  introduction  of  this  function  in  the  time- 
dependent  Schrodinger  equation  and  some  alge¬ 
bra,  it  is  found  that  most  terms  compensate  and 
that  ^(w)  obeys  the  equation: 

E^(u)  ^  -  7]^  - (11) 


Choosing  y]  =  gives  the  time-indepen- 

dent  equation  of  a  harmonic  oscillator  already  met 
in  the  field-free  case  [Eq.  (4)].  This  shows  that  ^(u) 
is  in  fact  ^^((^  ”  and  that  E  is 

-io)Q(2v  +  1).  However  to  identify  a  Floquet 
quasi-energy,  it  is  necessary  to  obtain  a  solution  of 
the  time-dependent  Schrodinger  equation  of  the 
form: 


We  start  from  the  general  solution  of  the  time- 
dependent  Schrodinger  equation  given  by  Kerner 
[24]  for  a  harmonic  oscillator  interacting  with  the 
same  field  [the  Hamiltonian  differs  just  by  the  sign 
given  to  the  potential  in  Eq.  (6)]: 


0  =  -  x^(t))exp(-i{2v  -h  DcoqO 


X  exp 


x,(0 

2 


(x  -  x^it)) 


(7) 


^^(x,  0  =  exp[  -fEpt]T>^(x,  t)  (12) 

with  <I>^(x,  0  a  periodic  function  of  time.  This 
defines  the  Floquet  quasi-energy  Ep  and  the  Flo¬ 
quet  eigenfunction  <F^(x,  0.  In  order  to  achieve 
this  goal  we  choose  a  periodic  solution  of  the 
classical  equation  of  motion  for  x^(f)  and  we  sepa¬ 
rate  out  the  nonperiodic  time-dependent  factor  in 
the  wave  function  (Breuer  and  Holthaus  [25]).  The 
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periodic  solution  of  the  classical  equation  is: 

2Acos(  wf) 


x,it) 


0)^  +  4:(x)q 


(13) 


and  the  nonperiodic  factor  in  the  wave  function  is: 


The  eigenfunctions  of  the  y  oscillator  can  be 
used  to  perform  a  diabatic  expansion  of  an  eigen¬ 
function  of  this  Hamiltonian: 

0(x,  y)  =  'Lv^„{x)x„{y)-  (19) 

n 


exp[—i(jOQ(2v  +  l)t]exp 


fj  L(r)dr 
i - 1 


.  (14) 


The  coupled  equations  for  the  W„(x)  are  found 
to  be: 


Introduction  of  x^(t)  as  given  by  Eq.  (13)  into 
the  Lagrangian  allows  for  the  calculation  of  the 
integral  in  Eq.  (14).  The  Floquet  quasi-energies  are 
finally: 


-  -io)o(2v  +  1)  +  2  (15) 

2{(o  +  4(^0 ) 

We  note  the  similarity  of  this  result  with  that  of 
the  harmonic  oscillator  [22,  25].  The  shift  with 
respect  to  the  field-free  energy  (purely  imaginary 
in  the  present  case)  is  independent  of  the  quantum 
number  and  differs  by  a  change  in  the  sign  of  the 
contribution  of  the  potential. 


Another  Route  to  the  Floquet 
Quasi-Energies  of  the  Parabolic 
Barrier 

We  show  now  that  these  quasi-energies  can  be 
obtained  in  an  extremely  simple  way  already  used 
with  success  for  the  harmonic  oscillator  [22]. 

A  Floquet  solution  of  the  time-dependent 
Schrodinger  equation  can  be  written: 


dX^ 

+  Ax 


2  2 
-  coix^ 


WJx) 


W„_i(x)  + 


n  +  1 


0) 


W„,,(x) 


=  (E  -  na))W„(x). 


(20) 


For  very  large  values  of  n  the  couplings  for  the 
relevant  VV„(x)'s  for  a  given  A  depend  weakly  on 
n.  As  shown  by  Shirley  [26],  in  this  limit  the 
components  of  the  wave  function  of  the  time-inde¬ 
pendent  approach  merge  into  the  components  of 
the  Floquet  eigenfunction.  For  this  equivalence  to 
hold  use  the  correspondence  rule 

A  A\/f7 

-  =  lim  (21) 

^  A— >0,n-^oo  y  (O 

The  eigenfunctions  and  eigenenergies  of  H(x,  y) 
can  be  calculated  exactly.  The  limit  given  by  Eq. 
(21)  is  to  be  taken  afterward.  With  two  new  vari¬ 
ables  x'  and  y'  written: 


n=  +00 

+^(x, t)  =  exp[ -/Epf]  Y.  (16) 

n=  -CO 

The  LJ„(x)'s  obey  the  coupled  differential  equa¬ 
tion 


x'  =  X  cos(^)  “  y  siniO), 
y'  =  X  siniO)  +  y  cos(^). 


(22) 


the  Hamiltonian  becomes  separate  if  6  satisfies: 


dX^ 


+V(x) 


U„(x)  +  —[U„_,{x)  +  U„^,(x)] 
=  iE,-noj)U„ix).  (17) 


tan(20)  = 


0) 


+  (x)i 


We  consider  also  the  time-independent  Hamil¬ 
tonian: 


The  new  frequencies  are  given  by: 


(23) 


H(x,  y)  = 


dx^ 


2  2 

“WqX^  ■ 


4h-f 

^y"  1 2/ 


y^-l-Axy.  _f)[2  =  —  _|.  |^_j  _  _sin(20), 

(18)  (24) 


0) 
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( 


“  WoSin^(0) 


+ 


A 

2 


sin(2^). 


(25) 


the  shift  is: 


A' 

^  2(a)^  4(Wo) 


(33) 


The  eigenenergies  are  the  sums  of  the  resonance 
energies  of  the  new  parabolic  barrier  and  of  the 
eigenenergies  of  the  decoupled  harmonic  oscilla¬ 
tor: 


=  -ii2n,  +  DHi  +  (n2  +  (26) 


in  complete  agreement  with  Eq.  (15). 


Algorithm  for  Resonance  Wave 
Functions 


The  shift  induced  by  the  coupling  is: 

""  ■”^*(2^1  +  l)(ni  -  Wq) 

+  {n2  +  |)(fi2  ~  0)).  (27) 

As  A  ->  0,  we  have  a>o  and  ^  (o. 

However,  the  correspondence  rule  requires  n2  to 
go  to  infinity.  We  are  left  with: 

A„„„,  =  «2(D2  -  «).  (28) 

Since,  as  A  ^  0,  we  have  ^  0,  ^  can  be  given 

the  value: 


0  = 


+  (On 


(29) 


The  frequency  fi2  can  be  obtained  from: 


a. 


=  —  cl)q 0^  H-  ("^1  (^  ~ 


=(0 


1  + 


4A^ 


(o^io)^  -h  4a>Q) 
A  binomial  expansion  gives 

2A2 

^  a)(o)^  +  4a>o) 


(30) 


(31) 


The  Floquet  wave  function  given  in  Eq.  (10)  can 
be  written  in  principle  in  a  standard  way  [cf.  Eq. 
(16)]  showing  the  spatial  function  U^(x)  associated 
to  each  "photon"  state  (i.e.,  is  to  each  exponential 
However,  the  wave  fimction  is  a  product  of 
five  exponentials  of  periodic  functions  of  time.  The 
Fourier  expansion  of  the  Floquet  function  is  there¬ 
fore  a  product  of  five  Fourier  series  which  could  at 
least  formally  be  written  compactly  with  general¬ 
ized  Bessel  functions  [27].  Rather  than  trying  to 
analyze  this  rather  complicated  expression,  we  will 
show  that  the  graphical  display  of  these  Fourier 
components  can  be  easily  obtained  from  a  rather 
general  numerical  algorithm. 

We  have  developed  previously  [23]  an  efficient 
method  to  obtain  the  eigenenergy  and  the  eigen¬ 
function  of  a  resonance  in  a  multichannel  situa¬ 
tion.  This  search  for  a  Floquet  quasi-energy  and  its 
associated  wave  function  belongs  to  this  class  of 
problem.  We  recall  here  the  main  steps  in  this 
approach.  The  Fox-Goodwin  propagation  method 
[28]  is  used.  This  consists  first  in  the  definition  on 
a  grid  of  the  inward  and  outward  matrices: 

K  =  (34) 

Pm  =  U^i(Up-\  (35) 

where  the  U^/s  are  matrices  of  independent  vector 
solutions  of  the  coupled  equations  at  point  m  of 
the  grid. 

The  propagation  is  performed  with: 


The  shift  is: 


^2(112 


))  =  2n. 


(»(  (x)^  “h  4ci>q  ) 


(32) 


K-i=  [2l3(x)  -  a(x  +  h)Tl]~\(x  -  h),  (36) 
and 


After  introduction  of  the  correspondence  rule 
[Eq.  (21)],  with  identified  with  v  and  ^2  dropped 


=  [2p(x)  -a(x-  h)F:,r\(x  +  h),  (37) 
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where  a(x)  and  j8(x)  are  the  Numerov  matrices: 

a(x)  =  h  1  + —Wix)  ,  (38) 

P(x)  =  h\l-  —mx)\,  (39) 


with  W(x)  =  El  -  V(x),  V(x)  being  the  potential 
matrix.  The  matching  condition  at  point  n  is: 

detiPJ- (P„%,)"'|  =  0.  (40) 

The  initial  values  for  the  U's  can  be  taken  as 
zero  in  classically  forbidden  regions,  since  there  is 
spontaneous  generation  [29]  of  the  appropriate 
boundary  conditions  when  propagating  from  such 
a  region  toward  a  matching  point  in  the  classically 
allowed  region.  This  spontaneous  generation  holds 
also  when  complex  rotation  is  used,  since  this  has 
the  same  effect  on  the  wave  function  as  propaga¬ 
tion  into  a  classically  forbidden  region. 

The  entire  procedure  leads  to  an  estimate  of  the 
left-hand  side  of  Eq.  (40)  and  is  repeated  until  this 
condition  is  satisfied.  The  Newton-Raphson  proce¬ 
dure  can  be  applied  because  we  look  for  a  zero  of 
an  analytic  function  of  the  complex  energy.  The 
matching  relation  is  also  the  compatibility  condi¬ 
tion  for  a  set  of  homogeneous  equations  satisfied 
at  the  matching  point  by  the  components  of  the 
vector  solution.  It  is  possible  to  calculate  them, 
except  for  a  normalization  factor,  and  then  to 
propagate  them  away  from  the  matching  point  in 
both  directions  by  using  the  matrices  and 
The  same  prescription  is  used  in  Johnson's  renor¬ 
malized  Numerov  method  [30].  The  storage  of  all 
these  matrices  guarantees  [31]  that  no  overflow  in 
closed  channels  or  loss  of  independence  occurs 
when  reaching  a  classically  forbidden  region  (or 
equivalently  a  region  with  damping  of  the  compo¬ 
nents  when  complex  rotation  is  used). 


Examination  of  Some  Floquet 
Eigenfunctions  of  the  Parabolic 
Barrier 

The  emphasis  in  the  context  of  resonances  asso¬ 
ciated  to  barriers  has  been  so  far  on  the  associated 
complex  energies  [8-10].  Little  has  been  said  about 
the  behavior  of  the  resonance  wave  functions.  In 


Ref.  [12]  some  scattering  wave  functions  calculated 
at  an  energy  equal  to  the  top  barrier  energy  have 
been  shown.  This  energy  represents  the  real  part  of 
all  resonance  energies  [cf.  Eq.  (2)]  of  the  field-free 
case.  The  scattering  wave  functions  show  clearly  a 
localization  in  the  region  of  maximum  potential 
energy.  In  Ref.  [21]  the  resonance  energies  of  the 
Eckart  barrier  are  derived  and  a  particle-in-a-box 
technique  is  used  to  calculate  some  resonance 
functions  for  both  the  symmetric  and  nonsymmet- 
ric  cases.  Localization  within  the  potential  region 
is  again  demonstrated. 

We  will  display  now  some  resonance  complex 
wave  functions  of  both  the  field-free  case  and 
field-on  case  for  a  parabolic  barrier. 


FIELD-FREE  CASE 

We  consider  the  following  values  for  the  param¬ 
eters: 


(x)  =  1.2,  Wq  =  0.8,  A  =  0.5  (arbitrary  units) 

(41) 


The  "zero-point"  field-free  (z^  =  0)  resonance  is 
studied  first.  Two  options  are  examined 

Complete  Scaling  of  the  Coordinate 

Coordinate  x  is  changed  into  everywhere. 
A  grid  of  2000  points  with  step  0.01  is  used. 
Although  the  choice  6  =  7r/4  has  been  shown  to 
transform  a  resonance  wave  function  of  the  barrier 
into  a  bound-state  wave  function  of  the  harmonic 
well  (cf.  discussion  in  second  section),  examination 
of  the  factor  which  is  common  to  all 

resonance  wave  functions  shows  that  localization 
is  obtained  for  a  range  of  angles,  with  0  <  0  <  ir/l. 
We  give  in  Figure  1  the  field-free  resonance  wave 
function  for  the  following  values  of  the  rotation 
angle:  6  =  7r/4,  0.5,  0.3,  and  0.5  rad.  For  all  values 
of  6  except  the  smallest  one,  the  resonance  energy 
has  a  seven-figure  agreement  with  the  expected 
result  (-iO.S).  For  0  =  0.05  rad  the  resonance  en¬ 
ergy  is  found  to  be  —0.001658  —  iO.7948.  This  poor 
estimate  is  due  to  the  fact  that  vanishing  of  the 
wave  function  at  the  two  edges  of  the  grid  begins 
to  be  a  constraint,  as  shown  on  the  figure  (panel 
with  6  =  0.05).  A  wider  grid  would  restore  the 
accuracy. 
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FIGURE  1.  Field-free  resonance  wave  functions  of  a  barrier  with  complete  scaling  and  various  rotation  angles. 
Parameters  in  arbitrary  units  are:  barrier  half  frequency  o)q  =  0.8,  field  frequency  (w  =  1 .2,  coupling  constant  A  =  0.5. 
The  grid  extends  from  - 10  to  10  (arbitrary  units).  The  calculation  with  6  ~  0.05  rad  does  not  localize  strongly  enough 
the  wave  function,  and  a  poor  estimate  of  the  resonance  energy  Is  obtained  (see  text).  For  6-^14,  the  wave  function 
is  real.  For  all  other  values  of  the  angle  the  wave  function  is  complex.  The  normalization  in  all  cases  is  chosen  so  as  to 
give  unit  value  to  the  amplitude  of  the  real  part  of  the  wave  function  at  the  matching  point  (x  =  0).  The  panels  give  the 
real  and  the  imaginary  parts  of  the  wave  function,  the  imaginary  parts  being  given  by  the  thiner  curves. 


Exterior  Scaling  [32-34] 

Coordinate  x  is  changed  into  Xq  +  (x  —  Xo)c^^ 
for  X  >  Xq  and  into  -Xq  +  (x  +  Xq)c^^  for  x  < 
-Xq,  Xq  being  some  positive  abcissa.  The  real  part 
of  the  integration  range  extends  from  —Xq  to  +Xq. 
This  has  the  advantage  of  displaying  on  the  real 
segment  the  wave  function  as  it  would  look  in  a 
real  coordinate  treatment  (with  complex  boundary 
conditions  compatible  with  the  asymptotic  behav¬ 
ior).  With  this  procedure  the  asymptotic  conditions 
for  starting  the  left  to  right  and  the  right  to  left 
propagations  are  again  the  vanishing  of  the  com¬ 
ponents  of  the  resonance  wave  function.  We  take 
Xq  ^  5.  The  coordinate  axis  is  deformed  in  the 
following  way:  from  —10  to  —5  the  integration 
axis  makes  an  angle  tt  +  0.5  rad,  with  the  real  axis; 


between  —5  and  +5  it  is  along  the  real  axis;  from 
+  5  to  +10  the  integration  axis  makes  an  angle 
+  0.5  rad  with  the  real  axis. 

Figure  2  gives  the  wave  function  of  the  field-free 
resonance  wave  function  for  quantum  number 
u  =  0.  The  energy  was  found  to  be  -  z0.80000017. 
The  effect  of  complex  rotation  for  |x|  >  5  is  clearly 
visible. 

FIELD-ON  RESONANCE  WAVE  FUNCTION 

The  parameters  are  now  (Oq  =  0.5,  (o  =  1,  and 
A  =  1.  We  note  that  for  such  a  choice  the  shift  of 
the  harmonic  oscillator  has  a  singularity,  since  in 
Eq.  (15)  o)q  has  to  be  replaced  by  Icoq.  The  calcula¬ 
tion  is  now  a  multichannel  one  done  with  the 
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no  scaling  from  -5.  to  +5. 


FIGURE  2.  A  field-free  resonance  wave  function  of  a 
barrier  with  exterior  scaling.  Parameters  are  those  of 
Figure  1.  The  grid  extends  from  -10  to  10  (arbitrary 
units).  The  scaling  is  applied  to  the  two  segments  from 
- 1 0  to  - 5  and  from  -1-5  to  -MO  with  an  angle  0  =  0.5 
rad.  The  energy  provided  by  the  calculation  is  E  = 
-0.13(-6)  -  /0.800017  (expected  value  E  =  -0.8). 
Normalization  as  In  Figure  1,  the  real  part  of  the  wave 
function  being  unity  at  x  =  0. 


procedure  described  in  the  third  section.  The  strat¬ 
egy  is  as  follows:  the  coupled  equations  [Eq.  (17)] 
are  infinite  in  number,  and  truncation  is  obviously 
necessary.  We  call  the  channel  with  n  =  0  the 
reference  channel  With  no  field  present  the  reso¬ 
nance  energies  and  eigenfunctions  of  this  channel 
are  considered  as  describing  the  system.  We  note 
that  the  uncoupled  channel  with  arbitrary  n  also 
represent  the  field-free  system,  but  with  eigenener- 
gies  displayed  by  no).  This  is  a  distinctive  feature 
of  the  Floquet  theory.  When  the  field  is  present, 
the  energies  close  to  those  of  the  reference  channel 
are  being  determined.  The  additional  channels  are 
introduced  in  a  symmetrical  way,  one  above  and 
one  below  the  reference  channel  at  a  time.  Conver¬ 
gence  with  respect  to  the  number  of  channels  is 
then  studied,  since  the  critical  number  for  a  given 
accuracy  of  the  quasi-energy  can  vary  widely.  Table 


TABLE  I _ 

Convergence  toward  the  correct  resonance 
energy  with  the  number  N  of  channels  in  the 
multichannel  approach  to  Floquet  treatment  of  a 
parabolic  barrier.^ 


N  Resonance  energy 


1 

-0.90(-1l) 

-/0.50000000 

3 

0.23492517 

-/0.56062542 

5 

0.25010394 

-/0.49954770 

7 

0.24982569 

-/0.50052883 

9 

0.25003014 

-/0.500000811 

11 

0.24999985 

-/0.50000035 

13 

0.25000008 

-/0.50000002 

® Parameters  are  in  arbitrary  units:  barrier  half-frequency  o)q 
-  0.5,  frequency  of  the  field  w  =  1 ,  coupling  A  =  1 ,  rotation 
angle  6  =  0.5  rad.  The  results  with  complete  scaling  or 
exterior  scaling  for  |x|  >  5  differ  at  most  by  one  unit  in  the 
last  figure.  Expected  resonance  energy;  E  =  0.25  -  /O.5. 
A/  =  1  is  the  field-free  calculation. 


I  shows  how  the  exact  energy  is  approached  pro¬ 
gressively.  A  7-figure  accuracy  is  achieved  in  this 
example  with  13  channels.  Figure  3  gives  five  of 
the  components  of  the  13-channel  Floquet  eigen¬ 
function,  including  the  reference  channel  (panel 
n  =  0),  and  two  channels  above  (n  =  1, 2)  and  two 
below  (n  =  -1,-2).  Despite  the  number  of  chan¬ 
nels  needed  for  an  accurate  result,  the  amplitude 
of  the  wave  function  of  the  reference  channel  is 
still  dominating  the  Floquet  wave  function  (the 
components  are  not  drawn  on  the  same  scale). 

Figure  4  shows  the  Floquet  wave  functions  of 
the  same  model,  with  the  same  choice  of  functions, 
but  with  now  exterior  scaling  for  |x|  >  5.  The 
wave  functions  on  the  interval  —  5  <  x  <  +5 
are  calculated  for  real  x:  this  is  exactly  how 
they  would  appear  if  (complicated)  boundary 
conditions  valid  for  a  real  coordinate  treatment 
were  introduced  in  the  solution  of  the  coupled 
equations. 


Conclusions 

We  have  shown  that  the  methods  (both  analyti¬ 
cal  and  numerical)  which  can  be  used  to  obtain  the 
Floquet  quasi-energies  and  eigenfunctions  of  the 
harmonic  oscillator  require  little  change  to  give  the 
corresponding  information  for  the  parabolic  bar- 
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FIGURE  3.  Field-free  and  field-on  resonance  wave  functions  for  a  parabolic  barrier,  with  complete  scaling  (c.  sc.).  The 
parameters  are  given  in  Table  I.  Panel  A  =  0:  the  complex  resonance  wave  function  of  the  parabolic  barrier  In  the 
absence  of  the  field.  In  the  other  panels:  five  of  the  components  of  the  13-channel  Floquet  eigenfunction  which  leads  to 
an  accurate  determination  of  the  resonance  energy  (7-figure  agreement  with  the  analytical  result,  see  Table  I)  The 
reference  channel  has  n  =  0.  Channels  n  =  -hi  and  n  =  -h2  are  above  the  reference  channel,  n  =  - 1  and  n  =  -2  are 
below.  Every  wave  function  or  component  wave  function  Is  complex.  The  real  parts  of  the  wave  function  when  the  field 
Is  off  (A  =  0),  or  of  the  reference  channel  wave  function  (n  =  0)  when  the  field  Is  on,  are  unity  at  x  =  0.  The  thiner  curves 
represent  the  imaginary  parts  of  the  functions.  Note  that  the  components  are  not  drawn  all  on  the  same  scale. 


rier.  This  shows  that  the  Floquet  problem  of  the 
parabolic  barrier  can  be  added  to  the  very  few 
soluble  cases  of  Floquet  theory.  Since  the  Floquet 
eigenfunctions  make  up  a  complete  set  for  the 
expansion  of  an  arbitrary  solution  of  the  time-de- 
pendent  Schrodinger  equation,  they  could  be  used 
to  examine  how  the  transmission  across  a  barrier  is 
affected  by  the  presence  of  a  field.  It  is  legitimate, 
when  the  energy  approaches  that  of  the  potential 
maximum,  to  use  locally  a  parabolic  approxima¬ 
tion  for  the  potential.  This  is  under  study. 

It  is  also  interesting  to  note  that  in  the  high- 
frequency  regime  (o)  »  (Wq)  the  shift  in  Eq.  (15) 
goes  to  the  ponderomotive  energy  A^/2aj^.  If  a 


barrier  maximum  is  interpreted  as  a  reaction 
threshold  [13,  14],  this  means  that  the  threshold  is 
displaced  upward  by  the  ponderomotive  energy, 
in  a  way  similar  to  the  shift  occurring  for  the 
ionization  threshold  in  an  intense  field  [35]. 
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FIGURE  4.  The  model  and  selection  of  wave  functions  are  the  same  as  in  Table  I  and  Figure  3,  but  now  exterior 
scaling  (ext.  sc.)  is  applied  for  |x|  >  5  on  the  integration  grid.  Damping  of  the  functions  starts  outside  the  real  interval 
-5  <  X  <  +5.  Same  conventions  and  remarks  for  normalization  as  in  Figure  3. 
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ABSTRACT:  Formalism  of  density-functional  theory  (DFT)  is  based  on  the  calculus  of 
variation.  In  the  Hohenberg  and  Kohn  theorem,  a  variational  equation  minimizing 
electronic  energy  with  respect  to  an  electron  density  is  constructed.  The  calculus  of 
variation  allows  one  to  formulate  a  problem  which  is  reciprocal  to  an  original  one.  Also, 
we  may  consider  the  problem  of  finding  the  electron  density  determining  a  given  energy 
E  =  E[  p]  for  a  maximum  number  N  =  N[  p]  of  the  electrons  forming  the  system.  In  this 
work,  the  reciprocal  variational  problem  is  discussed.  Mathematical  considerations  are 
followed  by  a  presentation  of  an  application  of  the  reciprocal  problem  (maximum 
entropy  principle).  Other  possibilities  of  the  applications  are  sketched.  ©  1997  John  Wiley 
&  Sons,  Inc.  Int  J  Quant  Chem  65:  499-501,  1997 

Key  words:  density-functional  theory;  variational  principle;  reciprocal  problem; 
maximum  principle 


Introduction 

There  are  many  similarities  among  various  ap~ 
plications  of  density-functional  methods.  One 
of  the  main  properties  of  this  parallelism  is  an 
obligation  of  a  variational  principle.  The  most  fun¬ 
damental  application  of  the  variational  principle  is 
a  minimization  of  the  electronic  energy  which, 
according  to  the  Hohenberg-Kohn  theorem,  is  a 
imique  functional  E  =  EJ_  p]  of  an  electron  den¬ 
sity  for  a  given  external  potential  v  =  i?(r)  [1].  The 
ground-state  density-functional  theory  (DFT)  can 
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be  generalized  to  equilibrium  states  in  ensembles 
of  very  many  positive  and  negative  ions  at  temper¬ 
ature  6  and  chemical  potential  pu  [2-5].  FFaving  a 
variational  principle  in  hand,  one  can  construct  a 
variational  equation  dependent  on  the  external  po¬ 
tential  v(x). 

Generally,  three  minimum  principles  can  be  ap¬ 
plied  in  DFT: 

(i)  For  the  ground-state  energy, 

~  ^HK  '  P  dr) 

(ii)  For  the  equilibrium  Fielmholz  free  energy, 

H  -  E,  -  eS; 

(iii)  For  the  grand  canonical  potential, 

il  =  E^-  eS-  piN, 
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where  H  and  (1  may  be  derived  as  Legandre's 
transforms  of  the  energy  while  S  =  S[  p]  is  an 
entropy  and  N  =  N[  p]  determines  a  number  of 
the  particles  in  the  system.  In  the  variational  treat¬ 
ment,  one  minimizes  the  functionals  for  EJ_  p], 
H[  p],  and  El[  p]  via  a  variation  of  the  electronic 
density  function  p(r). 


tion  corresponding  to  the  functional 

N  =  N-  XE.  (4) 

It  results  from  the  above  that 

E  -  jnJ  =  -A(E  -  jlN), 

(5) 


Reciprocal  Principle  in 
Zero-temperature  Limit 

Assuming  differentiability  of  the  energy  func¬ 
tional  Ey[  p]  =  E[  p],  the  variational  principle 
states  that  the  density  p,  which  minimizes  energy 
functional,  is  that  one  which  satisfies  the  equation 

8{E[  p]  -  plN[  p]}  =  0  (1) 


or,  in  the  form  of  the  Euler-Lagrange  equation. 


dE[p] 

dp 


=  fi. 


(2) 


The  constancy  of  p.  is  a  result  of  variation  of  the  p 
function  under  an  isoperimetric  condition  N[  p]  = 
const. 

As  a  functional  £  =  £[  p]  is,  in  general,  a  many- 
to-one  relation,  the  calculus  of  variation  gives  no 
unique  inverse  of  the  derivative  (2)  [6].  But  the 
calculus  of  variation  allows  one  to  consider  a  re¬ 
ciprocal  problem  which  is  formulated  through  a 
change  of  the  roles  of  the  optimized  functional  and 
the  constraint  [7].  According  to  the  reciprocal  prin¬ 
ciple,  a  stationary  state  of  a  system  of  constant 
energy  is  that  one  which  is  constructed  by  a  maxi¬ 
mum  number  of  the  electrons. 

Let  a  given  energy  be  equal  to 

E  =  const  (3) 

and  p  be  a  corresponding  electron  density  due  to  a 
maximum  number  of  the  electrons.  We  can  replace 
the  density  p  by  a  density  p'  such  that  N[  p']  = 
N',  £'  <  £  and  then  deform  p'  so  that  it  goes  into 
p"  while  N'  and  £'  are  increased.  It  can  be  per¬ 
formed  so  that  the  energy  again  becomes  equal  to 
£.  But,  then,  there  is  N"  >  N',  i.e.,  N"  >  N,  which 
contradicts  the  definition  of  N.  In  the  general  case, 
we  speak  about  a  stationary  value  of  the  functional 
N[  p]  with  a  constrained  £[  p]  =  Eq.  The  problem 
may  be  reduced  to  a  solution  of  the  Euler's  equa- 


and  from  a  necessary  condition  of  maximization. 


and 


8(N  -  XE) 

8p 


=  0 


(6) 


(7) 


Consequently,  after  cancellation,  the  same  equa¬ 
tion  as  in  the  simple  problem  can  be  obtained: 


~X8(E  ~  pN) 


=  0 


and,  finally  [compare  with  Eq.  (2)], 


(8) 


(9) 


Also,  p  equals  the  chemical  potential:  p  =  p  = 
const. 

It  follows  from  the  above  that  the  calculus  of 
variation  allows  one  to  treat  the  problem  of  the 
stability  of  the  electronic  system  through  the 
equivalent  reciprocal  problem  of  maximization  of 
the  electron  number  for  a  given  energy  of  the 
analyzed  system. 


Reciprocal  Variational  Principle  at 
IXonzero-temperature  Limit 

Let  us  take  a  system  at  T  ^  0  and  consider  the 
changes  which  preserve  the  number  of  the  elec¬ 
trons  N  =  N[  p].  The  energy  £[  p]  of  the  electronic 
system  must  be  minimized  subject  to  the  following 
constraints: 

(i)  The  electron  density  integrates  to  the  con¬ 
stant  number  of  the  electrons,  to  which  the 
Lagrange's  multiplier  p  is  attached;  and 

(ii)  The  entropy  equals  the  observed  value,  S  = 
S[  p]  =  const,  with  the  Lagrange's  multi¬ 
plier  6. 
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The  variational  principle  states  that 

6{Ejp]  -  es[  p]  -  fiN[  p]}  =  0,  (10) 


or  in  the  form  of  the  Euler's  variational  equation. 


8E  85 

8p  8p 


—  /X  =  0, 


(11) 


According  to  the  reciprocal  principle,  the  equilib¬ 
rium  state  can  be  achieved  for  a  given  N,  E  =  const 
and  for  the  electron  density  that  gives  a  maximum 
entropy  S[  p].  In  this  case,  the  maximum  condition 
reads 


85  =  0  (12) 

for  the  functional 


5  =  5  -  aE  -  f^N. 
The  respective  Euler's  equation  is 
85  8(5  -aE-  pN) 

8p  8p 

and,  after  transformation, 

8E  13  1  85 
8p  a  a  8p 


(13) 


(14) 


(15) 


Comparison  of  the  above  expression  with  Eq,  (11) 
proves  that,  for  the  multipliers  1/a  =  ^  and  jS/a 
^  -  pi,  there  is  an  equivalency  of  the  simple  prob¬ 
lem  of  the  energy  minimization  and  the  reciprocal 
problem  of  the  maximization  of  the  entropy. 


Summary 

Calculus  of  variation  allows  one  to  consider  the 
reciprocal  problem  of  the  original  problem  mini¬ 
mizing  the  energy  of  the  electronic  system.  This,  as 
we  have  shown,  can  be  established  directly  by 
constructing  and  solving  the  corresponding 
Euler-Lagrange  equation.  The  proposed  method 
can  be  used  in  the  description  of  the  equilibrium 


state  of  a  given  energy,  equal  to  the  experimental 
value,  but  of  a  varying  number  of  the  electrons 
forming  the  system.  Such  an  attitude  seems  to  be 
of  a  special  significance  from  the  point  of  view  of 
possible  application  for  the  cases  with  a  charge 
flow  between  chosen  loges.  In  [8],  a  metallic  crys¬ 
tal  composed  of  spherical  muffin-tins  was  consid¬ 
ered,  It  seems  that  further  applications,  e.g.,  in 
making  a  population  analysis  for  molecules,  may 
be  interesting.  A  reciprocal  principle  maximizing 
N[  p]  for  the  assumed  experimental  electronic  en¬ 
ergy  gives  a  theoretical  basis  for  semiempirical 
methods. 

Another  maximum  principle  presented  in  this 
work  is  the  maximum  entropy  principle.  It  is  of 
interest  as  to  how  the  maximum  hardness  princi¬ 
ple  [9"11]  may  be  established  from  the  reciprocal 
problem.  One  must  look  for  an  answer  using  the 
second  equation  of  the  hierarchy  of  the  equation 
for  the  energy  functionals  [8,  12-13]. 
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ABSTRACT:  The  time-dependent  Schrodinger  equation  for  the  1-D  Hj  molecule 
(with  both  nuclear  and  electronic  degrees  of  freedom  included)  was  solved  numerically. 
A  wave-function  splitting  technique  was  used,  which  allows  one  to  circumvent  the 
problem  of  lost  information  due  to  commonly  used  absorbing  boundaries  of  the  electron 
flux.  This  technique  allows  us  to  calculate  the  above-threshold  ionization  (ATI) 
photoelectron  kinetic  energy  spectra  in  the  presence  of  moving  nuclei,  as  well  as 
complete  spectra  of  dissociating  protons,  beyond  the  Born-Oppenheimer  approximation. 
A  considerable  enhancement  of  the  ATI  spectra,  with  respect  to  the  spectra  generated  by 
a  H  atom,  were  found.  The  peaks  seen  in  calculated  Coulomb  explosion  spectra  of 
protons  agree  well  with  the  predictions  of  recent  theoretical  work  related  to  the 
phenomenon  of  charge-resonance-enhanced  ionization  (CREI).  ©  1997  John  Wiley  &  Sons, 
Inc.  Int  J  Quant  Chem  65:  503-512,  1997 


Introduction 

The  interaction  of  intense  laser  pulses  (inten¬ 
sity  1  >  10^^  W/cm^)  with  atoms  and 
molecules  leads  to  many  new  interesting  multi¬ 
photon,  nonperturbative  phenomena  such  as 
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high-order  harmonic  generation  and  above¬ 
threshold  ionization  (ATI).  This  has  been  an  area 
of  active  research  in  the  past  decade  [1-4]  and 
there  has  been  extensive  progress  in  the  under¬ 
standing  of  these  processes  in  atoms.  The  situation 
for  molecules  is  not  as  advanced  because  of  the 
complexity  arising  from  the  additional  nuclear  de¬ 
grees  of  freedom.  Despite  the  fact  that  the  elec¬ 
tronic  and  nuclear  characteristic  time  scales  are 
very  different  (attosecond,  10”^^  s  vs.  femtosec¬ 
ond,  10”^^  fs),  the  intense  laser  field  induces  a 
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complex  correlation  between  both  motions,  which 
we  attempt  to  study  by  solving  numerically  the 
time-dependent  Schrodinger  equation  (TDSE)  us¬ 
ing  the  exact,  1-D,  three-body  Hamiltonian  for  a 
molecular  ion  in  an  intense,  linearly  polarized 
laser  field.  We  reported  earlier  results  of  simula¬ 
tions  based  on  TDSE  for  H  J  with  3-D  electron  and 
1-D  protons  for  212  and  600  nm  wavelength  [5,6]. 
Similar  calculations  for  1-D  electrons  and  1-D  pro¬ 
tons  were  also  reported  recently  [7].  In  all  these 
studies,  absorbing  boundaries  were  used  to  re¬ 
move  the  electron  flux  from  the  finite  electron  grid 
(in  [7],  a  nuclei  absorber  was  used  as  well).  There¬ 
fore,  so  far,  ATI  kinetic  spectra  in  presence  of 
moving  nuclei  have  not  been  calculated.  Similarly, 
removing  the  electron  flux  causes  the  loss  of  infor¬ 
mation  about  Coulomb  exploding  protons,  with 
which  the  lost  electron  flux  was  linked  (i.e.,  the 
information  about  the  p  p  +  e~  channel  was 
lost)  and,  consequently,  the  high-energy  part  of  the 
reported  proton  spectrum  was  incomplete. 

We  report,  in  the  present  article,  the  first  com¬ 
plete  calculations  of  Coulomb  explosion  spectra  of 
protons  and  the  ATI  electron  kinetic  energy  spec¬ 
tra  from  the  dissociating  .  The  difficulty  with 
the  lost  information  through  absorbing  boundaries 
is  overcome  with  the  help  of  the  wave-function 
splitting  technique  [8,9],  which  allows  one  to  prop¬ 
agate  in  time  the  absorbed  part  of  the  wave  func¬ 
tion  in  the  asymptotic  region  (i.e,,  for  the  electron 
coordinate  z  far  from  the  molecular  center  of  mass) 
using  a  much  faster  algorithm  than  that  used  in 
the  zone  close  to  nuclei.  We  first  describe  this 
technique  for  the  frozen  internuclear  distance,  in 
detail,  in  the  following  section  of  our  article.  In  the 
third  section,  we  present  details  of  our  exact  (and, 
therefore,  non-Born-Oppenheimer,  with  unfrozen 
nuclei)  calculations  and  show  how  the  wave-func¬ 
tion  splitting  technique  was  implemented  for  the 
case  of  moving  nuclei.  In  the  fourth  section,  we 
present  the  kinetic  energy  spectra  of  the  dissociat¬ 
ing  nuclear  fragments,  which  show  clear  signa¬ 
tures  of  the  charge-resonance-enhanced-ionization 
(CREI)  phenomenon.  CREI  was  first  discussed  in 
[10]  and  further  studied  using  models  with  frozen 
nuclear  motion  [11-13].  Thus,  it  remained  to  be 
investigated  whether  the  effect  of  enhanced  ioniza¬ 
tion  would  be  washed  out  by  the  nuclear  dynam¬ 
ics.  Our  previous  non-Born-Oppenheimer  calcula¬ 
tions  [5,6]  did  not  show  very  clear  CREI  peaks  in 
the  proton  spectra  due  to  the  above-mentioned 
electron  flux  losses  through  the  absorbing  bound¬ 
aries.  By  contrast,  the  wave-function  splitting  tech¬ 


nique  yields  sharp  Coulomb  explosion  peaks  due 
to  CREI  and  also  a  weaker  peak  originating  from 
the  center  of  the  initial  wave  packet.  The  fifth 
section  contains  the  ATI  electron  spectra  from  , 
showing  considerable  enhancement  of  peaks,  both 
in  the  low-energy  part  and  in  high-energy  part:  A 
very  broad  high  energy  tail  in  energy  spectra  of 
electrons  are  seen,  extending  up  to  1611^,  where 
=  e^El^/Amo)^  is  the  electron  ponderomotive 
energy.  Typically,  the  bulk  of  (about  99%)  ionized 
electrons  have  strong  ATI  peaks  in  the  energy 
range  less  than  3U^  (this  limit  corresponds  to  the 
maximum  drift  velocity  for  electrons  born  in  the 
continuum  with  zero  kinetic  energy  [14]).  Re¬ 
cently,  it  was  pointed  out  that,  contrary  to  the 
harmonics  generation  spectra,  the  ATI  electron 
spectra  do  not  have  a  sharp  cutoff  but  extend 
much  further  than  3U^  due  to  the  rescattering  of 
an  electron  returning  to  the  nucleus  [14-17].  So 
far,  these  effects  were  much  discussed  for  ATI 
from  atoms.  Recent  measurements  of  extremely 
hot  (multi-keV)  electrons  ejected  from  a  xenon 
cluster  [18]  suggest  that  these  rescattering  effects 
may  be  much  stronger  in  molecules  and  clusters 
than  in  atoms  since  the  electrons  ionizing  toward 
neighboring  dissociating  nuclei  can  scatter  on  them 
and  absorb  additional  photons.  Obviously,  such  a 
possibility  does  not  exist  in  atoms  or  atomic  ions. 
Therefore,  the  hot  electrons  are  less  likely  in  atoms, 
since  they  can  originate  only  from  the  returning 
electrons  scattered  on  one  nucleus,  while  in  the 
dissociating  molecule,  an  ionizing  electron  can  en¬ 
counter  directly  on  its  path  another  scatterer  on 
which  additional  photons  can  be  absorbed.  The 
investigation  of  this  issue  is  the  main  focus  of  the 
fifth  section. 

We  use,  in  the  present  article,  a  1-D  model  for 
both  electronic  and  nuclear  degrees  of  freedom.  It 
appears  that  the  1-D  dynamics  describes  reason¬ 
ably  well  the  movement  of  the  atomic  electrons  [2] 
in  the  linearly  polarized,  intense  laser  fields  (1  > 
10^^  W/cm^)  and  reproduces  correctly  such  phe¬ 
nomena  as  high  harmonic  generation  or  ATI.  Since 
the  molecules  in  intense  fields  are  aligned,  due  to 
the  torque  from  laser-induced  polarizabilities  [19], 
one  may  expect  that  1-D  molecular  dynamics  can 
also  reproduce  the  most  saliant  features  of  the 
dynamics  in  intense  laser  fields  ((/  >  10^^  W/cm^, 
i.e.,  above  the  ionization  threshold),  in  particular, 
in  the  long  wavelength  regime,  when  the  molecule 
dissociates  via  the  bond  softening  mechanism  and 
the  vibrational  trapping  does  not  take  place  [4]  (it 
was  shown  recently  [20]  that  for  wavelengths  much 
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shorter  than  600  nm  and  for  intensities  below  the 
ionization  threshold  the  vibrational  trapping  can 
cause  dissociating  fragments  to  be  emitted  at  right 
angles  to  the  polarization  direction).  Such  a  1-D 
molecular  dynamics  was  presented  recently  [7].  It 
appear  that  the  potential  surfaces  resulting  from 
1-D  electron  motion  are  quite  close  to  exact  3-D 
surfaces,  providing,  thus,  a  model  for  nuclear  mo¬ 
tions  with  correct  molecular  harmonic  and  anhar- 
monic  constants  as  well  as  correct  dissociation 
energy  [7]. 


Description  of  the  Wave  Function 
Spiitting  Aigorithm  for  Solving  the 
Time-dependent  Schrddinger  Equation 

One  of  the  theoretical  approaches  to  calculate 
the  ATI  spectra  relies  on  exact  numerical  solutions 
of  the  time-dependent  Schrodinger  equation 
(TDSE)  on  recently  available  powerful  computers. 
So  far,  this  approach  has  been  successfully  applied 
to  one-electron  atoms  and  molecules  (or  two  1-D 
electrons  [21]).  Typically,  in  such  numerical  simu¬ 
lations,  the  initial  wave  function  of  the  electron  is 
well  localized  in  space  and  later  becomes  spread 
over  very  large  distances.  For  a  short,  intense,  40  fs 
laser  pulse,  the  front  edge  of  the  wave  packet 
of  the  ionizing  electron  can  be  as  far  as  4000 
bohr  from  the  nucleus  (for  J  =  2  X  10^^  and  A  = 
600  nm)  and  continues  to  expand  for  longer  times. 
Thus,  to  avoid  reflections  from  the  boundaries, 
prohibitively  large  boxes  (grids  with  32,000  points 
or  more),  in  which  the  electron  wave  packet  is 
fully  contained,  must  be  used.  The  most  common 
solution  to  this  difficulty  is  to  use  much  smaller 
boxes  with  absorbing  boundary  conditions.  This 
approach  allows  one  to  calculate  the  ionization 
rates  and  harmonic  generation  spectra  but  it  does 
not  allow  one  to  calculate  the  photoelectron  (or 
ATI)  spectra,  since  the  external  part  of  the  wave 
function  is  absorbed  and  lost.  Also,  in  our  recent 
3-D  numerical  studies  of  competition  between  ion¬ 
ization  and  dissociation  [5,6],  the  use  of  absorber 
for  the  electron  flux  led  to  a  considerable  loss  of 
information  about  the  nuclear  movement,  i.e.,  we 
were  unable  to  reproduce  completely  the  spectra 
of  dissociating  fragments  from  the  Coulomb  explo¬ 
sion  channel  since  these  fragments  were  linked 
with  the  absorbed  electron  flux.  In  this  article,  we 
develop  a  technique  which  allows  one  to  recon¬ 
struct  the  absorbed  electron  wave  packet  in  mo¬ 


mentum  space  and  propagate  it  in  time,  analyti¬ 
cally,  to  any  time  after  the  pulse  turn-off.  Thus, 
with  the  help  of  this  technique,  one  can  calculate 
efficiently  the  photoelectron  spectra  after  the  turn¬ 
off  of  the  pulse.  A  similar  technique  was  used 
earlier  [22]  for  calculating  the  kinetic  energy  spec¬ 
tra  of  photodissociation  products,  with  the 
laser-molecule  coupling  neglected  in  the  asymp¬ 
totic  region.  Recently  [23],  this  technique  has  been 
generalized  for  nondecaying  (for  large  distances) 
dipole  couplings  (for  dissociating  molecules)  and 
for  the  calculation  of  the  ATI  spectra  from  time- 
dependent  Schrodinger  equation  [8,9,24].  In  the 
present  work,  we  extend  this  technique  for  solving 
the  TDSE  for  molecules  with  moving  nuclei.  Since 
the  wave-function  splitting  technique  is  quite  com¬ 
plex  in  practice,  we  present,  first,  this  technique 
for  the  case  of  frozen  nuclei  (or  for  the  hydrogen 
atom)  and  explain  in  the  next  section  how  the 
technique  was  implemented  for  the  case  when  the 
nuclei  move.  Let  us  consider  a  time-dependent, 
1-D  Schrodinger  equation  "written  (in  au)  in  the 
following  form: 

dip{z,0  I  I  d'^  \ 

i— - +Vc(z,R)  +2£(OU(z,0, 

dt  \  2  oZ  j 

(1) 


where 

Vc(z,R) 


1 

]/l  +  (z-R/lf 

1 

V^l  +  (z  +  R/lf 


(2) 


and  E(t)  is  the  laser  electric  field.  This  equation  is 
usually  discretized  and  solved  numerically  for  \z\ 
<  We  tested  this  method  [8]  for  the  case  of  the 
1  -  D  H  atom  in  the  laser  field  of  intensity  I  =  2.2 
X  10^^  W/cm^,  A  =  630  nm,  and  =  20  cycles 
(1  cycle  =  2,17  fs).  We  compared  our  results  with 
those  from  [17].  The  photoelectron  spectra  from 
this  reference  show  a  plateau  extending  up  to 
8U^  ^  70  eV  with  a  slowly  decaying  tail  up  to  130 
eV  (70  photons  absorbed),  where  = 
e^£^/(4m(y^)  is  the  electron  ponderomotive  en¬ 
ergy.  Thus,  the  fastest  electron  has  a  speed  ^ 
2.3  au  and  in  =  43  fs  can  be  as  far  as  4050  bohr. 
Typically,  the  integration  step  in  space  is  62  =  0.25 
bohr,  which  means  that  the  grids  with  over  32,000 
grid  points  must  be  used  for  a  43  fs  pulse.  For  n 
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times  longer  pulses,  n  time  greater  grids  should  be 
used.  This  constitutes  a  serious  difficulty,  even  for 
l-D  problems.  Therefore,  a  fast  algorithm  for  large 
z's  is  very  desirable.  In  particular,  since  the 
Coulomb  forces  are  smooth  and  negligible  for  large 
z's,  larger  integration  steps  in  space  5z,  as  well  as 
much  larger  steps  in  time,  can  be  used  asymptoti¬ 
cally. 

Let  us  divide  the  total  grid  into  three  overlap¬ 
ping  regions  in  space,  in  which  different  evolution 
algorithms  will  be  used:  the  internal  and  two  ex¬ 
ternal  (asymptotic,  for  positive  and  negative  z's) 
parts  defined  by 

internal:  |z|  <  z^-„  and  external:  z^^  <  |z|  <  z^, 

(3) 

where  Zj„  -  z^^  =  Zq  >  0  is  the  size  of  the  overlap 
of  both  regions.  In  this  overlapping  (or 
"matching")  region,  an  absorbing  potential  is  in¬ 
troduced,  which  we  have  chosen  in  the  following 
form: 

^(,5(2)  =  for  z,,<\z\<Zi„. 

(4) 

The  electron  wave  function  ijjiz,  jR,  0  is  split  into 
internal  and  external  parts,  by  applying 
this  absorbing  potential  in  the  overlapping  region, 
at  times  t  ^  k  8t,  where  8t  is  much  larger  than 
the  integration  time  dt  (dt  =  0.03  au,  8t  = 
150  dt  =  0.05  cycle,  i.e.,  in  our  simulations  for  A  = 
600  nm,  8t  is  much  less  than  is  the  laser  cycle) 
with  the  help  of  the  following  formulas: 


\jfiz,  R,  0  =  ipi„(z,  R,  t)  +  Ijf^^iz,  R,  t), 

(5) 

ipiniz,  R,  t)  =f(z,  dt)il/(z,  R,  t), 

(6) 

R,  0  =  (1  -f(z,  SO)ip(z,  R,  t), 

(7) 

where 

f{z,8t)  =exp(-afy,Jz)) 

for  z,^<|z|<z,„. 

(8) 

/(z,  8t)  =  1  for 

Iz|  <  Z.xr 

(9) 

and 

f(z,  8t)  =  0  for 

|z|  >  2,„. 

(10) 

Our  calculation  scheme  relies  on  the  assumption 
that  in  the  asymptotic  zone  (defined  by  |z|  >  z^^) 
the  Coulomb  potential  can  be  neglected.  This  al¬ 


lows  us  to  perform  the  time  evolution  exactly  and 
over  an  arbitrary  time  interval  by  calculating  first 
the  wave  function  in  the  velocity  gauge  ijj^iz.t) 
with  the  help  of  the  formula 

i/f^(z,  R,f)  =  exp(-2  A(t,0)z)tA^^(z,  R,  0,  (11) 
where 


is  the  electric  field  area  over  the  interval  ^2* 
Next,  we  calculate  the  Fourier  transform  (p^ip,  t) 
of  ik^iz,  0,  which  evolves  in  time  according  to  the 
equation 

%( V'  h'^<Pv(  P>  hf  (13) 

where  the  exact  propagator  U  is  given  by 

=  exp|--^*^(p^  +  2A(t,0)p 

+  A2(f,0))df|.  (14) 

In  our  algorithm,  we  perform  the  series  of  fol¬ 
lowing  operations  at  each  time  tj^  =  k  8t: 

(i)  The  exact  time  evolution  (including  field 

+  Coulomb  potential)  of  is  evaluated 
in  the  internal  zone  (|z|  <  Zj„)  using  the 
split-operator  spectral  method  [25],  as  orig¬ 
inally  applied  to  [25]. 

(ii)  The  outgoing  wave  is  eliminated  from  the 
internal  zone  with  the  help  of  Eq.  (6). 

(iii)  The  splitting  operation  described  by  Eq.  (7) 
is  performed  to  provide  the  outgoing  wave 

defined  in  the  "matching  zone,"  z^^  < 

|z|  <  Z/«- 

(iv)  The  velocity  gauge  wave  function  (in 
the  external  region)  and  its  Fourier  trans¬ 
form  (p^  are  calculated  in  the  overlap 
region: 

h) 

=  (Itt)  ^  f exp(-ipz)i//^(z,  t,^)  dz.  (15) 

•'2 

(v)  The  momentum  wave  function  (p(  p,  f^)  is 
propagated  to  some  final  time  >  t^  (at 
which  the  photoelectron  spectra  are  to  be 
calculated),  using  the  exact  formula 

(16) 
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(vi)  The  momentum  wave  function  cp^i  p,  R, 

is  added  to  R,  tjrYs  accumulated 

at  previous  steps,  i.e.: 

+  %(p,  R,  tj). 

(vii)  The  loop  ends  here  and  calculation  starts 
at  step  (i)  with  the  integer  k  replaced  by 
/c  +  1. 

The  most  difficult  part  of  the  scheme  presented 
here  is  the  choice  of  the  free  parameters:  8t, 

and  Zq.  Several  obvious  necessary  (but  not 
sufficient)  physical  condition  should  be  satisfied. 
First,  the  external  zone  (|z|  >  z^^)  in  our  scheme  is 
the  asymptotic  region  in  which  we  neglect  the 
Coulomb  forces;  therefore,  we  must  require  that  at 
z  =  Zg^  the  Coulomb  force  is  much  less  than  the 
electric  force  i.e.: 

~  .  (17) 

^ex 

Second,  our  scheme  is  based  on  the  assumption 
that  the  wave  entering  the  '"matching"  zone  (of 
length  Zq)  does  not  return  back  to  the  nucleus,  as 
well  as  the  wave  exiting  this  zone  does  not  return 
to  it.  This  means  that  we  should  require  that  the 
classical  ponderomotive  radius  is  much  smaller 
than  is  z^^  and  also  smaller  than  the  size  of  the 
matching  "zone"  Zq: 


time  necessary  to  move  for  the  fastest  electron 
across  the  matching  "zone."  Assuming  that  the 
fastest  electron  has  the  energy  we 

thus  get 


8t 


Zq 

4ymUp  ■ 


(19) 


In  our  calculations,  8t  is  also  much  less  than  the 
laser  cycle.  Finally,  the  absorption  should  occur  on 
the  distance  Zq  longer  that  de  Broglie  wavelength 
of  the  slowest  electron.  Assuming  that  the  slowest 
electron  has  the  energy  h  co,  we  thus  get 


Zo 


(20) 


For  the  maximal  laser  intensity  chosen  in  this 
article,  I  ==  2.0  X  10^^  W/cm^,  and  the  wavelength 
A  =  600  nm,  the  values  of  two  relevant  parameters 
are  (Xq  bohr.  Up  =  0.24  au  =  6.5  eV,  and 

CO  =  0.076  au  =  2.066  eV.  We  list  in  Table  I  the 
values  of  all  parameters  which  we  used  in  our 
calculation  and  which  satisfy  all  inequalities  Eqs. 
(17)-(20). 

The  spectra  of  ionized  electrons,  defined  for 
each  R,  where  calculated  according  to  the  formula 

S(E,R)  = 

I  /  ^\2\  rn 

+  |»4p(£),R,(,)|)^,  (21) 


and  (Xq  «  Zq,  where  Kq  = 


eE 


0  2  * 
mo) 


(18) 


where  <p+,(^_)  are  wave  function  corresponding 
to  the  electron  moving  in  the  direction  of  positive 
(negative)  direction  along  the  z  axis,  respectively, 
and  p(E)  =  \/2mE .  We  checked  the  method  by 


The  above  conditions,  (17)  and  (18),  ensure  also 
that  all  possible  interferences  in  continuum  (which 
are  expected  to  take  place  in  the  near  nuclei  zone) 
are  appropriately  taken  into  account  by  the  exact 
algorithm  used  in  the  internal  zone.  In  addition, 
we  must  require  that  the  matching  should  be  re¬ 
done  after  the  time  interval  8t  shorter  than  the 

TABLE  1 

comparing  our  results  for  a  1-D  H-atom  with  those 
of  [17,24]  and  with  the  results  from  the  one-box 
calculations.  Slight  disagreement  between  the 
heights  (not  positions)  of  several  first  photoelec¬ 
tron  peaks  was  seen  if  our  spectra  were  calculated 
right  at  the  end  of  the  pulse.  This  disagreement 
disappears  when  the  spectra  are  calculated  at  later 

Values  of  parameters  (in  atomic  units)  used  in  the  wave-function  spiitting  scheme. 

dt  8t  dz  Zq 

^in  ^ex  ^abs  ^max 

0.03  4.5  0.25  512  150 

4196  7168  2  70  840 

rijn  is  number  of  grid  points  in  the  interna!  zone  (|z|  <  z,„)  and 
space  with  the  momentum  resolution  Ap  =  2'TT/7^6S  a.u. 

is  the  number  of  grid  points  in  the  external  zone,  in  momentum 
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times,  tj:  >  tp  (with  the  laser  turned-off  for 
since  then  all  slow  electrons  have  enough  time  to 
leave  the  matching  zone.  Therefore,  in  all  calcula¬ 
tions,  we  allow  the  system  to  evolve  without  the 
laser  field  for  several  subsequent  cycles  and  then 
calculate  the  ATI  spectra  (see  [8]  for  more  details). 

The  two-box  integration  technique  presented 
here  was  first  applied  for  calculating  the  ATI  spec¬ 
tra  from  a  molecule  frozen  at  an  internuclear 
distance  R  [8]  and  those  results  were  used  later  for 
comparison  with  the  full  non-Born-Oppenheimer 
calculations,  presented  in  next  sections. 


A  Non-Born-Oppenheimer  1-D  Model 
of  in  Intense  Laser  Fields 


The  potential  V'(-(z,  R)  was  neglected  in  the  exter- 
nal  zone,  whereas  the  Coulomb  repulsion  1  /R  was 
retained  in  H(R). 

We  used  laser  pulses  of  total  duration  20  cycles 
(40  fs),  having  the  wavelength  A  =  600  nm,  with  a 
five-cycle  rise  and  fall.  The  values  of  various  pa¬ 
rameters  used  in  the  present  calculations  are  given 
in  Table  L  We  performed  numerically  the  time 
evolution  of  the  wave  functions  and 

the  external  cpip,  R,  t)  for  an  additional  three  cy¬ 
cles  (6  fs),  i.e.,  up  to  the  final  time  tj  =  23  cycles, 
using  the  Hamiltonian  without  the  electric  field,  in 
order  to  allow  some  more  electron-proton  waves 
to  leave  the  interaction  region. 

The  initial  wave  function,  at  t  =  0,  was  as¬ 
sumed  to  be  in  the  following  form: 


We  solved  numerically  the  complete,  three- 
body,  time-dependent  Schrodinger  equation  for 
moving  electrons  and  nuclei  (in  au): 


i -  =  H(z,  R,  Oipiz,  R,  t) 

dt 

where  X 

H{z,R,t)  =  Hj,{R)^V^{z,R)  +  H^{z),  (22) 

dz^ 


H^(z)  =  -/3vt  "  KzEit), 


1  1 
H(R)  = - 4- 


dR^  R ' 


(2m^  -h  m^) 


K  =  1  -h 


2m^  +  m, 


(23) 
•,  (24) 


iP(z,R,0)  =  x(R)h(^rZ),  (25) 

where  iPq(R,  z)  was  the  eigenfunction  of  the  elec¬ 
tron  Hamiltonian  in  H  J  (without  the  laser  field)  at 
fixed  R.  Two  distinct  initial  condition  at  f  =  0 
were  used: 

(i)  Hj  was  initially  prepared  in  its  vibrational 

=  6  ;^(R)  =  ;^6(R)  state,  or 

(ii)  at  t  =  0,  H  2  was  prepared  in  a  superposi¬ 
tion  of  Hj  vibrational  states,  coinciding  with 
the  ground  vibrational  state  of  the  H2 
molecule,  i.e., 

t(^)  =  To“'=  (26) 

v  =  G 


where 


and  and  (m^  =  1  in  au)  are,  respectively, 
the  electron  and  proton  masses.  The  Hamiltonian 
of  Eq.  (22)  is  the  exact  three-body  Hamiltonian 
obtained  after  separation  of  the  center-of-mass  mo¬ 
tion  [5].  The  time  evolution  of  the  internal  wave 
function  was  performed  in  the  same  way  as 
described  in  [5,25]  (using  the  split-operator 
method),  whereas  the  external  part  (obtained  re¬ 
covered  from  the  absorber  at  each  3 1  step) 
(p^^\  p,  R,  t)  was  evolved  with  the  help  of  the 
propagator  [Eq,  (14)]  for  times  and  ^2  =  ^1  +  ^^ 
where  8t  150  dt  (as  previously).  After  each 
''big"  time  step  8t  was  accomplished,  the  new 
portion  (pip,  R,t  +  8t)  was  added  to  the  previ¬ 
ously  accumulated  portion.  The  operation 
exp(~idtH(R))  was  applied  to  the  accumulated 
part  of  cp^^Kp,  R,  t)  at  each  "small"  time  step  dt. 


-00 

c,=  (27) 

•'0 

are  the  Franck-Condon  factors. 

The  latter  initial  condition,  (ii),  is  close  to  the 
experiments,  in  which  Hj  is  prepared  by  a  steeply 
rising  edge  of  a  pulse,  which  rapidly  ionizes  the 
H2  molecule. 

To  interpret  the  full  calculation  results,  we  first 
calculated  the  ionization  rates  for  170  values  of 
R's,  frozen,  in  the  range  from  3  to  33  bohr.  The 
ionization  rates  for  the  laser  wavelength  A  =  600 
nm  are  displayed  in  Figure  1.  One  observes  strong 
charge  resonance  enhanced  ionization  (CREI)  peaks 
around  7  bohr,  which  were  discussed  recently  by 
many  authors  [10-13],  The  CREI  effect  is  incredi¬ 
bly  strong  for  the  relatively  low  intensity  J  =  5  X 
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FIGURE  1 .  Ionization  rates  as  a  function  of  internuclear 
distance  for  laser  intensities  /  =  5  x  10“'^,  2  x 

and  4X10^"^  W/cm^  at  wavelength  600  nm. 


10^^  W/cm^  and  practically  disappears  at  J  =  4  X 
10^^  W/cm^,  when  at  a  short  internuclear  distance 
R  =  4  au,  the  ion  ionizes  with  a  rate  close  to 
the  H-atom  rate  and  the  ionization  rate  curve  is 
relatively  flat.  Note  that  in  addition  to  those  peaks 
some  unexpected  features  appear  for  larger  R  val¬ 
ues;  in  particular,  for  lower  intensities,  I  =  5  X  10^^ 
W/cm^,  one  observes  a  deep  minimum  in  the 
ionization  rate  at  R  =  16  bohr.  It  originates  from 
the  fact  that  an  ionizing  electron  from  one  proton 
comes  under  the  influence  of  the  second  proton 
and  does  not  have  enough  time  to  escape  from  the 
second  proton  before  the  electric-field  changes  sign. 
Thus,  this  electron  becomes  "trapped"  between 
the  two  protons.  One  may  expect  that  the  position 
of  this  minimum  is  proportional  to  the  classical 
radius  of  the  electron  in  the  laser  field  aQ.  This  is 
indeed  confirmed  by  the  position  of  this  minimum 
for  a  twice  higher  intensity,  which  occurs  \/2  times 
further.  We  are  currently  attempting  to  explain 
this  minimum  using  classical  dynamics  of  the  elec¬ 
tron  [26,27]. 


Kinetic  Energy  Spectra  of  Dissociating 
Nuclear  Fragments 

The  wave  functions  R,  tj),  cp{p,  R,  t^)  al¬ 

low  us  to  calculate  the  simultaneous  spectra  of 
nuclear  fragments  Sy^Cz,  tj)  and  S^^(p,  p^,  tj), 


where  p,  p^^  are  the  momenta  of  the  electron  and 
the  protons,  by  simply  calculating  the  Fourier 
transform  of  the  asymptotic  part  (large  R)  of 
R,  ty),  (p{  p,  R,  tj)  with  respect  to  the  R  vari¬ 
able  and  next  calculating  the  square  of  absolute 
values  of  Fourier  transforms  (see  [6]  for  details). 
The  integrated  over  electronic  degrees  of  freedom 
spectra  S(  pj^)  are  plotted  in  Figure  2  (the  contribu¬ 
tions  from  internal  and  external  boxes  are  added 
up). 

Two  different  initial  condition  were  used  in  our 
calculations  (see  the  previous  section  for  a  descrip¬ 
tion  of  these  conditions),  on  which  the  spectra 
depend  strongly.  We  show  in  Figure  2(a)  spectra 
obtained  from  H2  which  initially,  at  f  =  0,  was 
prepared  in  its  vibrational  v  ^  6  state  (two  laser 
intensities  were  used  I  =  10^^  and  2  X  10^^ 


*0123456789 


NUCLEAR  KINETIC  ENERGY  E  (eV) 


FIGURE  2.  Kinetic  energy  spectra  of  dissociating 
nuclear  fragments.  The  numerical  simulation  was 
initialized  either  (a)  from  the  v  =  6  state  of  the  H  J  or  (b) 
from  the  Franck-Condon  superposition  of  vibrational 
states  of  HJ,  defined  by  Eqs.  (26)  and  (27).  Laser 

intensities  used:  ( - )  /  =  10^^  W/cm^;  ( . )  I  =  2 

xIO^"^  W/cm^,  the  wavelength  A  =  600  nm. 
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W/cm^),  whereas  in  Figure  2(b),  we  show  the 
spectra  resulting  from  prepared,  at  f  =  0,  in  a 
superposition  of  vibrational  states,  coinciding  with 
the  ground  vibrational  state  of  the  H2  molecule, 
multiplied  by  the  electronic  wave  function  of 
[see  Eqs.  (25)-(27)].  Figure  3  shows  the  same  spec¬ 
tra  on  a  logarithmic  scale.  The  low-energy  peaks, 
seen  Figure  2,  are  related  to  the  bond-softening 
mechanism  [4]:  In  Figure  2(a),  there  are  two  very 
close  peaks,  at  0.26  and  0.43  eV.  Their  position  is 

close  to  the  net  absorption  of  one  photon  (pre¬ 
sumably  via  a  multiphoton  process)  from  the  v  =  6 
state:  The  energy  released  into  the  dissociation 
channel  from  this  process,  when,  initially,  the 
molecule  was  in  its  vibrational  state  v,  is  given  by 
a  simple,  conservation  energy  formula: 

E  =  nfico-(-I^-E,),  (28) 

where  =  0.67  au  is  the  ionization  potential  of 
the  1-D  H  atom,  the  energy  vibrational  level  v  of 
Hj  (from  which  photons  are  absorbed),  and  n  is 
the  net  number  of  absorbed  photons  (in  general,  it 


0  2  4  6  8  10  12  14  16  18 


NUCLEAR  KINETIC  ENERGY  E  (eV) 

FIGURE  3.  Same  as  in  Figure  2  but  on  a  logarithmic 
scale. 


is  a  multiphoton  process:  an  absorption  of  m  pho¬ 
tons  followed  by  the  reemission  of  k  photons,  with 
the  condition  that  n  —  m  ~  k  >  0).  For  the  wave¬ 
length  A  =  600  nm  used  in  the  present  work,  h  co 
=  2.066  eV.  Thus,  from  z;  =  6,  for  n  =  1,  a  peak  at 
0.542  eV  is  expected,  and  from  i?  =  5,  at  0.343  eV. 

Two  factors  may  be  responsible  for  the  fact  that  in 
our  simulation  two  peaks  are  seen,  instead  of  one 
expected  at  0.542  eV:  (i)  First,  a  rapid  rise,  in  five 
cycles,  can  populate  the  neighboring  v  =  5  state 
(or,  in  other  words,  the  peak  is  shifted  by  the  fact 
that  our  pulse  is  spectrally  very  broad),  and  (ii)  the 
downward  Stark  shift  of  the  level  v  =  6  occurs. 
Similarly,  the  low-energy  peak  seen  in  Figure  2(b) 
originates  from  the  net  two-photon  absorption  from 
low  vibrational  states:  Using  n  =  2,  =  0,  1  or  2, 

one  gets  the  peak  positions  at  1.25, 1.50,  or  1.74  eV, 

respectively,  whereas  the  simulation  (which  used 
the  H2  V  =  0  state,  as  initial  input  for  the  vibra¬ 
tional  wave  function)  yields  a  broad  peak  at 
E  =  1.06  eV.  Again,  a  considerable  spectral  width 

of  our  pulse  and  the  Stark  shifts  are  the  main 
reason  of  the  observed  discrepancy  between  the 
position  of  expected  and  seen  peaks,  i.e.,  the  pres¬ 
ence  of  photons  having  a  frequency  only  5%  less 
than  that  of  the  carrier  frequency  can  shift  the  peak 
by  0.2  eV  or  more  and  thus  explain  the  observed 
shifts. 

The  high-energy  broad  peaks,  E  >  3.8  eV,  rep¬ 
resent  the  p  +  p  +  e~  Coulomb  explosion  channel, 
since  in  our  calculation,  they  originate  from  the 
external  part  of  the  wave  function  (p(  p,  jR,  ty).  At 
laser  intensity  I  =  10^^  W/cm^,  we  see  a  broad 
peak  at  E  =  3.8  eV.  If  the  Coulomb  explosion  oc¬ 
curred  with  initial  kinetic  energy  equal  to  zero, 
then  we  can  conclude  that  the  Coulomb  explosion 
should  be  initiated  at  R  =  =  7.16  bohr,  in  order 

to  release  the  Coulomb  energy  Ecoif/om&  = 
(according  to  the  relation  Ecou/omi?  ^  27.21  eV/R^, 
where  is  in  bohr),  which  is  in  agreement  with 
Figure  1,  in  which  three  peaks  are  seen,  at 
5.6,  6.8  and  7.6  bohr.  If,  in  addition,  one  takes  into 

account  the  fact  that  at  R  ^  R^  the  dissociating 
H  +  p  have  an  initial  nonzero  kinetic  energy 
[varying  between  0.2  to  0.8  eV;  see  the  first  peak  in 
Fig.  2(a)],  one  can  easily  understand  the  appear¬ 
ance  of  a  broad  peak  centered  at  3.8  eV:  Simply, 
the  explosions  from  three  CREI  peaks  seen  in  Fig¬ 
ure  1  will  merge  to  one  broad  structure.  At  higher 
intensities.  Figure  1  shows  a  considerable  shift  of 
CREI  peaks  toward  the  lower  values  of  R,  in 
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agreement  with  the  shift  of  the  Coulomb  explosion 
peak  resulting  from  our  calculations  [seen  in  Fig. 
2(b)],  which  is  now  centered  at  E  =  4.8  eV  and  is 
broader  than  at  the  lower  intensity.  It  extends  up 
to  8  eV,  which  agrees  well  with  the  lower  limit  of 
CREI  peak  in  Figure  1,  which  is  sharply  cut  at  3.05 
bohr.  Summarizing,  the  broad  peaks  in  the  p  p 
channel  seen  in  Figure  3  agree  well  with  the  expec¬ 
tations  for  the  CREI  peaks  based  on  the  model 
with  frozen  nuclei. 

We  display  in  Figure  3  the  same  spectra  as  in 
Figure  2  but  on  a  logarithmic  scale,  in  order  to 
discuss  the  high-energy  tail.  One  notes  that  the  tail 
of  proton  spectra  depends  dramatically  on  the 
initial  conditions:  Initializing  our  simulation  from 
the  i;  =  6  of  yields  spectra  sharply  cut  below 
10  eV  [Fig.  3(a)],  whereas  the  initialization  from 
the  V  =  0  state  of  H2  gives  a  small  (100  times 
lower  than  the  principal  peak)  but  a  clear  peak  at 
£  =  14  eV  [Fig.  3(b)]  which  originates  from  the 
Coulomb  explosion  from  the  equilibrium  position 
of  the  H2  molecule  =  2  au  (note  that  the  equi¬ 
libria  positions  in  our  1-D  model  are  2.0  and  2.6 
au,  instead  of  1.6  and  2,  for  FI 2  and  for  Hj, 
respectively). 


Kinetic  Energy  Electron  Spectra  from 
a  Dissociating  Ion 

The  ATI  spectra  presented  in  this  section  were 
obtained  by  integrating  over  R  the  formula  (21),  in 
which  the  external  box  function  (p(p,R,tf),  ob¬ 
tained  from  the  complete,  non-Born-Oppenheimer 
calculation  was  used.  The  normalized  (i.e.,  the  area 
under  the  spectrum  curve  is  the  ionization  proba¬ 
bility  Pj)  ATI  spectra  are  shown  in  Fig.  4(1  =  10^^ 
W/cm^)  and  in  Figure  5  (2  X  10^^  W/cm^).  Our 
exact  non-Born-Oppenheimer  results  (solid  lines) 
are  compared  with  the  ATI  spectra  originating 
from  the  1-D  H  atom  (□  □  □)  and  from  frozen 
at  R  =  R^  (a  A  a).  One  notes  a  considerable  en¬ 
hancement  of  the  spectrum  around  3U^  and  en¬ 
hancement  at  very  large  energies,  up  to  1617^,  A 
frozen  nuclei  calculation  shows  such  enhancement 
at  particular  large  distances  (33  au  or  more  [9]), 
which  presumably  are  close  to  integer  multiples  of 
the  classical  radius  agi  Classical  trajectory  calcula¬ 
tions  show  that  the  electron  reaches  its  maximal 
kinetic  energy  at  distances  given  by  [26,27] 

^max  =  rmrivo/w  +  Kq),  (29) 

where  Vq  is  the  electron's  initial  velocity. 


electron  energy  (photon  energy) 


FIGURE  4.  ( - )  ATI  photoelectron  kinetic  energy 

spectrum  from  Hg  complete  non-Born-Oppenheimer 
calculations  compared  (a  a  a)  with  spectra  of  the  nuclei 
frozen  at  P  -=  =  7.6  au  and  (□  □  □)  with  spectra  of 

1-D  H  atom.  The  laser  intensity  I  ^  10^"^  W/cm^,  the 
wavelength  A  =  600  nm,  and  the  simulation  was 
initialized  from  the  v  =  6  state  of  HJ .  The  pulse  duration 
was  20  cycles  (1  cycle  =  2  fs),  with  a  five-cycle  rise  and 
five-cycle  turn-off.  The  ATI  spectrum  was  calculated  after 
40  cycles.  All  spectra  are  normalized:  The  area  under 
each  curve  is  equal  to  the  ionization  probability. 


electron  energy  (photon  energy) 


FIGURE  5.  Same  as  in  Figure  4  but  for  the  laser 
intensity  /  =  2  x  W/cm^  and  =  7.0  au. 
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Concluding  Remarks 

We  have  reported  that  dissociative-ionization  of 
molecular  ions  exhibits  several  interesting  features 
seen  in  nuclear  fragment  and  electron  kinetic  en¬ 
ergy  spectra.  The  expected  (theoretical  and  experi¬ 
mental)  low-energy  peaks  in  proton  spectra  do 
show  up  in  our  spectra  calculated  from  the  non- 
Born-Oppenheimer  dynamics.  The  positions  of 
these  peaks  agree  well  with  theoretical  predictions 
for  Coulomb  explosions  released  from  the  CREI 
critical  distance  R^.  In  addition  to  those  peaks,  we 
observe  one-  and  two-photon  absorption  peaks,  as 
well  as  weak  peaks  related  to  the  Coulomb  explo¬ 
sion  from  the  H2  equilibrium  internuclear  dis¬ 
tance.  The  ATI  electron  kinetic  energy  spectra  show 
unexpectedly  high  tails,  extending  up  to  15U^, 
related,  presumably,  to  the  inelastic  collisions  of 
electrons  moving  from  one  nucleus  to  another. 
Finally,  we  emphasize  the  usefulness  of  the  wave- 
function  splitting  technique  for  exact  calculations 
of  kinetic  energy  spectra  of  photofragments  emerg¬ 
ing  from  high-intensity  laser  interaction  with 
molecules.  This  technique  allowed  us  to  recover 
the  electron  flux  usually  lost  at  absorbing  bound¬ 
aries  in  previous  calculations,  with  a  considerable 
saving  in  computer  memory  and  CPU  time. 
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ABSTRACT:  A  convenient  Gauss-Laguerre  quadrature  formula  is  presented  for 
integrands  which  depend  on  the  radial  coordinates  and  r2  of  two  bodies  as  well  as  on 
their  relative  distance  This  formula  generalizes  the  analytic  method  by  Calais  and 
Lowdin  [J.  Mol.  Spectrosc.  8,  203  (1962)]  to  cases  when  the  analytical  evaluation  is  not 
possible  and  defaults  to  an  exact  method  when  it  is.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J 
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Introduction 

Prompted  by  renewed  interest  in  experimental 
studies  of  highly  charged  ions,  small  atomic 
systems  have  again  become  the  focus  of  theoretical 
and  experimental  investigations.  Heliumlike  sys¬ 
tems,  in  particular,  are  once  more  center  stage  in  a 
variety  of  different  research  projects.  Parallel  to 
this  development  in  ion  physics  runs  a  similar 
activity  in  the  study  of  atomic  processes  in  plas¬ 
mas.  Here,  too,  multiply  ionized  two-electron  sys¬ 
tems  are  being  studied  as  well  as  negative 
ions — predominantly,  the  negative  hydrogen  atom, 
which  is  an  important  agent  of  opacity  in  astro- 
physical  plasmas. 
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The  focus  of  much  of  modern  atomic  physics  is 
on  electron  correlation  in  various  systems  and  pro¬ 
cesses.  This  interest  is  shared  by  new  develop¬ 
ments  in  the  physics  of  highly  ionized  systems, 
including  clusters,  and  by  studies  of  atomic  sys¬ 
tems  in  plasma  environments.  Two-electron  sys¬ 
tems,  in  particular,  play  an  important  role  in  this 
research  because  of  the  relative  ease  with  which 
correlation  effects  can  be  identified  and  calculated. 
New  aspects  such  as  the  need  to  include  relativis¬ 
tic  effects  in  highly  charged  ions,  for  instance,  or 
the  direct  inclusion  of  screening  potentials  in 
atomic  calculations  [1,  2]  call  for  new  or  at  least 
modified  theoretical  approaches. 

The  present  research  originated  from  ideas  to 
include  realistic,  time-dependent  screening  poten¬ 
tials  into  atomic  calculations  from  scratch,  and,  by 
doing  so,  to  calculate  line  broadening  and  the 
lowering  of  the  continuum  threshold  simultane¬ 
ously  and  on  the  same  footing.  While  the  time-de¬ 
pendent  aspect  is  not  our  focus  here,  the  need  for 
wave  functions  of  larger  angular  momenta  than  S 
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and  P  states  for  heliumlike  ions  stimulated  the 
present  study  which  builds  on  the  landmark  arti¬ 
cle  by  Calais  and  Lowdin  [3]  in  which  the  authors 
gave  an  analytic  expression  for  nested  triple  inte¬ 
grals  of  the  type  "exponential  function  times  inte¬ 
ger  powers"  in  all  three  arguments.  Their  Eq.  (28) 
reads 


[KLM]  ^ 


\-r2\ 


KLM 

2K\L\M\  ELL 

p=0  q  =  0  r  =  0 


(p  +  q)'. 
q\p\ 


(K  -  p  +  r)l  (L  —  q  +  M  —  r)[ 
r\(K-p)\  (M-rVXL-qV. 


x(B  +  (1) 


Computationally,  this  formula — whenever  it  can 
be  applied — is  advantageous  over  the  quadrature 
formula  to  be  presented  in  the  third  section.  In 
many  cases  of  interest,  however,  the  latter  pro¬ 
vides  substantial  convenience  for  programming 
without  compromising  accuracy.  Furthermore,  it 
can  be  generalized  to  the  case  of  more  general 
integrands  for  which  analytic  results  cannot  be 
obtained.  In  these  cases,  the  quadrature  will  no 
longer  be  exact  but  still  of  a  high  degree  of  accu¬ 
racy. 


A  Quadraure  Formula  for  Integrals  of 
the  Calais-Ldwdin  Type 

The  following  development  makes  use  of  four 
well-known  properties  of  the  Gauss-Laguerre 
quadrature  formula: 

■  Accuracy:  The  quadrature  replacement 

-oc  N 

I  f(x)e~''dx- 
•'o 

is  exact  if  /(x)  is  a  polynomial  in  x  of 
degree  2N  +  2  or  less  and  the  quantities 
and  are  the  standard  Laguerre  points  and 
weights  as  given,  e.g.,  in  Abramowitz  and 
Stegun  [4]. 


Scaling:  An  integral  of  the  form 


with  exactly  the  same  weights  W„, 
Shifting  of  the  lower  integral  limit: 

-00 

/  f(x)e  ^dx 
•'o 

=  e~'J  f  /(^  +  y)e''^dy 

Interchanging  of  nested  integrals: 

(  dxf(x)e'~^^  f  dyg(x/)e~^^^ 

*'o  -^0 

=  f  dxg(x)e~^^  f  dyf(y)e~^'^. 

•'n  *^v 


Repeated  application  of  these  simple  procedures 
leads  to  a  convenient  quadrature  representation  of 
the  Calais-Lowdin  integral  of  Eq.  (1).  We  first 
replace  r^,  r2,  and  r^2  3//  z,  respec¬ 

tively,  and  get  rid  of  the  absolute  value  in  the 
lower  limit  by  separating  the  two  possibilities 
X  <  y  and  x  >  y.  Furthermore,  we  want  the  upper 
limit  to  be  infinite  and — in  the  end — the  lower 
limit  to  be  zero  in  order  to  be  able  to  apply 
Gauss-Laguerre  quadrature  techniques: 


-CO  00 

[KLM]-  j  dxx^e-^^j  dyy^e' 


By 


•'0 
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(  dzz^e  —  f  dzz^e 

'^y-x  '^y  +  x 


With  the  following  replacements,  the  integrals  in 
the  square  brackets  are  changed  to  the  desired 
form:  o'=C[z-(y--  x)],  so  that  dz  -  (1/C)dcr 
(we  notice  that  y  -  x  is  a  constant  limit  for  the 
first  of  the  integrals).  Similarly,  we  substitute  in 
the  remaining  integrals  cr=C[z--(y  +  x)],  cr  = 
C[  z  -  (x  -  y)]  and  o-=C[z-(x  +  y)],  respec- 
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tively.  By  this,  the  two  square  brackets  change  to 


g-C(y  +  x)  ^  /  p-  \M 
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We  combine  the  square  brackets  with  the  inte¬ 
grals  on  their  immediate  left  after  substituting 
T  =  (B  +  CXy  -  x)  and  r  =  ( A  +  C)(x  -  y),  re¬ 
spectively: 
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The  corresponding  quadrature  formula  is  given 
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The  final  integrations  require  only  scaling  but 
no  shifting  of  the  quadrature.  The  overall  result  is 
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We  notice,  in  particular,  that  all  integrands  have 
the  general  form  of  a  polynomial  multiplied  by  an 
exponential  function.  The  quadrature  gives  in  these 
cases  exact  results  provided  that  the  number  of 
quadrature  points  is  at  least  equal  to  (D/2)  -  1, 
where  D  is  the  degree  of  the  polynomial  in  one 
variable.  In  the  present  case,  the  degree  for  the 
variable  p  is  equal  to  M  +  X  +  L,  while  for  the 
variables  r  and  cr,  it  is  M  +  X  (or  M  +  L)  and  M, 
respectively. 


The  General  Quadrature  Formula 

The  more  general  integral 

/GO  00 

dr2g(r2)e~^^^ 

0  •'o 

can  as  well  be  represented  by  a  quadrature  for¬ 
mula.  Again,  if  the  functions  /,  g,  and  h  are 
polynomials  in  their  respective  arguments,  the 
representation  will  be  exact  provided  that  the 
number  of  quadrature  points  is  sufficiently  large. 
In  this  case,  the  sole  benefit  of  a  quadrature  repre¬ 
sentation  over  the  direct  evaluation  by  the 
Calais-Lowdin  techniques  is  a  substantial  saving 
in  programming  work.  If,  however,  the  functions 
are  not  polynomials,  an  optimally  scaled 
Gauss-Laguerre  quadrature  is  still  an  efficient  way 
to  evaluate  the  necessary  integrals.  The  various 


procedures  of  scaling,  shifting,  and  interchanging 
of  integrals  are  exactly  as  in  the  preceding  section 
and  need  not  be  repeated  here.  The  final  quadra¬ 
ture  formula  is  given  as 
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This  formula  requires  evaluations  of  the  func¬ 
tion  /  and  4N^  X  N,  evaluations  of  g,  while  the 
function  h  needs  to  be  calculated  4A4  X  N,  X  K 
times. 


Representative  Results 

The  results  of  this  section — energies  of  excited 
P  and  D  states  of  the  helium  atom — were  selected 
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to  assess  the  accuracy  of  both  the  method  and  the 
computer  program.  Their  physical  relevance  will 
be  discussed  in  a  forthcoming  publication.  The 
evaluation  of  eigenenergies  of  the  helium  atom  has 
a  long  history.  After  groundbreaking  work  by 
Hylleraas  [5,  6]  and  Pekeris  [7,  8],  the  use  of 
perimetric  coordinates  has  become  almost  custom- 
ary  for  bound-  and  resonance-state  calculations 
(see,  e.g.,  [9-11]).  To  our  knowledge,  only  the 
S-state  and  the  P-state  energies  have  actuality  been 
calculated  so  far.  Because  of  the  present  interest  in 
atomic  processes  in  plasmas,  all  Coulomb  interac¬ 
tions  in  the  Hamiltonian  were  replaced  in  the 
present  work  by  screened  interactions  of  the  De- 
bye-Hiickel  type  In  this  scheme,  one 

approximates  matrix  elements  with  pure  Coulomb 
interactions  to  a  high  degree  of  accuracy  by  setting 
the  parameter  D  equal  to  a  very  large  number.  In 
the  present  examples,  D  has  been  chosen  equal  to 
2.0  X  10^  when  comparison  with  Coulombic  re¬ 
sults  was  wanted. 

As  in  the  earlier  work,  the  form  of  the  wave 
functions  was  chosen  as 

=  E  Q  ;•  kir^Piicos 

i.j.k 

±r^Pj_(cos  (6) 

Here,  a  indicates  the  spin  multiplicity.  Singlet 
states  are  associated  with  the  upper  ( +  )  sign,  and 
triplet  states,  with  the  negative  sign.  The  letter  L 
indicates  the  overall  orbital  angular  momentum 
and  is  the  Legendre  polynomial  of  order  L. 
While  the  linear  parameters  are  obtained  from 
the  diagonalization  of  the  Hamiltonian  matrix,  the 
best  choice  of  the  nonlinear  parameters  a^,  Pj,  and 
yj^  is  an  open  question.  Different  strategies  were 
used  in  the  previous  publications  [9,  10].  In  the 
present  work,  we  follow  the  guidelines  of  the 
former  with  the  exception  that  we  scale  the  a,  j8, 
and  y  parameters  in  groups  until  satisfaction  of 
the  quantum  virial  theorem  has  been  achieved, 
because  we  hold  the  correct  distribution  of  kinetic 
and  potential  energy  for  an  important  property  of 
stationary  states.  The  results  of  these  calculations 
are  shown  in  Table  I,  rows  3-5.  The  entries  of  row 
3  are  very  close  to  the  data  in  row  2.  This  is  to  be 
expected  because  with  a  Debye  parameter  as  large 
as  2  X  10^  the  Debye  potential  is  almost  identical 
to  the  Coulomb  potential.  The  calculations  were 
performed  with  56  basis  functions  for  the  P  states 
and  84  for  the  D  states.  The  quadrature  formula 


TABLE  I _ 

For  various  values  of  the  Debye  parameter  D,  the 
binding  energies  of  the  lowest  states  of  overall  P 
and  D  symmetry  of  the  helium  atom  in  a  Debye 
plasma  environment  are  calculated  (rows  3-5) 
and  compared  to  previously  available  data  for  the 
case  of  only  Coulomb  interactions  (row  2). 


D  in  au 

V 

3p 

00 

2.12384® 

2.13316® 

2.05579'= 

2.05580*= 

2  X  10® 

2.12353 

2.13308 

2.05557 

2.05560 

200 

2.10861 

2.11816 

2.04072 

2.04075 

20 

1.98121 

1.99020 

1.91864 

1.91875 

The  results  of  rows  3-5  were  obtained  from  the  nonrela- 
tivistic  Hamiltonian  in  which  all  Coulomb  interactions  were 
replaced  by  Debye  potentials  e~^/^/r. 

®  Nonrelativistic  calculations  by  Thakkar  and  Smith  [10]. 

^  Experimental  data  taken  from  [12]. 

for  the  spatial  integration  was  used  with  six  (P 
states)  or  seven  (D  states)  quadrature  points  and 
weights  for  each  of  the  three  radial  coordinates. 

It  should  be  emphasized  that  the  results  of 
Table  I  can,  in  principle,  all  be  calculated  from  the 
original  formula  by  Calais  and  L5wdin  [Eq.  (1)]. 
Numerical  quadrature  is  not  required  in  the  com¬ 
putation  of  D  states  of  two-electron  systems  even 
when  all  Coulomb  interactions  are  replaced  by 
Debye  potentials.  To  employ  the  quadrature  for¬ 
mula  in  cases  where  an  analytical  formula  is  not 
available,  the  quadrature  formula  has  been  used  to 
calculate  two-electron  matrix  elements  in  which 
the  term  has  been  replaced  by  a  parameterized 
polarization  potential  [13].  These  potentials  contain 
products  such  as 

1  —  exp(  —  ar^)  1  —  exp(— ar|) 


Analytic  evaluation  is  no  longer  possible.  Al¬ 
though  the  numerators  are  rather  steep  cutoff 
functions,  and,  hence,  difficult  to  integrate  numeri¬ 
cally,  the  Laguerre  quadrature  yielded  in  all  cases 
considered  an  accuracy  of  three  digits  with  20 
Laguerre  points  and  five  digits  or  better  with  52 
points.  Product  wave  functions  of  very  diffuse 
Slater-type  orbitals  were  included  in  the  studies, 
but  no  systematic  investigation  has  been  done  yet. 
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ABSTRACT:  We  study  the  interrelation  between  molecular-shape  changes  and 
configurational  transitions  in  isolated  grafted  chains  ("polymer  mushrooms")  under 
geometrical  confinement.  The  confinement  effects  are  due  to  a  finite-size  obstacle, 
representing  a  simple  model  for  the  tip  of  an  atomic-force  microscope.  Using  an  off-lattice 
polymer  model,  we  monitor  how  the  molecular  shape  of  a  short  chain  is  affected  by  the 
geometry  of  the  tip.  We  show  the  occurrence  of  shape  transitions  as  the  chain  "tries  to 
dodge"  the  approaching  obstacle.  These  transitions  can  be  correlated  with  changes  in  the 
nature  of  the  dominant  chain  conformations.  Using  a  descriptor  of  self-entanglements  in 
the  polymer  chain,  we  propose  a  "phase  diagram"  indicating  the  regions  characterized 
by  free  chains,  confined  chains,  chains  that  can  "escape"  the  tip,  and  chains  whose  escape 
is  "frustrated,"  The  results  help  in  better  understanding  polymer  behavior  under 
compression,  and  they  can  be  useful  for  designing  interphases  with  targeted  properties. 
©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  519-530,  1997 


Introduction 

The  study  of  macromolecules  at  solid-liquid 
interfaces  finds  applications  in  the  modeling 
of  important  phenomena  such  as  film  adhesion, 
lubrication,  and  oil  recovery  [1].  In  addition,  the 
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behavior  of  polymers  confined  within  interfaces  is 
central  for  understanding  the  dynamics  in  cell 
membranes  and  the  stability  of  the  vesicles  used  in 
drug  delivery  [2]. 

The  models  commonly  used  to  study  polymers 
at  interfaces  include  two  basic  features:  (a)  a  num¬ 
ber  of  chain  molecules  grafted  onto  a  flat  surface 
[3-9],  and  (b)  various  perturbations  acting  on  the 
chains,  e.g.,  compression  or  a  liquid  flow  [10-19]. 
In  the  limit  of  low  surface  coverage,  one  can  ne¬ 
glect  chain-chain  interactions  and  reduce  the 
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model  to  a  single  grafted  chain  under  various 
deformations,  i.e.,  a  '"polymer  mushroom"  [20].  In 
this  work,  we  deal  with  the  deformation  of  a 
permanently  grafted  polymer  mushroom  under 
confinement. 

The  configurational  properties  of  geometrically 
confined  polymer  mushrooms  has  received  a  good 
deal  of  attention  recently  [11,  20-24] ,  In  the  case  of 
confining  grafted  chains  to  an  infinite  slab,  we 
have  shown  that  pressure  affects  differently  the 
various  features  that  characterize  the  shape  of  a 
polymer  [24].  For  instance,  it  is  found  that  molecu¬ 
lar  size,  molecular  anisometry,  and  the  complexity 
of  the  chain's  self-entanglements  do  not  behave 
identically  during  configurational  changes  [24], 
(The  notion  of  "molecular  size"  is  reserved  in  this 
work  for  the  mean  three-dimensional  space  actu¬ 
ally  spanned  by  the  distribution  of  atoms  in  a 
polymer  configuration.  It  must  not  be  confused 
with  the  "contour  length"  of  the  polymer  chain  or 
the  number  of  monomers.) 

Geometrical  confinement  can  also  be  used  to 
study  other  phenomena.  For  instance,  a  polymer 
mushroom  compressed  by  a  finite-size  obstacle  can 
be  used  to  simulate  the  effect  on  macromolecules 
of  a  scanning- tunneling  microscope  or  atomic-force 
microscope  tip  (AFM-tip)  [20,  21].  Recent  studies 
predicted  the  occurrence  of  marked  changes  in 
molecular  size  as  the  chain  "tries  to  dodge"  an 
approaching  AFM-tip,  These  changes  can  take  the 
form  of  "escape  transitions"  or,  under  special  con¬ 
ditions,  hysteresis  [20,  21].  Previous  studies  in  the 
literature  have  analyzed  these  configurational 
transitions  by  looking  at  the  scaling  regimes  in  the 
molecular  size  [3,  20,  21].  These  analyses,  based  on 
mean-field  approximations,  use  the  notion  that  a 
chain  can  be  broken  up  into  a  number  of  "blobs" 
whose  effective  size  is  predicted  from  the  partial 
number  of  monomers  (i.e.,  using  scaling  argu¬ 
ments)  [3].  This  approach,  however,  breaks  down 
in  short  polymers  [3,  10]  or  in  arbitrarily  long 
polymers  under  strong  compression  [21].  In  addi¬ 
tion,  previous  work  has  neglected  other  aspects  of 
polymer  shape  that  do  not  necessarily  behave  in 
the  same  fashion  (cf.  [24]).  In  the  present  work,  we 
want  to  explore  these  molecular-shape  transitions 
in  more  detail  for  systems  where  scaling  argu¬ 
ments  [3]  are  not  applicable. 

In  practice,  we  analyze  the  deformation  of  a 
short  polymer  chain  by  an  approaching  AFM-tip. 
Within  this  context,  we  address  the  following 


questions: 

(a)  Are  there  recognizable  configurational  tran¬ 
sitions  when  the  effect  of  confinement  is 
purely  entropic  (i.e.,  when  the  AFM-tip 
changes  only  the  configurational  space  and 
not  the  energy  of  the  chains)?  This  situation 
corresponds  to  the  simplest  polymer  model, 
where  monomers  present  only  a  purely  re¬ 
pulsive  (excluded-volume)  interaction  and 
there  are  no  energy  barriers. 

(b)  Can  molecular  features,  other  than  molecu¬ 
lar  size,  detect  the  occurrence  of  configura¬ 
tional  transitions  during  compression? 

(c)  What  are  the  shape  features  characteristic  of 
the  dominant  chain  configurations  found 
under  all  possible  confinement  conditions? 
Is  there  a  finite-size  analog  to  a  "phase 
diagram,"  where  the  configurational 
"phases"  have  distinct  molecular  shape  fea¬ 
tures? 

We  address  these  issues  by  performing  com¬ 
puter  simulations  of  an  AFM-tip  interacting  repul¬ 
sively  with  the  polymer  mushroom.  The  AFM-tip 
consists  of  a  truncated  cone  with  a  flat  disklike 
face  exposed  to  the  grafting  surface.  The  polymer 
chain  is  modeled  as  an  off-lattice  continuum  with 
excluded-volume  interaction.  Within  the  specifica¬ 
tions  of  this  simple  model,  we  explore  the  com¬ 
pression  of  the  polymer  mushroom  under  AFM- 
tips  with  variable  disk  radii  and  distance  to  the 
grafting  surface. 

The  work  is  organized  as  follows:  The  next 
section  discusses  the  models  used  for  the  tip  and 
the  chain,  as  well  as  the  sampling  of  configura¬ 
tional  space.  A  following  section  discusses  the 
evaluation  of  the  mean  values  for  molecular-shape 
descriptors.  Another  section  presents  the  results 
for  shape  transitions  and  configurational  changes. 
Further  comments  and  conclusions  are  found  in 
the  closing  section. 


Configurations  of  a  Model  Polymer 
Mushroom  with  Confinement 

We  deal  with  a  single  polymer  chain,  perma¬ 
nently  grafted  by  a  terminal  point  to  a  flat,  impen¬ 
etrable  surface.  The  grafted  chain  has  a  constant 
number  of  monomers  n.  The  polymer  is  approxi- 
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mated  by  a  bead-and-string  "necklace"  model, 
consisting  of  n  monomers  ("beads")  and  a  con¬ 
stant  bond  length  I  between  consecutive  beads 
[25].  In  absence  of  confinement,  the  linear  chain  is 
allowed  to  adopt  all  possible  conformations  sub¬ 
ject  to  two  constraints:  (a)  the  chain  does  not 
penetrate  the  grafting  surface,  and  (b)  the 
monomers  satisfy  an  excluded  volume  condition. 
The  excluded  volume  is  a  repulsive  interaction 
between  monomers,  where  nonbonded  beads  are 
kept  at  distances  of  at  least  (the  radius  of 
excluded  volume).  Configurations  are  found  as 
long  as  <  21.  (In  the  limit  21,  the  chain 

becomes  a  rigid  rod.) 

In  this  work,  we  deal  only  with  the  effects  of 
confinement  on  one  type  of  polymer  chain.  There¬ 
fore,  r^^,l,  and  n  are  taken  as  constants.  To  pro¬ 
duce  chains  whose  dimensions  are  comparable  to 
grafted  oligopeptides  in  the  presence  of  a  "good 
solvent,"  we  have  chosen  I  =  3.8  A,  =  3.43  A, 
and  n  =  25.  The  value  is  taken  from  the  repul¬ 
sive  part  of  the  Sutherland  potential  of  the 
molecule  of  CO2  [26].  Its  value  is  representative  of 
the  effective  mean  size  of  an  amino  acid  residue  in 
a  polypeptide  chain  in  a  good  solvent. 

The  chain  is  terminally  grafted  to  the  plane  by 
the  first  bead,  which  is  taken  as  the  origin  for  the 
coordinate  framework.  The  first  bond  (between 
beans  "1"  and  "2")  is  chosen  perpendicular  to  the 
plane,  and  it  defines  the  "z"  axis.  (This  particular 
grafting  condition  is  only  introduced  for  simplify¬ 
ing  the  computation  of  self-entanglements  [24].  It 
has  little  influence  on  the  accessible  mean  molecu¬ 
lar  shapes.) 

The  25-bead  polymer  chains  are  modeled  by  an 
off-lattice  continuum.  The  accessible  molecular 
shapes  are  known  for  "free"  (unconfined)  linear 
chains  [25,  27]  and  chains  confined  to  a  slab  de¬ 
fined  by  two  parallel  planes  [24].  In  our  present 
case,  a  finite-size  obstacle  introduces  new  con¬ 
straints  on  the  accessible  configurations. 

The  confining  AFM-tip  is  modeled  as  a  large 
truncated  cone,  pointing  toward  the  grafting  sur¬ 
face.  The  base  is  a  chosen  as  circular  disk  parallel 
to  the  surface,  centered  over  the  grafted  terminal 
end  of  the  chain.  The  shape  of  the  tip  is  specified 
by  three  parameters,  as  illustrated  in  Figure  1.  The 
distance  between  the  base  of  the  tip  and  the  graft¬ 
ing  plane  is  indicated  by  L  and  the  radius  of  the 
base  is  p.  The  conical  shape  is  defined  by  the  angle 
OL.  In  this  work,  only  one  value  of  the  angle  was 
considered  (a  =  45°). 


The  parameters  L  and  p  control  the  confine¬ 
ment.  A  number  of  special  cases  are  noted: 

(i)  For  a  fixed  L  value  (L  >  /),  confinement 
within  a  slab  is  ensured  when  p  =  (n  —  2)Z, 
representing  the  maximum  distance  a 
grafted  chain  can  be  stretched  parallel  to 
the  surface.  Therefore,  strong  confinement 
corresponds  to  the  case  of  large  p  and 
small  L  values. 

(ii)  For  a  fixed  p  value  (  p  >  0),  the  effects  of 
confinement  will  disappear  if  L  >  (n  —  1)1. 
(In  practice,  they  disappear  at  smaller  L 
values,  vide  infra).  At  this  point,  we  find 
the  mean  shape  of  the  free  grafted  chain. 
We  refer  to  this  case  as  the  "free"  or  "un¬ 
confined"  chain  limit. 

(iii)  For  an  arbitrary  L  value,  the  unconfined 
chain  limit  is  also  reached  if  the  disk  is 
vanishingly  small  (  p  ^  0).  The  limits  (ii) 
and  (iii)  provide  a  test  for  the  accuracy  of 
the  simulations. 

The  AFM-tip  is  a  repulsive  barrier  to  the  poly¬ 
mer  mushroom.  Therefore,  "acceptable"  configu¬ 
rations  must  have  neither  bead  nor  bond  penetrating 
the  tip.  This  condition  is  checked  bead  by  bead  as 
the  chain  is  built.  We  proceed  as  follows:  From  the 
second  bead,  we  start  a  random  walk,  with  con¬ 
stant  step-length  1.  Let  =  (x^,  1/,,  2,)  be  the  posi¬ 
tion  vector  of  a  bead  that  meets  the  excluded 
volume  criteria.  This  tentatively  accepted  bead  lies 
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FIGURE  1.  Geometrical  parameters  defining  the  shape 
of  the  AFM-tip  model. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


521 


ARTECA 


outside  the  tip  if 

x]  +  yf  >  [  p  -f  (z^-  -  Dtan  a]^,  for  >  L. 

(1) 

Whenever  Eq.  (1)  is  not  satisfied,  the  partial  chain 
conformation  is  rejected  and  the  search  restarts. 
However,  two  bonded  beads  could  still  lie  outside 
the  tip,  yet  their  bond  intersects  it.  Our  model 
excludes  also  this  possibility.  Whenever  two  con¬ 
nected  beads  with  positions  and  rj„i  satisfy  Eq. 
(1),  one  must  ensure  that  all  points  r  along  their 
bond  lie  outside  the  three-dimensional  space  occu- 
pied  by  the  tip,  i.e., 

r  =  r,_i  +  f(r,  -  r;_i)  ^  for  all  t  e  [0,1]. 

(2) 

This  condition  is  satisfied  if  the  following  second- 
degree  equation  for  t: 

x,.(f)"  +  Yiitf  =  [p  +  (Z;(f)  -  Dtan  af,  (3) 

has  no  real  solutions  within  t  e  [0,1],  where  Qj(0 
=  cji_i  +  tiq^  —  qi-i),  with  Q  =  X,  Y,  Z  used 
when  q  =  X,  y,  z,  respectively.  Whenever  a  par¬ 
tially  built  chain  satisfies  Eq.  (3)  with  t  e  [0, 1],  it 
is  rejected  and  the  search  restarts  with  a  chain 
grown  anew  from  the  grafted  terminal. 

Our  procedure  samples  the  configurational 
space  of  the  confined  grafted  chains  by  generating 
uncorrelated  conformers,  (Our  approach  is  essentially 
the  simplest  Monte  Carlo  search  [28],  where  zero- 
energy  conformers  are  accepted  and  infinite- 
energy  conformers  are  rejected.)  When  the  grafted 
polymers  are  "grown"  in  the  presence  of  the  AFM- 
tip,  all  chains  that  fail  the  criterion  for  excluded 
volume  or  tip  repulsion  are  abandoned  and  not 
continued.  We  retain  only  those  chains  that  are 
built  by  adding  all  beads  sequentially  without  a 
single  rejection.  These  off-lattice  simulations  are 
computationally  demanding,  but  they  produce  a 
convenient  set  of  independent  conformers.  Simple 
statistics  can  then  be  used  to  compute  mean 
molecular-shape  properties  and  study  the  effect  of 
confinement  on  them.  A  few  thousand  "accepted" 
conformers  are  enough  to  reach  accurate  estimates. 

Our  approach  poses  a  heavy  demand  on  the 
random  number  generators  used  for  sampling  con¬ 
formations.  One  must  ensure  that  the  generator  is 
not  exhausted  when  the  configurational  space  be¬ 
come  very  small,  e.g.,  for  small  L  and  large  p 
values.  The  performance  of  different  generators 


was  tested  and  compared.  Overall,  the  most  reli¬ 
able  generator  is  the  one  by  LTcuyer  [29,  30], 
which  combines  two  random  sequences,  with  ad¬ 
ditional  shuffling  to  produce  a  period  of  length  ca. 
2  X  10^®.  This  is  the  only  generator  that  gives  ac¬ 
ceptable  results  in  the  limit  of  strong  confinement. 
At  lower  confinement,  an  optimized  "shuffled" 
version  [29]  of  the  built-in  generator  in  DEC- 
Fortran  (MTH$Random,  with  a  period  of  ca.  10^^) 
produces  statistically  identical  results,  with  a 
slightly  faster  performance.  (All  computations  were 
carried  out  on  a  DEC  255/233  AXP  workstation.) 

As  an  illustrative  example  of  representative 
polymer  configurations.  Figure  2  shows  a  series  of 
AFM-tips  with  the  same  radius  ( p  =  20  A),  ap¬ 
proaching  the  grafted  polymer.  The  "accepted" 
configurations  satisfy  the  criteria  above  with  r^^  = 
3.43  A  and  n  =  25. 

Figure  2  illustrates  a  number  of  features  ob¬ 
served  in  our  simulations:  First,  as  the  tip  ap- 


L=20A 


L=15A 


L=  lOA 


L=lA 


FIGURE  2.  Typical  deformations  of  grafted  chains 
caused  by  an  approaching  AFM-tip  with  constant  radius, 
p  =  20  A. 
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proaches,  deformed  chains  fold  underneath  it  (i.e., 
they  become  effectively  "confined").  However,  if 
compression  is  increased  (and  the  shape  of  the  tip 
allows  it),  the  polymer  chain  can  "escape"  the 
approaching  tip  by  stretching  to  the  border  of  the 
disk  and  folding  outside  the  confining  space.  (Ac¬ 
tual  physical  escape,  of  course,  is  not  possible 
because  the  chain  is  permanently  grafted.)  In  the 
next  sections,  we  discuss  in  detail  these  transfor¬ 
mations  by  calculating  the  mean  molecular-shape 
features  of  sampled  chains. 

Note  that  we  do  not  analyze  the  dynamics  of 
deformation  and  "escape"  from  the  tip.  Rather,  we 
extract  static  information  by  computing  a  large 
number  of  polymer  configurations,  compatible 
with  the  imposed  geometrical  constraints.  More¬ 
over,  all  phenomena  studied  in  this  work  are 
strictly  due  to  the  interplay  between  the  spatial 
configurations  and  their  associated  shape.  There  is 
no  energy  cost  to  the  chain  deformations,  since  the 
polymer  has  only  a  hard-sphere  potential.  All  ef¬ 
fects  on  molecular  shape  are  exclusively  entropic 
and  due  to  a  change  in  configurational  space  vol¬ 
ume  during  compression.  In  this  sense,  the  present 
approach  complements  other  models  used  in  the 
literature  to  study  confinement  on  polymer  mush¬ 
rooms  [20,  21]. 


Mean  Molecular-Shape  Features  for 
Grafted  Polymer  Configurations 

An  "accepted"  chain  configuration,  denoted  by 
Ky,  can  be  characterized  by  a  number  of  molecu¬ 
lar-shape  features.  These  properties  include  some 
that  depend  exclusively  on  nuclear  (or  "bead") 
coordinates,  such  as  molecular  size  and  anisome- 
try.  Other  important  features  can  be  conveyed  by 
characterizing  the  chain's  self-entanglements, 
which  depend  both  on  nuclear  coordinates  and 
chain  connectivity  [31].  (Self-entanglements  refer 
to  the  "twists  and  turns"  in  a  given  linear  chain 
configuration.  In  principle,  entanglements  do  not 
depend  explicitly  on  the  molecular  size.)  Recently, 
we  showed  that  the  simultaneous  use  of  molecular 
size  and  self-entanglement  can  provide  a  thorough 
characterization  of  confined  polymer  conforma¬ 
tions  [24],  as  well  as  configurational  transitions 
during  molecular  dynamics  [32].  These  shape  fea¬ 
tures  do  not  necessarily  behave  the  same  way 
during  compression  [24].  In  this  work,  we  follow 
both  molecular  size  and  self-entanglements  when 


the  AFM-tip  approaches  the  grafted  chains.  Our 
goal  is  to  test  whether  chain  self-entanglements 
can  be  used  to  monitor  the  occurrence  of  configu¬ 
rational  "escape"  transitions. 

Molecular  size  is  measured  by  the  radius  of 
gyration.  Let  R^vi  be  the  position  of  the  center  of 
mass  and  {r^,  /  =  1, 2, . . . ,  n}  the  coordinates  for  the 
n  beads,  both  measured  with  their  origin  at  the 
grafted  terminal.  The  "instantaneous"  square  ra¬ 
dius  of  gyration  for  a  given  conformer  is 

=  (4) 

We  also  follow  the  mean  height  of  the  polymer  (h), 
measured  from  the  grafting  plane.  Using  the  coor¬ 
dinates  in  Eq.  (1),  we  have 

1  ^ 

h  ^  _  y  2  where  z,  >  0,  Vi.  (5) 

^i=i 

Chain  self-entanglements  take  into  account  the 
"trace"  of  polymer  chain,  in  addition  to  the 
monomer  positions.  An  effective  measure  of  entan¬ 
glement  complexity  can  be  given  in  terms  of  the 
probability  distribution  of  overcrossings,  denoted  by 
{ Ajy(n)}  [33],  This  distribution  measures  the  prob¬ 
ability  of  producing  a  two-dimensional  projection  of 
the  n-bead  polymer  with  N  "double  points"  (i.e., 
bond-bond  "crossings"),  averaged  over  all  possi¬ 
ble  projections.  The  more  "entangled"  the  chain, 
the  higher  is  the  probability  that  large  overcross¬ 
ing  numbers  dominate  the  projections.  In  contrast, 
a  rodlike  chain  is  unentangled  [i.e.,  AqM  is  the 
largest  contribution  to  the  probability]. 

A  single  descriptor  for  the  atove  distribution  is 
the  mean  number  of  overcrossings  N.  In  simple  words, 
N  is  the  "effective"  number  of  bond-bond  cross¬ 
ings  in  an  arbitrary  projection  of  the  grafted  chain, 
averaged  over  all  directions  in  space.  For  any  arbitrary 
projection  of  a  linear  polymer  configuration,  the 
maximum  possible  number  of  overcrossings  is 
maxN  =  (n  -  3)(n  -  2)/2  [33].  Depending  on  the 
shape  of  the  conformation  considered,  most  of  the 
projections  may  yield  low  N  values  (if  the  poly¬ 
mer  is  swollen  or  unentangled)  or  high  N  values 
(if  the  polymer  js  entangled).  The  mean  number  of 
overcrossing,  N,  will  usually  be  much  smaller 
than  maxN  (cf.  discussion  in  [27]  and  [33]).  In 
practice,  N  is  evaluated  as  an  average  over  many 
randomized  projections  (e.g.,  5  X  10^)  [24,  33]. 

If  the  ensemble  of  "accepted"  chain  configura¬ 
tions  is  denoted  by  {K^},  then  {DfK^)}  is  the  set  of 
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values  accessible  to  a  given  shape  descriptor.  Since 
the  sampled  configurations  are  statistically  inde¬ 
pendent,  the  configurationally  averaged  shape  de¬ 
scriptors  are  computed  as  the  simple  mean  within 
the  {Kj}  ensemble.  The  configurational  averages 
are  denoted  by  <  D). 

For  the  objectives  of  this  work,  we  focus  mostly 
on  the  changes  in  entanglements  during  confine¬ 
ment,  as  characterized  by  <N>.  Some  exact  and 
numerical  properties  are  known  for  this  descriptor 
in  the  case  of  long  random  walks  [27,  34-36]. 
Below,  we  discuss  the  behavior  of  (  N )  (and  con¬ 
trast  it  with  that  of  and  (h))  as  the  chains 

are  deformed  by  the  approaching  AFM-tip. 


Dependence  of  Configurationally 
Averaged  Shape  Descriptors  on 
Geometrical  Confinement 

Averaged  shape-descriptor  values  were  com¬ 
puted  for  various  tip  geometries,  as  specified  by 
the  disk  radius  p  and  the  tip-surface  separation  L. 
For  a  descriptor  D,  the  average  <  D(  p,  L))  is  eval¬ 
uated  over  chains  with  n  and  fixed.  Its  value 
for  'Tree"  grafted  chains  (i.e.,  with  no  confinement 
effects)  is  (D}free-  As  mentioned  above,  the  con¬ 
fined  chains  reach  (D}free  in  two  limit  conditions, 
when  the  top  is  far  away  from  the  surface  and 
when  the  tip  is  vanishingly  thin,  i.e., 

lim  <D(p,L)>=  lim  <D(p,L)>  =  (6) 

p->0  L^oo 

Our  simulations  verify  these  limits  within  the  ac¬ 
curacy  allowed  by  the  algorithm. 

To  understand  the  overall  changes  in  molecular 
shape  due  to  the  presence  of  the  tip,  we  discuss 
two  cases  separately:  (A)  the  compression  of 
grafted  chains  by  tips  with  constant  radius;  and 
(B)  confinement  due  to  tips  at  constant  distance  to 
the  surface,  but  variable  radius. 

COMPRESSION  EFFECTS  V^ITH  VARIABLE  L, 
AT  CONSTANT  p  VALUES 

The  basic  phenomenon  discussed  here  is  illus¬ 
trated  in  Figure  2.  In  agreement  with  results  in  the 
literature  for  long  chains  with  a  stretching  poten¬ 
tial  [20,  21],  we  find  conformations  where  the 
chains  escape  from  under  the  tip.  As  we  show 
below,  these  "transitions"  can  be  accompanied  by 
distinctive  changes  in  molecular  shape. 


First,  it  must  be  noted  that  only  some  molecu¬ 
lar-shape  descriptors  are  useful  for  following  shape 
transitions.  If  one  analyzes  the  mean  heights  of  the 
chains,  {h),  only  a  continuous  behavior  is  ob¬ 
served  during  compression.  Figure  3  shows  the 
results  for  the  variation  of  (ft)  as  a  function  of  the 
tip -surface  separation  L,  for  various  values  of 
disk  rad^ius  p.  The  tip  with  the  smallest  radius 
( p  =  3  A,  curve  a)  causes  small  confinement  ef¬ 
fects.  In  this  case,  the  mean  height  shows  the 
smallest  variation  in  terms  of  L.  In  contrast,  the 
confinement  effects  are  magnified  for  radii  p  >  40 
A  (curves  d  and  e).  (In  fact,  the  polymer  mush¬ 
room  is  effectively  confined  to  a  slab  for  all  L  and 
p  >  40  A.)  However,  as  the  radius  changes,  (h) 
shows  no  distinctive  features  that  can  be  associ¬ 
ated  with  shape  transitions. 

Figure  4  shows  a  more  interesting  behavior 
when  we  monitor  chain  self-entanglements  in 
terms  of  (N).  An  analysis  of  Figure  4,  reveals  the 
following  features: 

(a)  Confinement  effects  on  entanglements  dis¬ 
appear  for  L  >  40  A  (the  free  chain  limit). 

(b)  For  all  disk  radii,  (N)  shows  a  maximum  at 
a  particular  tip-surface  separation.  The 
maximum  is  the  largest  when  the  confine¬ 
ment  effects  are  maximized  (large  p)  and 
becomes  vanishingly  small  as  confinement 
effects  disappear  (small  p). 


Tip-surface  separation  L  (A) 


FIGURE  3.  Variation  in  the  configurationally  averaged 
height  of  the  grafted  chains  <h>,  as  a  function  of  tlp- 
surface  separation  L,  for  various  tip  radii  p.  The  labels 
stand  for  the  following  values  of  radius:  (a)  p  =  3  A;  (b) 
p  =  11  A;  (c)  p  =  19  A;  (d)  p  =  40  A;  (e)  p  =  70  A.  The 
last  two  curves  are  indistinguishable  at  the  scale  used. 
Error  bars  are  omitted  for  clarity. 
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Tip” surface  separation  L  (A) 

FIGURE  4.  Variation  in  configurationally  averaged 
mean  overcrossing  numbers  for  grafted  chains,  <A/>,  as 
a  function  of  tip -surface  separation  L,  for  various  tip 
radii  p.  The  curve  labels  are  the  same  as  in  Figure  3. 
Error  bars  are  omitted  for  clarity. 


(c)  The  tip-surface_separation  L  leading  to  the 
maximum  in  <  N  )  changes  with  the  radii  p. 

(d)  When  the  compression  effects  are  very 
strong  (i.e.,  at  very  small  L  values  for  a 
fixed  p),  the  mean  overcrossing  number  de¬ 
creases  fast. 

Recently,  we  discussed  the  occurrence  of  a  max¬ 
imum  in  self-entanglements  for  chains  confined 
within  a  slab  ( p  oo)  [24].  Using  similar  argu¬ 
ments,  we  can  interpret  some  of  the  results  above 
as  follows: 

(i)  Within  a  slab  ( p  oo)^  compression  can 
produce  compact  configurations.  As  L  de¬ 
creases,  chains  are  "forced"  to  fold  over 
themselves,  thus  increasing  the  complexity 
of  entanglements. 

(ii)  As  L  decreases  for  large  p,  the  configura¬ 
tional  space  accessible  to  the  chains  be¬ 
comes  smaller.  Eventually,  further  com¬ 
pression  squeezes  the  chains  closer  to  the 
surface,  thus  preventing  self-entangle¬ 
ments.  As  a  result,  ( N  >  diminishes  un^r 
strong  compression.  In  other  words,  (N) 
must  exhibit  a  maximum  as  function  of  L. 

o 

(iii)  For  smaller  tip  radii,  e.g.,  p  <  40  A,  the 
chains  can  access  to  a  larger  configura¬ 
tional  space  by  "stretching"  to  the  border 
of  the  tip  and  folding  away  from  the  disk 


(cf.  Fig.  2).  These  "partly  escaped"  chains 
avoid  compactification  by  the  tip,  thereby 
reducing  the  complexi^  of  their  entangle¬ 
ments.  As  a  result,  (N)  shows  a  smaller 
maximum  than  the  one  for  large  p. 

(iv)  For  very  small  disk  radii,  the  chains  can 
"dodge"  the  tip  for  all  L  values,  except 
when  compression  is  extreme  (L  values 
close  to  /).  These  "fully  escaped"  chains 
effectively  resemble  the  free  grafted  chains 
over  a  large  range  of  L  values. 

The  averaged  molecular  size,  given  in  terms  of 
the  mean  radius  of  gyration,  parallels 

qualitatively  the  results  of  the  mean  number  of 
overcrossings.  Figure  5  shows  the  variation  of 
as  a  function  of  L.  Whereas  (N)  exhibits 
a  maximum,  the  radius  of  gyration  exhibits  a  mini¬ 
mum  during  compression,  confirming  that  chains 
become  more  compact.  Note,  however,  that  the 
position  of  the  critical  points  are  not  coincident,  i.e., 
maximum  compactess  and  maximum  entangle¬ 
ment  complexity  are  not  found  simultaneously  for 
the  same  tip  geometry.  (This  agrees  with  the  re¬ 
sults  for  compression  within  slabs  [24]).  Therefore, 
the  behavior  of  (N)  can  provide  new  insights  on 
shape  transitions. 

_Summarizing,  the  presence  of  a  maximum  in 
<  N  >  for  large  p  values  is  a  indication  of  a  "shape 
transition"  between  compact  entangled  chains,  on 
the  one  hand,  and  squeezed  (minimally  entangled) 
chains,  on  the  other.  Flowever,  the  shift  of  such  a 


Tip-surface  separation  L  (A) 


FIGURE  5.  Variation  in  configurationally  averaged 
radius  of  gyration  for  grafted  chains,  as  a 

function  of  tip -surface  separation  L,  for  various  tip  radii 
p.  The  curve  labels  are  the  same  as  in  Figure  3.  Error 
bars  are  omitted  for  clarity. 
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maximum  with  p  suggests  a  new  regime,  where 
chains  reach  intermediate  compactness  and  entan¬ 
glements  through  '"escape  transitions,"  To  fully 
characterize  this  regime,  we  discuss  below  the 
behavior  of  <  N  >  as  a  function  of  the  disk  radius, 
at  constant  values  of  the  tip-surface  separation. 

COMPRESSION  EFFECTS  WITH  VARIABLE  p, 

AT  CONSTANT  L  VALLES 

We  consider  now  a  "virtual"  deformation, 
whereby  the  tip  remains  at  a  constant  distance  to 
the  grafting  surface  while  decreasing  its  radius  at 
the  base.  The  typical  shape  changes  accompanying 
this  "transformation"  are  displayed  in  Figure  6, 
for  variable  p  and  a  small  tip-surface  separation 
(L  =  7A). 

Figure  6  illustrates  how  chains  adopt  flattened 
conformations  at  strong  confinement  (e.g.,  p  =  40 
A  and  p  =  30  A).  In  this  case,  since  L  is  very 
small,  the  chain  cannot  fold  over  itself  and  it  is 
forced  to  stretch.  However,  since  the  disk  is  too 
wide  (or,  conversely,  the  chain  is  too  short),  only 
few  beads  can  marginally  "escape"  the  tip.  We  use 
the  term  "frustrated  chains"  to  denote  these  con¬ 
figurations,  where  the  polymer  "fails"  to  escape 
the  tip  by  any  significant  amount,  even  though  the 


P=40A 


Nl^ 

p  =  30A 


P=20A 


p  =  10A 


FIGURE  6.  Typical  deformations  of  grafted  chains 
caused  by  a  series  of  AFM-tips  with  variable  radius  p,  at 
a  constant  tip -surface  separation  (L  =  7  A). 


chains  are  stretched.  (The  term  is  used  in  only 
qualitative  analogy  to  the  concept  of  "frustration" 
in  rugged  potential  energy  surfaces  [37],  where 
conformations  trapped  in  local  minima  are  unable 
to  escape  to  lower  minima.  In  our  case,  "frustrated 
chains"  cannot  reach  the  entangled  conformations 
that  are  characteristic  of  free  grafted  chains.) 

The  occurrence  of  "frustrated  chain"  configura¬ 
tions  depends  strongly  on  the  values  of  L  and  p. 
For  small  disk  radii  p,  most  of  the  chain  can 
effectively  avoid  the  tip  by  adopting  "escaped 
chain"  configurations,  in  contrast  to  "frustrated 
chains"  (cf.  bottom  of  Fig.  6).  On  the  other  hand,  if 
confinement  is  reduced  by  increasing  the 
tip-surface  separation,  then  chains  will  be  able  to 
fold  over  themselves.  As  a  result,  their  mean  con¬ 
figurations  will  resemble  "confined  chains"  with 
high  entanglement,  rather  than  "escaped"  or 
"frustrated"  chains.  In  summary,  we  will  expect 
the  regime  of  "frustrated  chains"  to  disappear, 
regardless  of  the  tip  radius,  whenever  the 
tip-surface  separation  is  larger  than  some  critical 
value  Lq, 

Our  computer  simulations  convey  these  notions 
quantitatively.  Figure  7  shows  the  configuration- 
ally  averaged  mean  overcrossing  numbers  for  the 
"transformation"  in  Figure  6.  (The  case  L  =  7  A  is 
the  tightest  confinement  that  we  have  been  able  to 
handle.)  Despite  the  large  error  bars  associated 
with_configurational  fluctuations,  a  rich  behavior 
in  <N)  is  clear.  The  results  in  Figure  7  can  be 


FIGURE?.  Variation  in  configurationally  averaged 
mean  overcrossing  numbers  for  grafted  chains,  <A/>,  as 
a  function  of  tip  radius  p,  at  a  constant  tip -surface 
separation  (L  =  7  A).  Error  bars  represent  95% 
confidence  intervals. 
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summarized  as  follows: 

(a)  Chain  entanglements  are  constant  for  p  >  50 
A,  where  the  chain  is  effectively  confined 
within  a  slab.  We  find  here  "confined"'  con¬ 
formations. 

(b)  Chain  entanglements  diminish  when  the 
disk  radius  decreases  from  50  to  ca.  30  A, 
where  they  reach  a  minimum.  In  this  case, 
the  dominant  chain  conformations  become 
minimally  entangled  by  stretching  to  the  bor¬ 
der  of  the  disk,  where  only  some  terminal 
beads  can  escape.  We  find  here  "frustrated 
chain"  conformations. 

(c)  Chain  entanglements  increase  again  when 
the  disk  radius  is  reduced  below  ca.  30  A.  In 
this  case,  an  increasingly  larger  section  of 
the  chain  can  escape  the  tip  and  "fold"  on 
the  outside.  We  find  here  "escaped  chain" 
conformations. 

Figure  8  shows  the  corresponding  results  for  the 
variation  in  the  chain  mean  radius  of  gyration 
under  the  same  conditions.  In  this  particular  case, 
th£re  is  a  perfect  match  between  the  behavior  of 
(N)  and  As  the  radius  of  the  tip  is 

reduced,  the  maximum  in  marks  the  tran¬ 

sition  from  the  stretched  ("frustrated")  configura¬ 
tions  with  large  molecular  size  to  the  folded 
("escaped")  configurations  that  are  nwre  compact. 

The  presence  of  a  minimum  in  <  N )  vs.  p  is  a 
characteristic  of  small  L  distances.  Figure  7  can  be 


15 


12 


11.5  -1 - 1 - 1 - 1 - 1 - 1 

0  20  40  60  80  100 
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FIGURE  8.  Variation  in  the  configurationally  averaged 
radius  of  gyration  for  grafted  chains,  as  a 

function  of  tip  radius  p,  at  a  constant  tip  -surface 
separation  (L  =  7  A).  Error  bars  represent  95% 
confidence  intervals. 


contrasted  with  the  behavior  of  chains  at  larger 
tip-surface_distances.  Figure  9  shows  the  depen¬ 
dence  of  <N>  vs.  p  over  a  range  of  L  distances. 
(Error  bars  are  omitted  for  clarity.  The  smoothed 
curves  in  Fig.  9  are  pol3momial  fittings  to  the 
computed  mean  values  in  shape  descriptors.)  As  L 
increases,  the  curves  follow  a  qualitatively_similar 
pattern  until  L  ~  14  A.  For  L  >  14  A,  (N)  does 
not  have  a  minimum  for  the  nonzero  tip  radius.  In 
other  words,  the  dominant  configurations  for  L  > 
14  A  pass  directly  from  "confined  chains"  to 
"escaped  chains."  The  critical  tip-surface  separa¬ 
tion  for  this  transition  can  thus  be  estimated  at 
Lq  ~  3.71,  for  the  present  chains  where  /  —  1.11 
(The  value  of  will  depend  strongly  on  the 
excluded  volume  interaction.)  Concluding,  the 
slopes  features  represented  by  the  minima  in 
(N(p,  D)  vs.  p  appear  to  be  specifically  associ¬ 
ated  with  the  regime  of  "frustrated  chain"  config¬ 
urations. 

“PHASE  DIAGRAM’’  FOR  THE  REGIMES  OF 

ACCESSIBLE  MOLECULAR  SHAPES 

The  results  above  can  be  summarized  by  repre¬ 
senting  the  molecular-shape  features  accessible  to 
polymer  mushrooms  in  a  two-dimensional  ( p,  L)- 
plot.  We  can  regard  this  plot  as  the  analog  of  a 
"phase  diagram"  for  the  configurations  of  a 


FIGURE  9.  Variation  In  the  configurationally  averaged 
overcrossing  numbers  for  grafted  chains  (N),  as  a 
function  of  tip  radius  p,  at  various  tip -surface 
separations.  The  labels  stand  for  the  following  values  of 
tip -surface  separation:  (a)  /_  =  8  A;  (b)  /_  =  10  A;  (c) 

L  =  12  A;  (d)  /.  =  14  A;  (e)  L  =  17.5  A;  (f)  L  =  22.5  A.  For 
clarity,  the  last  two  curves  (e,  f)  are  indicated  by  thin 
lines  and  error  bars  have  been  removed. 
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finite-size  system.  In  this  context,  a  descriptor  of 
chain  entanglements  plays  the  role  of  an  order 
parameter  for  labeling  the  '"phases."'  [The  analogy 
is  only  qualitative  since  the  shape  properties  vary 
smoothly.  The  separation  into  sharp  regions  with 
distinct  dominant  conformations  is  somewhat  arbi¬ 
trary.  In  our  convention,  the  boundar^s  are  de¬ 
fined  by  the  ( p,  L)  values  satisfying  (  N(  p,  L)>  = 

(N)fre.] 

Figure  10  shows  all  the  results  for  (Nip,  L)>. 
The  global  minimum  is  found  at  small  L  and 
intermediate  p  values,  whereas  the  global  maxi¬ 
mum  appears  at  large  p  and  intermediate  L  val¬ 
ues,  The  flat  regions  on  this  surface  represent 
shapes  similar  to  those  of  free  grafted  chains. 

Using  Figure  10,  we  can  classify  the  configura¬ 
tions  according  to  distinctive  molecular-shape 
properties.  The  resulting  "phase  diagram"  based 
on  chain  entanglements  is  shown  in  Figure  11.  (A 
comparable,  yet  not  identical,  diagram  can  be  given 
in  terms  of  the  radius  of  gyration.)  Frustrated 
chains  (with  minimal  entanglement)  appear  at 
L  <  Lq,  whereas  fully  escaped  chains  (with  low 
entanglement)  can  only  be  found  at  small  p  val¬ 
ues.  The  separation  between  the  escaped  chains 
and  the  free  chains  (indicated  by  a  dotted  line)  is 
artificial,  since  a  faraway  AFM-tip  with  small  disk 
radius  will  not  affect  the  polymer  mushroom.  The 
remaining  boundary  curves  correspond  to  the  tip 
geometries  for  which  the  mean  molecular  shape  of 
the  chains  coinci(tes  with  that_of  a  free  grafted 
mushroom  [i.e,,  (Ni  p,  L))  =  (N)free]- 


Figure  11  summarizes  the  results  of  this  work. 
The  "phase  diagram"  conveys  the  interplay  be¬ 
tween  configurational  and  shape  transitions  due  to 
variable  confinement.  The  main  features  of  this 
diagram  should  remain  unaltered  for  longer  chains 
(whenever  the  same  repulsive  potential  is  used)  or 
for  chains  that  are  not  perpendicularly  grafted  to 
the  surface. 


Further  Comments  and  Conclusions 

In  this  work,  we  have  shown  that  chain  self¬ 
entanglements  can  be  used  for  monitoring  the  oc¬ 
currence  of  configurational  transitions  (i.e.,  a 
change  in  the  nature  of  the  dominant  configura¬ 
tions  due  to  an  external  perturbation).  Conceptu¬ 
ally,  the  main  result  is  the  fact  that  the  dominant 
configurations  can  indeed  be  classified  into  regimes 
(or  "phases")  by  using  the  behavior  of  molecular- 
shape  descriptors.  The  results  in  Figure  11  use 
mean  overcrossing  numbers  as  a  descriptor.  A 
similar  analysis  can  be  done  by  using  other  shape 
features,  although  "phase"  boundaries  will  shift 
since  distinct  descriptors  are  generally  uncorre¬ 
lated. 

The  diagram  in  Figure  11  should  be  a  valid 
qualitative  guide  to  predict  the  behavior  of  chains 
interacting  with  obstacles.  Note,  however,  that  the 
nature  and  number  of  the  "shape  regimes"  will 
change  with  the  model  chosen  for  the  AFM-tip. 
For  curved-surface  tips,  or  tips  with  indentations. 


FIGURE  10.  Dependence  of  the  mean  number  of  overcrossings  on  geometrical  confinement  over  a  wide  range  of 
values  of  tip  radius  p  and  tip -surface  separation  L. 
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0  50 

p(A) 

FIGURE  1 1 .  Qualitative  configurational  “phase” 
diagram  indicating  the  molecular-shape  regimes 
encountered  during  compression  by  a  flat  AFM-tip.  The 
boundaries  are  estimated  from  the  r^ults  in  Figure  10, 
using  the  convention  (N(p,L)}  =  {N)uee-  The  dotted 
line  indicates  that  the  distinction  between  free  chains 
and  “escaped”  chains  is  blurred  for  large  L  and  small  p 
values. 


a  more  complex  pattern  of  shape  transitions  can  be 
expected  [21]. 

The  present  results  from  chain  simulations  com¬ 
plement  previous  work  in  the  literature  that  is 
only  valid  for  very  long  chains  [20,  21].  Our  classi¬ 
fication  of  short-chain  conformers  according  to 
dominant  shape  features  could  not  have  been  done 
by  using  scaling  arguments. 

Our  results  contribute  to  a  better  understanding 
of  ''escape  transitions"  [20],  since  we  have  now 
shown  that  configurational  changes  analog  to 
"escape  transitions"  can  also  be  found  in  polymers 
with  purely  repulsive  potentials.  As  discussed 
above,  our  configurational  and  shape  transitions 
are  strictly  entropic  phenomena.  All  "accepted" 
conformations  have  equal  (zero)  energy,  as  there  is 
neither  attraction  between  monomers  or  nor  an 
energetic  cost  for  stretching  a  chain.  Nevertheless, 
the  results  show  that  "escape  transitions"  can  still 
be  recognized  by  the  behavior  of  molecular-shape 
descriptors.  Until  now,  these  interesting  phenom¬ 
ena  had  only  been  found  before  in  more  compli¬ 
cated  polymer  potentials  [20,  21].  Note,  however, 
that  our  model  has  no  energy  barriers  and  thus 
shape  transitions  will  not  exhibit  hysteresis  [21]. 
The  behavior  of  chain  self-entanglements  under 
compression  for  realistic  monomer-monomer  in¬ 


teractions  remains  an  interesting  question  open  to 
study. 
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ABSTRACT:  We  describe  the  development  and  applications  of  a  new  electronic 
structure  method  that  uses  a  real-space  grid  as  a  basis.  Multigrid  techniques  provide 
preconditioning  and  convergence  acceleration  at  all  length  scales  and  therefore  lead  to 
particularly  efficient  algorithms.  The  salient  points  of  our  implementation  include:  (i) 
new  compact  discretization  schemes  in  real  space  for  systems  with  cubic,  orthorhombic, 
and  hexagonal  symmetry  and  (ii)  new  multilevel  algorithms  for  the  iterative  solution  of 
Kohn-Sham  and  Poisson  equations.  The  accuracy  of  the  discretizations  was  tested  by 
direct  comparison  with  plane-wave  calculations,  when  possible,  and  the  results  were  in 
excellent  agreement  in  all  cases.  These  techniques  are  very  suitable  for  use  on  massively 
parallel  computers  and  in  0(N)  methods.  Tests  on  the  Cray-T3D  have  shown  nearly 
linear  scaling  of  the  execution  time  up  to  the  maximum  number  of  processors  (512).  The 
above  methodology  was  tested  on  a  large  number  of  systems,  such  as  the  molecule, 
diamond.  Si  and  GaN  supercells,  and  quantum  molecular  dynamics  simulations  for  Si. 
Large-scale  applications  include  a  simulation  of  surface  melting  of  Si  and  investigations 
of  electronic  and  structural  properties  of  surfaces,  interfaces,  and  biomolecules.  ©  1997 
John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  531-543,  1997 


Introduction 

Over  the  course  of  the  last  several  decades, 
plane-wave-based  methods  have  been  used 
to  perform  electronic  structure  calculations  on  a 
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wide  range  of  physical  systems.  The  Car- 
Parrinello  (CP)  and  other  iterative  methods  [1,  2] 
have  made  such  calculations  possible  for  systems 
containing  several  hundred  atoms  [3].  While  these 
methods  have  been  very  successful,  several  diffi¬ 
culties  arise  when  they  are  extended  to  physical 
systems  with  large  length  scales  or  containing 
first-row  or  transition-metal  atoms.  Special  tech- 
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niques  have  been  developed  to  handle  some  of 
these  problems.  Optimized  pseudopotentials  [4,  5], 
the  augmented-wave  method  [6],  plane  waves  in 
adaptive-coordinates  [7-9],  and  preconditioning 
combined  with  conjugate-gradient  techniques 
[10,11]  have  had  considerable  success;  however, 
these  techniques  are  still  constrained  by  the 
plane-wave  basis,  which  requires  periodic  bound¬ 
ary  conditions  for  every  system  and  fast  Fourier 
transforms  (FFTs)  to  efficiently  transform  between 
real  and  reciprocal  space.  In  particular,  FFTs  in¬ 
volve  nonlocal  operations  that  impose  constraints 
on  the  adaptability  of  these  algorithms  to  mas¬ 
sively  parallel  computer  architectures  because  they 
perform  best  on  problems  that  can  be  divided  into 
localized  domains. 

It  has  been  appreciated  for  some  time  that  there 
are  potential  advantages  to  performing  electronic- 
structure  calculations  entirely  in  real  space. 
Boundary  conditions  are  not  constrained  to  be 
periodic;  this  permits  the  use  of  nonperiodic 
boundary  conditions  for  clusters  and  a  combina¬ 
tion  of  periodic  and  nonperiodic  boundary  condi¬ 
tions  for  surfaces.  By  employing  nonuniform 
real-space  grids,  it  is  possible  to  add  resolution 
locally,  e.g.,  for  a  surface  or  cluster  calculation.  A 
basis  that  uses  a  high  density  of  grid  points  in 
regions  where  the  ions  are  located  and  a  lower 
density  of  points  in  vacuum  regions  can  lead  to 
order  of  magnitude  savings  in  the  basis  size  and 
total  computational  effort  [12].  More  importantly, 
the  use  of  a  real-space  basis  opens  up  the  possibil¬ 
ity  of  using  multigrid  iterative  techniques  to  ob¬ 
tain  solutions  of  the  Kohn-Sham  equations.  Multi¬ 
grid  methods  [13],  which  provide  automatic 
preconditioning  on  all  length  scales,  can  greatly 
reduce  the  number  of  iterations  needed  to  con¬ 
verge  the  electronic  wave  functions.  Furthermore, 
the  real-space  multigrid  formulation  does  not  in¬ 
volve  long-range  operations  and  is  particularly 
suitable  for  parallelization  and  0(N)  algorithms 
[14,  15],  because  every  operation  can  be  parti¬ 
tioned  into  hierarchical  real-space  domains. 

There  have  been  a  number  of  previous  real-space 
grid-based  electronic  structure  calculations.  The 
finite-element  method  was  applied  by  White  et  al. 
[16]  to  one-electron  systems.  They  used  both 
conjugate-gradient  and  multigrid  acceleration  to 
find  the  ground-state  wave  function.  Two  of  the 
present  authors  [12]  used  nonuniform  grids 
with  locally  enhanced  regions  in  conjunction  with 
multigrid  acceleration  to  calculate  the  electronic 
properties  of  atomic  and  diatomic  systems  with 


nearly  singular  pseudopotentials.  They  verified 
that  the  preconditioning  afforded  by  multigrid 
was  effective  in  multilength-scale  systems. 
Chelikowsky  et  al.  [17]  have  used  high-order 
finite-difference  methods  and  soft  nonlocal 
pseudopotentials  on  uniform  grids  to  study  the 
properties  of  Si  and  Ge  clusters.  The  present  au¬ 
thors  have  developed  a  real-space  grid  method 
with  multigrid  methods  and  tested  it  extensively 
against  plane-wave  results  [18].  Other  recent  real- 
space  work  includes  finite  differencing  on  warped 
grids  [19,  20],  wavelet  bases  [21,  22],  hierarchical 
nonlinear  grids  applied  to  diatomics  [23],  and  full 
potential  multigrid  calculations  for  atoms  and  di¬ 
atomics  [24]. 


Real-Space  Calculations 

Several  issues  that  are  absent  from  plane-wave 
or  orbital-based  methods  arise  when  using  a  real- 
space  grid  approach.  In  the  former  case  the  wave 
functions,  potentials,  and  the  electronic  density  are 
representable  in  explicit  basis  functions,  and  thus 
are  known  everywhere.  In  a  real-space  grid  imple¬ 
mentation,  these  quantities  are  known  only  at  a 
discrete  set  of  grid  points,  which  can  introduce  a 
spurious  dependence  of  the  Kohn-Sham  eigenval¬ 
ues,  the  total  energy,  and  the  ionic  forces  on  the 
positions  of  the  ions  with  respect  to  the  real-space 
grid.  We  have  developed  a  set  of  techniques  that 
can  overcome  these  difficulties  and  can  be  used  to 
compute  accurate  static  and  dynamical  properties 
of  large  physical  systems.  In  our  formalism  the 
wave  functions,  density,  and  potentials  are  di¬ 
rectly  represented  on  a  three-dimensional  real- 
space  grid  with  linear  spacing  and  number  of 
mesh  points  The  ions  are  represented  by 

soft-core  norm-conserving  pseudopotentials  [25]. 
Exchange  and  correlation  effects  are  treated  using 
the  local  density  approximation  (LDA)  of  density 
functional  theory. 

For  reasons  of  accuracy  and  computational  effi¬ 
ciency,  we  discretize  the  Kohn-Sham  equations  in 
a  generalized  eigenvalue  formulation: 

(1) 

A  and  B  are  the  components  of  the  Mehrstellen 
discretization  [26],  which  is  based  on  Hermite's 
generalization  of  Taylor's  theorem.  It  uses  a 
weighted  sum  of  the  wave  function  and  potential 
values  to  improve  the  accuracy  of  the  discretiza- 
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tion  of  the  entire  differential  equation,  not  just  the 
kinetic  energy  operator.  The  weights  for  the 
0(/2grid)  discretization  are  given  in  Table  L  We 
have  found  that  the  fourth-order  Mehrstellen  dis¬ 
cretization  produces  equivalent  or  better  accuracy 
than  the  sixth-order  central-finite-difference  one. 
The  higher  accuracy  of  the  Mehrstellen  approach  is 
achieved  by  using  a  nonseparable  discretization 
and  more  local  information  (second  nearest- 
neighbor  points,  for  example). 

We  have  found  that  pseudopotentials,  espe¬ 
cially  the  rather  hard  ones,  must  be  smoothed 
before  they  are  suitable  for  real  space  calculations. 
Smoothing  is  necessary  to  avoid  high-frequency 
variation  on  the  real-space  mesh  which  reduces 
accuracy  and  slows  convergence.  This  variation  is 
absent  from  plane-wave  calculations  because  the 
basis  cuts  off  the  large-G  components  of  the  poten¬ 
tials  exactly  [27].  To  achieve  this  same  cutoff  effect 
in  real  space,  we  use  the  following  filtering  proce¬ 
dure:  The  radial  pseudopotentials  are  Fourier- 
transformed  to  G  space.  They  are  multiplied  by  a 
cutoff  function  which  decreases  rapidly  beyond 
7r/{ah),  where  a  is  typically  in  the  range  1.5-2.0. 
The  filtered  potentials  are  then  back-transformed 
to  real  space.  We  have  found  that  the  Fourier- 
filtered  pseudopotentials  retain  the  accuracy  of 
those  used  in  plane-wave  calculations. 

To  test  the  smoothness  of  the  pseudopotentials, 
an  isolated  atom  or  a  bulk  system  is  displaced 
relative  to  the  grid.  The  magnitude  of  the  variation 
in  total  energy  is  a  measure  of  smoothness  of  the 
pseudopotentials.  In  Figure  1  we  show  the  change 
in  the  LDA  energy  of  a  carbon  atom  and  a  dia¬ 
mond  supercell  as  a  function  of  distance  from  a 
mesh  point.  In  addition,  the  degeneracy  pattern  of 
the  eigenvalues  and  the  ionic  forces  are  examined. 
If  the  pseudopotential  is  insufficiently  smooth,  the 
ionic  forces  in  a  symmetric  lattice,  for  example, 
will  not  be  small  (vanish). 

To  facilitate  comparisons  with  plane-wave  cal¬ 
culations,  we  define  an  energy  cutoff  for  the  real- 
space  calculation,  [Ry],  to  be  equal  to  that 

TABLE  I _ 


Mehrstellen  discretization  weights  in  3D 
(h  is  the  grid  spacing). 

Grid  point 

6h^*A 

6*B 

Central 

24 

6 

Nearest  neighbors 

-2 

1 

Second  neighbors 

-1 

0 

Displacement  along  grid  line  (units  of  h) 


FIGURE  1.  Variation  of  the  total  energy  as  a  carbon 
atom  and  a  64-atom  diamond  supercell  is  displaced 
rigidly  relative  to  the  grid  along  a  coordinate  axis.  The 
grid  spacing  h  is  0.336  bohr. 


of  a  plane-wave  calculation  that  uses  a  FFT  grid 
with  the  same  spacing  as  the  multigrid  calculation 
[28]. 

To  efficiently  solve  Eq.  (1),  we  have  used  multi¬ 
grid  iteration  techniques  that  accelerate  conver¬ 
gence  by  employing  a  sequence  of  grids  of  varying 
resolutions.  The  final  solution  is  obtained  on  a  grid 
fine  enough  to  accurately  represent  the  pseudo¬ 
potentials  and  the  electronic  wave  functions.  If  the 
solution  error  is  expanded  in  a  Fourier  series,  it 
may  be  shown  that  iterations  on  any  given  grid 
level  will  quickly  reduce  the  components  of  the 
error  with  wavelengths  comparable  to  the  grid 
spacing  but  are  ineffective  in  reducing  the  compo¬ 
nents  with  wavelengths  large  relative  to  the  grid 
spacing  [13,  29].  The  solution  is  to  treat  the  lower 
frequency  components  on  a  sequence  of  auxiliary 
grids  with  progressively  larger  grid  spacings, 
where  the  remaining  errors  appear  as  high- 
frequency  components.  This  procedure  provides 
excellent  preconditioning  for  all  length  scales  pre¬ 
sent  in  a  system  and  leads  to  very  rapid  conver¬ 
gence  rates.  The  operation  count  to  converge  one 
wave  function  with  a  fixed  potential  is  OfNgj.^^), 
compared  to  log  for  FFT-based 

approaches  [2]. 

To  summarize,  one  self-consistent  iteration  con¬ 
sists  of  a  multigrid  step  to  solve  Eq.  (1),  followed 
by  the  orthogonalization  of  the  orbibals  and  an 
update  of  the  electronic  density.  The  components 
of  the  effective  potential  that  depend  on  the  den¬ 
sity  (Hartree  and  exchange-correlation)  are  then 
recomputed  for  the  new  density,  and  the  process  is 
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repeated  until  self-consistency  is  reached.  The 
Hartree  potential  is  computed  by  solving  Poisson's 
equation  using  multigrid  iterations  on  the  corre¬ 
sponding  Mehrstellen  discretization.  The  nonlocal 
ionic  pseudopotential  is  evaluated  in  real  space 
using  the  Kleinman-Bylander  [30]  separable  form. 
The  operation  count  to  converge  one  wave  func¬ 
tion  with  a  fixed  potential  is  compared  to 

0(Ngj.id  log  Ngrid)  for  FFT-based  approaches  [2].  In 
addition,  all  operations  except  orthogonalization 
are  short-ranged,  which  allows  for  efficient 
parallelization  and  a  natural  implementation  of 
0(N)  algorithms  [14,  15]. 


Molecular  Dynamics 

Ab  initio  molecular  dynamics  (MD)  simulations 
require  accurate  ionic  forces  so  that  higher-order 
integrators  and  large  time  steps  can  be  used  to 
achieve  long  simulation  times.  If  the  grid  layout, 
whether  uniform  or  nonuniform,  does  not  change 
when  the  ions  move,  the  Hellmann-Feynman  (HF) 
force  theorem  [31]  applies  with  no  Pulay  correc¬ 
tions  [32].  However,  the  calculation  of  the  HF 
forces  on  a  real-space  grid  requires  careful  consid¬ 
eration  of  numerical  errors  because  the  derivative 
of  even  a  filtered  pseudopotential  can  introduce 
high-frequency  components.  The  quality  of  our 
forces  is  sufficient,  however,  for  even  long  simula¬ 
tions,  as  exemplified  in  Figure  2,  which  shows  that 
the  total  energy  for  a  64-atom  cell  of  bulk  Si  at 
1100  K  is  conserved  within  27  [xeY  per  atom. 


12.0 


7.0 


2.0 


0) 

•S  „3.o 


-8.0 


FIGURE  2.  Potential,  kinetic,  and  total  energy  of  a 
molecular  dynamics  simulation  of  a  64-atom  silicon  cell 
at  a  temperature  of  1 100  K.  Third-order  Beeman -Verlet 
integration  with  an  80-a.u.  time  step  was  used  for  the 
integration  of  the  ionic  equations  of  motion.  The  total 
energy  curve  is  multiplied  by  a  factor  of  1 00.  The  potential 
and  total  energies  have  been  shifted  by  251.171  a.u  so 
that  they  could  appear  together. 
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compact  nature  of  the  Mehrstellen  operator  re¬ 
quires  the  exchange  of  only  small  amounts  of  data 
between  nearest-neighbor  processors.  In  tests  of 
the  code,  ported  from  the  Cray  YMP-C90  to  the 
massively  parallel  Cray  T3D,  we  found  nearly  per¬ 
fect  load  balancing  (90%)  on  all  systems  tested. 
These  tests  used  up  to  512  processors. 


Test  Results 


Parallelization 

There  are  two  apparent  parallelization  schemes 
for  a  real-space  grid  method.  First,  the  orbitals 
could  be  distributed  over  processors  and  stored 
there.  Its  major  advantage  is  that  our  high  opti¬ 
mized  serial  code  would  require  few  changes. 
However,  the  orthogonalization  of  the  orbitals 
would  perform  inefficiently  because  entire  orbitals 
must  be  exchanged  between  processors.  Our  pre¬ 
ferred  scheme  is  data  decomposition,  where  real- 
space  domains  are  distributed  over  processors. 
Orthogonalization  becomes  sufficient  because 
overlap  integrals  can  be  computed  in  two  parts: 
the  intraprocessor  portion  is  fully  parallel,  and  the 
interprocessor  part  sends  only  the  intermediate 
sums  between  processors.  Just  as  important,  the 


A  significant  advantage  of  the  multigrid  method 
is  the  speed  of  convergence  to  the  electronic  ground 
state  for  a  given  initial  ionic  configuration.  For 
systems  requiring  a  small  energy  cutoff  or  of  small 
size,  the  speed  advantage  with  respect  to  CP-based 
methods  it  not  substantial.  However,  for  systems 
requiring  a  large  energy  cutoff,  or  of  large  dimen¬ 
sions,  this  advantage — as  measured  in  actual 
computational  time — is  typically  an  order  of  mag¬ 
nitude.  This  is  because  the  maximum  stable  time 
step  in  the  Car-Parrinello  method  must  be  much 
smaller  than  in  the  multigrid  approach.  Our  initial 
tests  of  the  real-space  multigrid  method  were  dis¬ 
cussed  previously  [12, 18]  and  included  a  full  com¬ 
parison  with  established  plane-wave  results. 

To  illustrate  the  ability  of  the  multigrid  method 
to  handle  ill-conditioned  systems,  test  calculations 
were  performed  on  a  64-atom  GaN  cell  in  the 


534 


VOL.  65,  NO.  5 


LARGE-SCALE  ELECTRONIC  STRUCTURE  PROBLEMS 


zinc-blende  structure  with  the  Ga  3d  electrons  in 
valence.  A  grid  spacing  of  0.14  bohr  was  used, 
which  corresponds  to  an  energy  cutoff  of  250  Ry  in 
a  plane-wave  calculation.  Only  the  F  point  was 
included  in  fc-space  sampling.  Starting  with  ran¬ 
dom  initial  wave  functions,  only  75  self-consistent 
steps  were  required  to  converge  the  total  energy. 
Recently,  several  calculations  have  been  per¬ 
formed  on  GaN  that  explicitly  included  the 
d-electrons  in  valence;  the  multigrid  results  are  in 
very  good  agreement  with  these  calculations  (see 
Table  II). 

In  addition,  the  energy  of  formation  of  the  Ga 
interstitial  impurity  in  a  64-atom  GaN  supercell 
was  calculated  to  be  7  eV.  The  ions  were  not 
relaxed  at  this  250-Ry  cutoff  in  order  to  conserve 
computer  resources.  Instead,  ionic  coordinates  from 
a  fully  relaxed  Ga  interstitial  calculation  with  the 
Ga  3d  electrons  in  core  were  used.  As  a  check,  the 
interstitial  was  also  fully  relaxed  with  the  Ga  3d 
electrons  in  valence,  but  at  a  160-Ry  cutoff  with  a 
slightly  softer  pseudopotential.  The  maximum 
difference  in  ionic  coordinates  between  the  two 

o 

relaxed  structures  was  0.1  A. 

The  present  results  for  the  Ga  interstitial  con¬ 
firm  our  previous  values,  obtained  with  the  Ga  3d 
electrons  in  core  [33];  namely,  the  Ga  interstitial  is 
one  of  the  low-energy  native  defects  in  GaN. 
Recent  experimental  results  indeed  find  indirect 
evidence  for  its  presence,  since  Ga  precipitates  are 
formed  upon  annealing  [34], 

We  have  also  studied  the  wurtzite  phase  of 
GaN  in  a  32-atom  supercell  with  the  d  electrons  in 
core.  The  discretization  in  the  hexagonal  plane  is 
straightforward  because  the  grid  lines  can  be 
mapped  onto  a  Cartesian  plane.  The  Kohn-Sham 
eigenvalues  computed  by  the  real-space  and 
plane-wave  methods  are  in  excellent  agreement. 

An  isolated  C^o  molecule  was  selected  as  an 
example  of  a  nonperiodic  system.  The  simulation 


TABLE  II _ 

Comparison  of  GaN  results  obtained  with  the 
Ga  3d  electrons  in  valence.^ 


Band  gap 

Cohesive  energy 

Full  potential  LMTO  [56] 

2.00 

10.89 

240  Ry,  plane  waves, 

2-atom  cell  [57] 

1.89 

Multigrid,  64-atom  cell 

1.88 

10.64 

®The  multigrid  calculation  uses  a  64-atom  supercell,  250-Ry 
cutoff,  and  r  point  sampling.  See  text  [eV]. 


cell  was  a  cube  of  length  23  bohrs  and  the  grid 
spacing  was  0.360  bohr.  The  initial  ionic  coordi¬ 
nates  were  generated  using  the  classical  Tersoff- 
Brenner  potential  [35],  and  the  electronic  wave 
functions  were  set  to  zero  on  the  boundaries  of  the 
cell.  After  the  convergence  of  the  electronic  sys¬ 
tem,  the  ions  were  relaxed  using  the  same  relax¬ 
ation  scheme  as  before.  Two  distinct  bond  lengths 
were  found  in  the  final  structure,  corresponding  to 
the  carbon-carbon  single  and  double  bonds.  There 
were  twice  as  many  single  bonds  as  double  bonds, 
and  the  average  double  and  single  bond  lengths 
were  1.39  and  1.44  A,  compared  to  1.41  and  1.45  A 
obtained  in  a  previous  CP  calculation  [36]  for  the 
C^o  solid.  The  standard  deviations  of  the  bond- 
length  distributions  were  on  the  order  of  10“^  A  in 
both  calculations.  The  experimental  values  for  the 
solid  are  1.40  and  1.45  A,  respectively. 

Next,  we  examine  the  efficiency  of  multigrid 
acceleration.  Table  III  illustrates  the  convergence 
properties  as  a  function  of  grid  resolution  for  an 
8-atom  diamond  cell.  The  observed  convergence 
rates  are  largely  independent  of  the  energy  cutoff. 
The  number  of  self-consistent  field  (SCF)  steps 
required  to  converge  the  density  is  also  nearly 
independent  of  the  system  size.  At  an  equivalent 
cutoff  of  35  Ry,  the  multigrid  method  required  17 
steps  to  converge  the  total  energy  of  the  8-atom 
diamond  cell  to  a  tolerance  of  10“^.  When  the 
same  calculation  was  performed  for  a  64-atom  cell 
only  20  SCF  steps  were  needed. 

Finally,  we  compare  the  convergence  rates  for 
four  iterative-diagonalization  schemes:  steepest 
descents,  multigrid,  steepest  descents  with  peri¬ 
odic  subspace  rotation,  and  multigrid  with 
periodic  subspace  rotation.  Subspace  rotation 
(subspace  diagonalization)  speeds  up  iterative 
diagonalization  schemes  because  it  efficiently  re¬ 
solves  exact  or  near  degeneracies  between  orbitals. 


TABLE  III _ 

Multigrid  convergence  tests  in  an  8-atom 
supercell.^ 


Grid  spacing 

Cutoff 

Totai  energy 

SCF  steps 

0.421 

25.0 

-44.87968 

22 

0.336 

35.0 

-45.03331 

17 

0.280 

60,0 

-45.06240 

21 

0.210 

110.0 

-45.06605 

26 

®Grid  spacing  [bohr],  equivalent  plane-wave  cutoff  [Ry], 
total  energy  [a.u.],  and  the  number  of  steps  to  converge  the 
density. 
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In  the  tests,  the  initial  wave  functions  were  ran¬ 
domly  chosen,  a  worse-case  scenario,  and  the  time 
step  was  the  same  for  each  scheme.  Ordinary 
multigrid  rapidly  decreases  the  error,  but  the  con¬ 
vergence  stalls  after  15  steps  because  the  den¬ 
sity  is  changing  slowly.  Periodic  subspace  rotation 
sorts  out  the  small  degeneracies  between  states, 
and  when  several  unoccupied  states  are  included 
in  the  calculation,  it  mixes  in  new  information 
from  the  unoccupied  sector.  This  greatly  improves 
the  convergence  rate,  even  for  steepest  descents; 
see  Figure  3. 


FIGURE  3.  Convergence  rates  for  a  64-atom  diamond 
cell  with  a  substitutional  N  impurity  at  a  63-Ry  equivalent 
cut-off.  The  convergence  rate,  log^o^^  ~  plotted 

against  the  number  of  self-consistent  field  (SCF)  steps. 
Random  initial  wave  functions  were  used  with  a  constant 
initial  density.  SD  represents  convergence  rates  for  the 
steepest  descents  algorithm,  MG  is  for  multigrid,  SD-SD 
is  steepest  descents  with  subspace  diagonalizations, 
and  MG-SD  is  multigrid  with  subspace  diagonalizations. 


Surfaces  of  Wurtzite  GaN 

The  Perdew-Zunger  parametrization  [37] 
of  the  Ceperley- Alder  form  [38]  of  the  exchange- 
correlation  energy  was  chosen.  Nonlocal,  norm- 
conserving  pseudopotentials  [25,  39,  40]  were 
included  using  the  Kleinman-Bylander  approach 
[30].  For  gallium,  a  recently  developed  norm- 
conserving  pseudopotential,  which  includes  a 
nonlocal  core  correction  [41]  and  permits  an 
efficient  description  without  the  need  for  an  ex¬ 
plicit  treatment  of  the  rf-valence  electrons  was 
used.  For  nitrogen,  a  standard  pseudopotential 
with  a  neutral  configuration  as  the  atomic  refer¬ 
ence  for  all  states  was  employed.  These  pseudopo¬ 
tentials  have  been  demonstrated  to  reproduce  ac¬ 
curately  the  bulk  properties  of  GaN  [42]. 

The  calculated  bulk  properties  are  presented  in 
Table  IV.  The  theoretical  lattice  parameters  of  both 
the  zinc-blende  and  wurtzite  forms  agree  very 
well  with  experiment;  a  similar  level  of  accuracy  is 
expected  for  the  surface  and  interface  calculations 
described  below.  The  cohesive  energy  of  GaN  and 
a-Ga  is  10.42  and  3.4  eV,  respectively.  The  binding 
energy  of  nitrogen  molecule  is  5.87  eV.  The 
theoretical  heat  of  formation  of  GaN  is  thus  1.15 
eV,  in  good  agreement  with  the  experimental  value 
of  1.14  eV. 

The  (0001)  surface  is  a  polar  surface.  The  ideal 
surface  has  dangling  bonds  with  charge  distribu¬ 
tions  that  are  energetically  unfavorable.  If  kineti- 
cally  permitted,  the  surface  atoms  will  relax  and 
redistribute  the  ''dangling-bond"  charge  density 
so  as  to  satisfy  the  valences  of  all  of  the  surface 


TABLE  IV _ 

Calculated  bulk  properties  of  zinc-blende  and  wurtzite  nitride  semiconductors.^ 


Zinc-blende 


AIN 

GaN 

InN 

ao(A) 

4.37  (4.38) 

4.52  (4.5) 

5.01  (4.98) 

Bq  (MBar) 

2.02  (2.02) 

1 .70  (1 .90) 

1 .58  (1 .37) 

Wurtzite 

a  (A) 

3.09  (3.11) 

3.20  (3.19) 

3.55  (3.54) 

da 

1 .62  (1 .60) 

1 .63  (1 .63) 

1.63  (1.61) 

u  (units  of  c) 

0.378  (0.382) 

0.376  (0.377) 

0.375 

Bo  (MBar) 

1 .99  (2.02) 

1 .69  (1 .95,  2.37) 

1 .62  (1 .26,  1 .39) 

^  The  values  of  the  gap  at  the  T-pt  (Ey)  and  of  the  valence-band  width  are  the  LDA  results.  Note  that  the  LDA  indirect  gap 

in  zinc-blende  At  N  is  3.2  eV.  Experimental  values  are  in  brackets  and  follow  Ref.  52. 
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species.  This  can  be  achieved  via  structural  relax¬ 
ation  (with  a  concomitant  charge  transfer  between 
surface  atoms).  Recent  experimental  studies  have 
revealed  the  presence  of  2  X  2  reconstructions  dur¬ 
ing  growth  [43],  In  this  study,  we  restrict  our 
attention  to  these  reconstructions  on  the  gallium- 
terminated  face. 

We  considered  eight  different  2x2  reconstruc¬ 
tions,  namely  the  ideal  relaxed  structure  and  gal¬ 
lium  vacancy,  nitrogen  adatom,  gallium  adatom, 
and  nitrogen-trimer,  each  on  the  T3  and  H3  (hol¬ 
low)  sites;  see  Figures  4  and  5.  We  find  that  adatom 
reconstructions  are  the  most  energetically  favor¬ 
able  of  the  studied  reconstructions.  In  particular, 
in  the  gallium-rich  case  the  gallium  adatom  on 
the  T3  site  has  the  lowest  energy,  while  in  the 
nitrogen-rich  case,  the  nitrogen  adatom  on  the  H3 
is  the  most  stable. 

The  slabs  contained  four  bilayers  of  GaN, 
three  bilayers  of  which  were  relaxed.  Relaxations 
in  the  third  bilayer  were  negligible,  indicating 
that  a  sufficiently  thick  slab  was  employed. 
Pseudo-hydrogen  of  charge  0.75  passivated  the 
nitrogen-terminated  face  [44].  If  the  work  functions 
of  the  two  surfaces  are  different,  then  the  periodic 
boundary  conditions  of  the  supercell  enforce  a 


FIGURE  4.  Schematic  top  view  of  the  2  x  2 
gallium-adatom  reconstruction  on  the  (0001)  surface  of 
GaN.  The  gallium  adatom  (in  gray)  sits  above  a  nitrogen 
in  the  second  layer  on  the  T3  site.  The  numbers  denote 
gallium  atoms  on  the  surface,  while  the  primed  numbers 
denote  nitrogen  atoms  in  the  first  subsurface  layer. 


FIGURE  5.  Schematic  top  view  of  the  2  x  2 
nitrogen-adatom  reconstruction  on  the  (0001)  surface  of 
GaN.  The  nitrogen  adatom  sits  above  the  hollow  (H3) 
site.  The  numbers  denote  gallium  atoms  on  the  surface, 
while  the  primed  numbers  denote  nitrogen  atoms  in  the 
first  subsurface  layer. 


common  electrostatic  potential  in  the  vacuum  and 
result  in  an  unphysical  change  in  the  electric  po¬ 
tential  in  the  vacuum  region  equal  to  the  differ¬ 
ence  between  the  two  work  functions.  A  large 
vacuum  region  of  10  A  served  to  reduce  the  size  of 
this  field.  Calculations  including  explicitly  a  cor¬ 
rection  for  this  field  did  not  show  significant  dif¬ 
ferences  in  the  final  equilibrium  geometries  [45]. 
An  orthorhombic  supercell  with  grid  spacing  of 
0.21,  0.22,  and  0.20  a.u.  in  the  x,  y,  and  z  direc¬ 
tions,  respectively,  was  employed.  Eight  different 
reconstructions  were  studied  using  P-point  sam¬ 
pling.  For  the  lowest  energy  structures  of  each 
class,  i.e.,  nitrogen  adatom  on  H3  site,  nitrogen 
trimer  on  the  T3  site,  etc.,  a  further  calculation 
using  two  k  points  in  the  irreducible  Brillioun 
zone  was  carried  out. 

Figure  6  displays  the  relative  surface  energies 
for  the  five  different  reconstructions.  Under 
gallium-rich  conditions  and  for  most  of  the  range 
of  the  chemical  potential,  the  lowest  energy  recon¬ 
struction  is  the  gallium-adatom  on  the  T3  site; 
while  under  nitrogen-rich  conditions,  the  recon¬ 
struction  with  the  nitrogen-adatom  on  the  H3  site 
is  preferred  energetically. 
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1.1 
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-  Ga-vacancy 

•••  (1X1)  relaxed 
N-adatom  on  H, 


site 


- N-trimer  on  T3  site 

- Ga-adatom  on  T3  site 


Nitrogen  rich  |Li^^  Gallium  rich 

FIGURE  6.  Formation  energy  vs.  gallium  chemical  potential  for  the  (0001)  surface.  The  maximum  chemical  potential 
for  N  (Ga)  is  equal  to  the  energy  per  atom  calculated  for  N2  (bulk  Ga).  Two  k  points  are  sampled  in  the  irreducible 
Brillioun  zone. 


The  gallium-adatom  reconstruction  is  shown  in 
Figure  4.  The  adatom  sits  on  the  T3  site  above  the 
subsurface  nitrogen  atom;  the  gallium-nitrogen 

o 

distance  is  2.46  A.  The  gallium-adatom  surface- 
gallium  bond-length  is  2.40  A  which  is  very  close 
to  the  Ga-dimer  distance  in  a-gallium  (2.44  A). 
The  bond  angle  between  two  gallium  surface  atoms 
and  the  adatom  is  82°.  The  in-plane  relaxation  of 
the  surface  atoms  is  negligible;  the  only  observed 
relaxation  of  the  first  bilayer  is  along  the  z  direc¬ 
tion.  The  three  gallium  atoms  bonded  to  the 
adatom  remain  coplanar  to  a  significant  degree, 
while  the  remaining  unbonded  gallium  surface 
atom  relaxes  into  the  slab.  The  proximity  of  the 
gallium  adatom  to  the  nitrogen  subsurface  atom 
lowers  the  electrostatic  energy  and  makes  this  par¬ 
ticular  configuration  energetically  more  favorable 
than  the  gallium  adatom  reconstruction  with  the 
adatom  on  the  hollow  site. 

The  nitrogen-adatom  reconstruction  is  shown  in 
Figure  5.  The  adatom  sits  on  the  hollow  (H3)  site. 
The  nitrogen-adatom  surface-gallium  bond-length 
is  2.0  A  and  is  very  close  to  the  Ga-N  bond  length 
distance  in  GaN  bulk  (1,96  A).  The  bond  angle 
between  two  gallium  surface  atoms  and  the  nitro¬ 
gen  adatom  is  89°.  The  relaxation  perpendicular  to 
the  surface  is  very  similar  to  that  in  the  gallium- 


adatom  reconstruction.  This  configuration  is  stabi¬ 
lized  because  it  separates  the  nitrogen  adatom 
from  the  subsurface  nitrogen  atom  and  thereby 
lowers  the  electrostatic  repulsion. 


Onset  of  Melting  of  Si  (100)  Snrface 

We  now  turn  to  the  melting  of  the  Si  (100) 
surface.  This  is  a  classic  problem,  whose  micro¬ 
scopic  details  are  not  well  understood.  This  is 
particularly  true  of  covalent  materials  like  Si, 
whose  surfaces  are  characterized  by  reconstruc¬ 
tions,  steps,  islands,  and  other  surface  defects — all 
of  which  are  expected  to  play  a  role  in  the  micro¬ 
scopic  aspects  of  the  melting  process.  As  a  first 
step  toward  this  goal,  we  have  carried  out  simula¬ 
tions  of  the  melting  process  of  the  Si  (100)  surface 
with  finite-temperature  ab  initio  methods.  The 
temperature  of  the  Si  ions  were  controlled  with  a 
chain  of  five,  linked  Nose-Hoover  thermostats  [46]. 
Because  the  electrons  are  always  quenched  back 
onto  the  Born-Oppenheimer  surface  after  every 
time  step,  no  additional  thermostat  is  needed  for 
the  electrons.  The  in-plane  cells  consisted  of  16 
atoms  per  layer  and  the  basic  time  step  of  the 
simulation  was  100  a.u. 
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The  melting  of  the  Si  (100)  surface  has  previ¬ 
ously  been  investigated  with  classical  molecular 
dynamics  simulations  [47],  and  the  thermodynam¬ 
ics  of  melting  has  recently  been  investigated  by  ab 
initio  techniques  [48].  However,  it  is  now  known 
that  classical  potentials  do  not  reproduce  well  sev¬ 
eral  aspects  of  the  melting  process.  Upon  melting. 
Si  goes  from  a  fourfold  coordinated  semiconductor 
to  a  metallic  liquid.  The  density  of  the  liquid  is 
about  10  percent  higher  than  in  the  solid.  The 
average  coordination  number  is  between  6  and  7, 
which  is  rather  low  for  a  metal.  This  low  coordina¬ 
tion  number  is  indicative  of  persistent  remnants  of 
covalent  bonding. 

A  relaxed  six-layer  slab  of  Si,  with  the  bottom 
layer  saturated  by  fixed  H  atoms,  was  heated  up 
to  1500  K  over  a  1.5  ps  period  of  time.  Over  the 
next  0.3  ps,  the  average  temperature  was  raised  to 
1700  K,  which  is  the  melting  temperature  of  bulk 


Si.  Sample  configurations  as  a  function  of  time  are 
shown  in  Figure  7. 

We  have  characterized  the  structural,  thermo¬ 
dynamic,  and  electronic  properties  changes  that 
take  place  during  the  melting  process.  However, 
here  we  will  only  briefly  summarize  some  of  the 
main  features,  while  a  full  description  will  be 
published  elsewhere  [49].  As  the  melting  of  the 
layers  proceeds,  there  is  a  change  in  the  density  of 
the  top  layers  as  function  of  the  vertical  distance. 
While  initially  each  of  the  layers  is  well  separated 
and  defined,  the  atomic  density  of  the  top  layers 
becomes  much  more  uniform  at  the  end  of  the 
simulation  time.  During  this  time,  the  pair  distri¬ 
bution  fimction  of  the  top  four  layers  becomes 
much  more  liquidlike,  with  the  second  peak  be¬ 
coming  essentially  washed  out.  At  the  same  time, 
the  angular  distribution  function  g^iO)  changes  by 
decreasing  its  large  peak  about  the  tetrahedral 


1.00  ps  1.50  ps 


2.00  ps  2.50  ps 


FIGURE  7.  Side  view  of  sample  configurations  of  Si  (1 00)  surface  at  1 700  K  as  the  top  layers  melt. 
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angle,  broadening  and  developing  a  second  peak 
in  the  50-60°  range,  as  shown  in  Figure  8.  All  of 
this  is  characteristic  of  a  melted  system.  As  a  probe 
of  the  electronic  properties,  we  have  computed  the 
local  density  of  states  for  the  bulk  and  liquid 
domains.  These  are  compared  in  Figure  9.  Note  the 
relatively  large  buildup  of  states  at  the  Fermi  level 
for  the  liquid,  characteristic  of  a  metal,  and 
the  decrease  in  number  of  states  for  the  bulk 
semiconductor. 


Ongoing  Work 

ORDER-/V  REAL-SPACE  IMPLEMEFVTATIOrVS 

For  very  large  systems,  the  complexity  of  the 
calculations  increases  as  in  computer 

time  due  to  orthogonalization,  and  the  memory 

Pair  Distribution 


Angular  Distribution 


Degrees 


FIGURE  8.  Angular  correlation  function  before  and  after 
melting.  Note  the  decrease  in  the  large  peak  near  the 
tetrahedral  angle. 


increases  as  Several  approaches  that 

reduce  both  types  of  scaling  behavior  to 
have  been  proposed.  These  0(N)  methods  rely  on 
the  assumption  that  physial  properties  are  local, 
and  that  the  orbitals  [14]  and/or  the  density  ma- 
trix  [15] — when  localized  in  finite  volumes  about 
each  atom — can  describe  the  electron  charge  den¬ 
sity,  kinetic  energy  density,  etc.  to  useful  accuracy. 
More  specifically,  in  most  of  the  approaches  the 
wave  functions  or  the  density  are  expanded  in 
localized  functions,  usually  atom-  or  bond- 
centered.  The  overlaps  are  then  set  to  zero  beyond 
a  certain  localization  radius,  and  the  size  of  this 
radius  determines  the  accuracy.  A  detailed  de¬ 
scription  of  the  methodology  is  beyond  the  scope 
of  this  work,  but  one  can  derive  variational  expres¬ 
sions  in  terms  of  these  functions  that  lead  to  sparse 
matrix  operations  and  0(N)  scaling.  One  should 
point  out  that  only  the  range  of  the  localized 
functions  is  predetermined,  their  shapes  change 
during  minimization.  In  the  wave-function-based 
approaches,  a  useful  analogy  is  to  think  of  these 
orbitals  as  generalized  Wannier  functions;  indeed, 
they  become  orthogonal  at  convergence.  A  recent 
review  article  provides  an  in-depth  description  of 
the  various  approaches  [50].  In  the  quantum 
chemistry  community  there  have  been  analogous 
developments  aimed  at  achieving  linear  scaling  in 
standard  packages,  which  use  Gaussian  functions. 
To  date,  linear  scaling  has  been  achieved  for  the 
Coulomb  part  [51,52],  the  exchange-correlation 
part  within  density  functional  theory  [53],  and 
most  recently  in  the  diagonalization  part  [54],  fol¬ 
lowing  the  Li-Nunes-Vanderbilt  density-based 
approach. 

We  have  implemented  a  real-space  self- 
consistent  version  of  the  Galli-Parrinello  func¬ 
tional  and  tested  it  on  64-  and  216-atom  supercells 
of  bulk  Si.  The  Galli-Parrinello  functional  is  varia¬ 
tional,  and  the  accuracy  of  the  calculations  de¬ 
pends  strongly  on  the  localization  radius.  In  Table 
V,  we  show  the  errors  per  atom  in  the  self- 
consistent  total  energy  as  a  function  of  the  radius. 
It  is  clear  from  the  results  that  fairly  large  radii 
encompassing  tens  of  atoms  are  necessary  for  an 
accurate  description.  We  are  currently  testing  other 
forms  of  0(N)  methods,  comparing  their  accuracy 
and  efficiency. 

SOLVATED  BIOMOLECLLES 

In  collaboration  with  L.  Pedersen  (UNC-Chapel 
Hill),  we  are  investigating  the  properties  of  a 


540 


VOL  65,  NO.  5 


LARGE-SCALE  ELECTRONIC  STRUCTURE  PROBLEMS 


Surface  {z  >  5.8  A) 
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FIGURE  9.  Local  density  of  states  In  units  of  states  per  electron  volt  per  atom  after  melting  in  the  top  layers  and  in  the 
deeper  ‘'bulklike”  region. 


TABLE  V _ 

Error  per  atom  due  to  localization  in  a 
self-consistent  Implementation  of  the  0{N) 
Gain  -  Parinello  functional.^ 


Localization  radius  (a.u.) 

Error  (eV  /  atom) 

5.5 

1.54 

6.0 

1.03 

6.5 

0.84 

7.5 

0.19 

8.0 

0.06 

12.5 

0.03 

®  See  text. 


protein  associated  with  Alzheimer's  disease,  the 
amyloid  jS-peptide  ^146^45^42^210-  LDA  Cah 
culations  were  performed  on  the  Cray-T3D  (128  or 
256  processors),  using  a  structure  obtained  from 
nuclear  magnetic  resonance  (NMR)  data  that  was 
refined  by  a  molecular-mechanics  force  field  pro¬ 
gram  (AMBER).  In  Figure  10  we  plot  the  electronic 
charge  density  of  various  parts  of  the  protein. 
Presently,  we  are  relaxing  the  system  quantum 
mechanically  in  the  presence  of  water,  which  is 
treated  by  a  classical  potential.  We  have  also  im¬ 
plemented  the  generalized  gradient  approximation 
to  DFT  (GGA)  [55],  which  is  known  to  improve  the 
description  of  hydrogen  bonding  as  well  as  the 
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FIGURE  10.  Three  views  of  the  electronic  charge 
density  of  the  amyloid  jS-peptide  associated  with 
Alzheimer’s  disease.  The  top  view  shows  the  entire 
molecule;  the  lower  two  views  are  close-ups  of  the  first 
quarter  and  second  third  of  the  molecule,  respectively. 


values  of  the  binding  energies.  In  tests  on  an 
amino  acid  (phenylalanine),  the  GGA  disassocia- 
tion  energy  was  lower  by  7%  than  the  LDA  value 
of  142.59  eV. 


Summary 

We  have  developed  a  methodology  for  perform¬ 
ing  large-scale  ab  initio  electronic  structure  calcu¬ 
lations  entirely  in  real  space.  By  using  highly 
efficient  multigrid  techniques  to  accelerate 
convergence  rates,  we  are  able  to  efficiently 
converge  even  difficult  ill-conditioned  systems, 
which  require  high-energy  cut-offs  or  large  length 
scales.  The  multigrid  method  is  readily  adaptable 
to  parallel  computer  architectures,  and  an  efficient 
parallel  implementation  was  developed.  The  de¬ 
tails  of  our  implementation,  which  include  new 
compact  discretization  schemes  in  real  space  for 
systems  with  cubic,  orthorhombic,  and  hexagonal 
symmetry,  and  new  multilevel  algorithms  for  the 
iterative  solution  of  Kohn-Sham  equations,  are 
described  in  Ref.  [18].  This  methodology  was  tested 
on  a  large  number  of  systems,  including  the 
molecule,  diamond.  Si  and  GaN  supercells,  and  ab 
initio  MD  simulations.  Further  applications  in¬ 


clude  a  simulation  of  surface  melting  of  Si,  struc¬ 
tural  properties  of  a  large  amyloid  /3  peptide 
(C146O45N42H210)  implicated  in  Alzheimer's  dis¬ 
ease,  and  electronic  and  structural  properties  of 
GaN  surfaces  and  In^Ga^.^N  quantum  wells.  The 
accuracy  of  the  method  was  also  tested  by  direct 
comparison  with  plane-wave  calculations,  when 
possible,  and  the  results  were  in  excellent  agree¬ 
ment  in  all  cases. 
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ABSTRACT:  Relativistic  correlation  energies  have  been  computed  for  the  closed-shell 
ground  states  of  Zn  and  Cd  and  their  singly  ionized  open-shell  ions  by  means  of  a 
recently  developed  relativistic  many-body  perturbation  theory  for  open-shell  multiplet 
states.  All  electrons  were  included  in  the  correlation  calculations.  Analytic  basis  sets  of 
Gaussian  spinors  are  employed  to  expand  the  upper  and  lower  components  of  the  Dirac 
four  spinors  in  closed-  and  open-shell  matrix  Dirac-Fock  and  relativistic  many-body 
perturbation  procedures.  The  ionization  energies  have  been  computed  for  the  atoms,  and 
the  effects  of  relativity,  electron  correlation,  and  the  nonadditive  interplay  of  these  effects 
on  the  ionization  energies  are  analyzed.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65: 
545-554,  1997 


Introduction 

In  heavy-atom  systems,  relativistic  and  electron 
correlation  effects  are  large  and  strongly  inter¬ 
twined  [1].  Therefore,  accurate  treatment  of 
many-electron  systems  makes  it  necessary  to  go 
beyond  the  independent  particle  model  to  include 
both  effects  simultaneously.  Fully  relativistic 
many-body  methods  for  atoms  and  molecules, 
which  simultaneously  account  for  both  effects,  are 
currently  being  developed  and  tested  by  several 
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groups  [2-16],  using  basis  sets  of  either  ''locaT'  or 
"global"  functions.  Three  many-body  techniques 
developed  from  nonrelativistic  electronic  structure 
theory  have  been  successful  in  recovering  correla¬ 
tion  energy  in  relativistic  treatments.  They  are 
relativistic  many-body  perturbation  theory  (MBPT) 
[2-6],  relativistic  configuration  interaction  (RCI) 
[7-12],  and  relativistic  coupled-cluster  (RCC)  the¬ 
ory  [13-16]. 

In  recent  studies  [6],  we  have  reported  a  state- 
specific  relativistic  MBPT  for  general  open-shell 
systems  in  which  our  single  Pock  operator  method 
[17]  is  employed  to  make  a  Moller-Plesset-type 
separation  of  the  relativistic  many-electron  Hamil¬ 
tonian.  Relativistic  correlation  energies  were  com- 
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puted  for  the  open-shell  ground  states  of  the  Group 
IB  and  IIIA  elements  up  to  Z  =  79  as  well  as  their 
singly  ionized  closed-shell  ions.  The  effects  of  rela¬ 
tivity  and  electron  correlation  and  the  nonadditive 
interplay  of  these  effects  on  their  ionization  ener¬ 
gies  (IE)  were  analyzed.  The  state-specific  MBPT 
involves  a  full  implementation  of  generalized 
Moller-Plesset  perturbation  theory  [18]  applied  to 
general  open-shell  reference  wave  functions. 

Relativistic  effects  are  known  to  be  important  in 
the  transition-metal  atoms.  The  differential  effects 
on  the  ionization  energies  of  the  first-row  transi¬ 
tion  metals  are  as  large  as  0.5  eV  in  some  cases  [19, 
20].  Therefore,  even  in  the  first-row  transition  met¬ 
als,  accurate  determination  of  electronic  structures 
requires  approaches  which  can  account  for  the 
nonadditive  interplay  of  relativity  and  electron 
correlation.  The  purpose  of  the  present  study  is  to 
report  on  the  application  of  our  relativistic  MBPT 
[6]  to  the  Group  IIB  elements  Zn  and  Cd  and  their 
singly  ionized  ions.  In  the  next  two  sections,  we 
outline  our  relativistic  MBPT  and  its  implementa¬ 
tion  with  analytic  basis  set  of  G  spinors  (G  for 
Gaussian  after  Grant  [21]).  In  the  last  section,  the 
results  of  matrix  Dirac-Fock  (DF)  self-consistent 
field  (SCF)  and  all-electron  relativistic  MBPT  cal¬ 
culations  on  Zn  and  Cd  and  their  singly  ionized 
ions  are  presented.  The  effects  of  relativity  and 
electron  correlation  and  the  nonadditive  interplay 
of  these  effects  on  their  IE  are  analyzed. 


Relativistic  Many-Body  Perturbation 
Theory  for  Open-Sheli  Systems 

In  c-number  theory,  the  effective  many-body 
Hamiltonian  most  commonly  used  in  relativistic 
atomic  and  moecular  structure  calculations  is  the 
no-pair  Dirac-Coulomb  (DC)  Hamiltonian  pre¬ 
scribed  by  Sucher  [22]  and  Mittleman  [23], 

=  E,./Zd(0  (  E1A,;V+.  (1) 

L+(l)L  +  (2) . . .  L  +  (n),  where  L^(i)  is  the  pro¬ 
jection  operator  onto  the  space  spanned  by  the 
positive-energy  eigenfunctions  of  the  matrix  DF 
SCF  equation  [17].  The  projection  operator,  take 
into  accouint  the  field-theoretic  condition  that  the 
negative-energy  states  are  filled  and  causes  the 
projected  DC  Hamiltonian  to  have  normalizable 
bound-state  solutions.  This  approach  is  called  the 
no-pair  approximation  [22]  because  virtual  elec¬ 


tron-positron  pairs  are  not  permitted  in  the  inter¬ 
mediate  states,  is  a  Dirac  one-electron  Hamilto¬ 
nian: 

h^(i)  =  ca,  •  p;  +  ( /3  -  l)c^  +  Ku.-Cr,).  (2) 

Here  a  and  p  are  the  4x4  Dirac  vector  and 
scalar  matrices,  respectively.  is  the  nuclear 
potential,  which  for  each  nucleus  takes  the  form 

Vnuc=  -Z/r,  r>R, 

=  -iZ/2R)(3  -  r  <  R.  (3) 

The  nuclei  are  modeled  as  spheres  of  uniform 
proton-charge  distribution,  Z  the  charge  of  nu¬ 
cleus.  R  is  the  radius  of  that  nucleus  and  is  related 
to  the  atomic  mass,  A,  by  R  =  2.2677  X  10“^ 
Adding  the  frequency-independent  Breit  interac¬ 
tion, 

Bi2  =  -l/2[ai-a2  +  ioL^' r^2yrh]/ri2> 

(4) 

to  the  electron-electron  Coulomb  interaction,  in 
Coulomb  gauge,  results  in  the  Coulomb-Breit  po¬ 
tential  which  is  correct  to  order  [22,  23].  Addi¬ 
tion  of  the  Breit  term  yields  the  no-pair 
Dirac-Coulomb-Breit  (DCB)  Hamiltonian  [22,  23] 

=  E/*d(0  +-2’+{E(1A,7  +  (5) 

i  1  ij  ' 

which  is  approximately  Lorentz  covariant  and  in¬ 
creases  the  accuracy  of  calculated  fine-structure 
splittings  and  inner-electron  binding  energies. 
Higher-order  quantum  electrodynamics  (QED)  ef¬ 
fects  appear  first  in  order  and  are  omitted. 

The  N-electron  Hamiltonian,  H,  is  partitioned 
into  a  model  Hamiltonian  Hq  and  a  perturbation 
y,  such  that  H  =  Hq  +  y.  The  zero-order  Hamil¬ 
tonian  Hq  is  arbitrary  but  should  be  chosen  as 
close  to  the  full  Hammiltonian  H  as  possible  so 
that  the  perturbation  series  converges  rapidly  in 
low  order.  The  zero-order  Hamiltonian  is  usually 
chosen  to  be  a  sum  of  effective  one-electron  opera¬ 
tors  (Moller-Plesset  partitioning  [24]).  In  that  form, 
all  perturbation  corrections  describe  electron  corre¬ 
lation.  For  closed-shell  systems,  the  best  results 
have  been  obtained  with  Moller-Plesset  partition¬ 
ing.  In  the  single  Fock  operator  method  of  Koc  and 
Ishikawa  [17]  relativistic  MBPT  for  general  open- 
shell  multiplet  states  of  atoms  and  molecules  re¬ 
tains  the  essential  aspects  of  Moller-Plesset  per- 
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turbation  theory.  The  theory  providees  a  hierarchy 
of  well-defined  algorithms  that  allow  one  to  calcu¬ 
late  relativistic  correlation  corrections  in  nonitera¬ 
tive  steps  and,  in  low  order,  yields  a  large  fraction 
of  the  dynamical  correlation.  The  no-pair  DC 
Hamiltonian  is 

=  Ho  +  V,  (6) 

where  the  unperturbed  Hamiltonian  Hq  is  a  sum 
of  single  DF  SCF  Fock  operators  Rq  [17]  for  closed- 
or  open-shells: 

Ho=I:Ro(0  and  V  =  Rod) ,  (7) 

i  i 

and  [6, 17] 

KeO) 

Here  the  mean-field  operators,  and  and  the 
projection  operators,  and  have  been 

defined  in  previous  work  [6,  17].  In  this  form  of 
partitioning,  all  perturbation  corrections  describe 
relativistic  electron  correlation  including  the  cross 
contribution  between  relativistic  and  correlation 
effects. 

The  exact  many-electron  wave  function  is  ex¬ 
panded  in  configuration  state  functions  (CSFs) 
/  =  1,2, ...}  generated  by  single,  dou¬ 
ble,  triple,  etc.  excitations  from  the  reference  CSF, 
Oo(7o/^)-  Individual  CSFs  are  eigenfunctions  of 
the  total  angular  momentum  and  parity  operators 
and  are  linear  combinations  of  antisymmetrized 
products  of  positive-energy  SCF  spinors.  The  SCF 
spinors  are  mutually  orthogonal,  so  the  CSFs 
{^j(%/P);  i  -  0, 1, 2, ... }  are  also  mutually  orthog¬ 
onal.  The  unperturbed  Hamiltonian  is  diagonal  in 
this  space: 

Ho  =  (8) 

i 

SO  that 

Hol$,(r,/P))  =  Eri<D,.(r,/P)>  (i  =  0, 1, 2, ... ) 

(9) 

is  a  sum  of  the  SCF  one-electron  energies  e^: 

(10) 

where  the  occupation  number  =  2/^  -h  1  and 
n^[^i]  is  the  occupancy  of  the  qth  spinor  in 

<^,(%/p)- 


Application  of  Rayleigh-Schrodinger  perturba¬ 
tion  theory  provides  order-by-order  expressions  of 
the  perturbation  series  for  the  state  approximated 

by  I  [bj: 

^  =  +  +  (11) 
E  =  E^°^  -H  +  E<^>  +  E‘^>  -I-  ••• ,  (12) 

where 

a>P>(/P)  =  L  <t>,(yJP)[v,o(^o  -  (13) 

/-I 

Ef  E(P  =  Escf,  (14) 

E<2)=  i:nVio(£o -£,)■'.  (15) 

!=1 

E(^)  =  L  Vo,W,jVfo(Eo  -  E,y\Eo  -  EjVK 

(16) 

Here 

Vij  =  <‘&,(rJP)|17|(F/y^/P)> 

and 

Wij  =  {^i(yiJP)\W  l^iiyj  JP )  >  with  W=V-EP\ 

In  this  form,  all  the  perturbation  corrections  be¬ 
yond  first-order  describe  relativistic  electron  corre¬ 
lation.  Summation  over  the  CSFs  in  Eqs.  (13) 
through  (16)  are  restricted  to  CSFs  constructed 
from  the  positive-energy  branch  of  the  spinors, 
which  effectively  incorporates  in  the  computa¬ 
tional  scheme  the  "no-pair"  projection  operator 
of  the  DC  and  DCB  Hamiltonians.  Negative 
energy  states,  as  part  of  the  complete  set  of  states, 
play  a  role  in  many-body  calculations.  However, 
contributions  from  the  negative  energy  states  due 
to  creation  of  virtual  electron-positron  pairs  are 
small  [22],  of  the  order  a^,  and  are  neglected  in  the 
present  study.  Neglecting  interactions  with  the 
filled  negative-energy  sea,  i.e.,  neglecting  virtual 
electron-positron  pairs  in  summing  the  MBPT  dia¬ 
grams,  we  have  a  straightforward  extension  of 
nonrelativistic  MBPT.  In  generalized  Moller-Ples- 
set  formalism,  the  CSFs  i  0},  gener¬ 

ated  by  single  excitations  from  the  reference  CSF, 
do  not  contribute  to  the  second-  and  third-order 
energies  because  the  singly  excited  CSFs,  ^^^{yJP), 
generated  by  the  SCF  spinors,  satisfy  the  general¬ 
ized  Brillouin  theorem,  (^i(yJP)\H^^\^QiyQ}P)} 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


547 


ISHIKAWA  AND  KOC 


=  0.  Thus  ViQ  =  0.  CSFs  generated  by  excitations 
higher  than  double  also  do  not  contribute  to  the 
second-  and  third-order  energies,  because  for  them 
=  0  and  =  0. 

Electron  correlation  effects  are  corrections  to  the 
independent  electron  approximation,  and  their  im¬ 
portance  depends  on  the  choice  of  the  approxima¬ 
tion  in  which  they  are  calculated.  In  nonrelativistic 
many-body  calculations,  the  correlation  energy, 
is  defined  as  the  difference  between  the  exact 
eigenvalue  of  the  N-electron  Schrodinger  Hamilto¬ 
nian,  and  the  nonrelativistic  single-config¬ 

uration  Hartree-Fock  (HF)  energy,  E^p  [25]: 

pcorr  _  pexact  _  p 

^NR  ~  ^NR  ^HF'  Vl/; 

The  nonrelativistic  definition,  Eq.  (17),  is  based  on 
the  implicit  assumption  that  it  is  easy  to  separate 
relativistic  effects  from  an  "exact"  value  of  the 
total  energy  which  contains  both  correlation  and 
relativistic  effects  [25]. 

We  employ  in  our  MBPT  calculations  two  dif¬ 
ferent  effective  N-electron  Hamiltonians,  the  no¬ 
pair  DC  Hamiltonian,  Eq,  (1),  and  the  no-pair  DCB 
Hamiltonian,  Eq.  (5).  With  respect  to  the  two  effec¬ 
tive  many-body  Hamiltonians,  we  can  define  DCB 
correlation  energy  (Egcp  as  the  difference  be¬ 
tween  the  exact  eigenvalue  Eq^r  of  the  DCB 
Hamiltonian,  and  the  DF  SCF  energy  (Es^p)/ 
and  DC  correlation  energy  (E^^O  as  the  difference 
between  the  exact  eigenvalue  (E^^^O  of  the  DC 
Hamiltonian,  and  the  DF  SCF  energy 
[5,  6]: 

£corr  _  pexact  _  p  riQ'i 

DCB  ”  ^DCB  ^SCF/ 

pcorr  _  pexact  _  p 

^DC  “  ^DC  %CF- 

The  difference  between  the  DCB  correlation  and 
the  DC  correlation,  E^cg  -  E^^^  is  the  relativistic 
many-body  shift  due  to  the  low-frequency  Breit 
interaction  [5,  6]. 


Computation 

The  four-component  atomic  spinor,  is  la¬ 

beled  by  symmetry  species  k  and  subspecies  m, 
and 


r  e ,  <i>)  =  - 


P„K(r)x^je,cl,) 


(20) 


with  P„^(r)  and  Q„^(r)  the  large  and  small  radial 
components,  respectively,  of  the  wave  function. 


The  four  spinors  are  assumed  to  be  orthonormal. 
In  the  matrix  DF  scheme,  the  radial  functions  are 
expanded  in  basis  sets  of  analytic  functions: 

PnM')  =  (21) 

i 

and 

QnSr)=  (22) 

i 

where  {C,^^  J  and  are  expansion  coefficients 

for  spinors  of  symmetry  k,  and  {gj^fir)}  and  {gf/(r)} 
are  the  large  and  small  component  basis  sets,  re¬ 
spectively. 

The  large  radial  component  is  expanded  in  a  set 
of  Gaussian-type  functions  (GTF)  [5,  6,  9,  11,  12, 
14,  15,  17]: 


=  A^;r"f''’exp(-4,r2)  (23) 

with  n[  k]  =  -  K  for  K  <  0,  and  n[  k]  =  k  +  1  for 
K  >  0,  is  the  normalization  constant.  The  small 
component  basis  set  is  constructed  to  sat¬ 

isfy  the  boundary  condition  associated  with  the 
finite  nucleus  of  uniform  proton  charge  distribu¬ 
tion  [1,  5,  6, 11, 15].  With  the  finite  nucleus,  GTF  of 
integer  power  of  r  are  especially  appropriate  basis 
functions  because  the  finite  nuclear  boundary  re¬ 
sults  in  a  solution  which  is  Gaussian  at  the  origin 
[1,  5,  6].  Basis  functions  which  satisfy  the  nuclear 
boundary  conditions  are  also  automatically  kineti- 
cally  balanced  [1,  5,  12].  Imposition  of  the  bound¬ 
ary  conditions  results  in  particularly  simple  forms 
with  spherical  G  spinors  [1,  5,  6]. 

Matrix  DF  SCF  and  second-order  relativistic 
MBPT  calculations  based  on  the  no-pair 
DiraC“Coulomb  Hamiltonian  on  the  closed-shell 
ground  state  of  Group  IIB  atoms,  Zn  and  Cd, 
and  the  open-shell  ground  state  of  the  singly 
ionized  Zn^  and  Cd^  ions  employed  large  well- 
tempered  basis  sets  [26]  of  G  spinors.  All  electrons 
were  included  in  the  relativistic  MBPT  procedure 
to  study  the  effects  of  relativity  on  electron  correla¬ 
tion  as  well  as  the  overall  relativity  and  cor¬ 
relation  contributions  to  the  IE  of  the  Group 
IIB  atoms.  The  order  of  the  partial-wave  expan¬ 
sion  the  highest  angular  momentum  of 

the  spinors  included  in  the  virtual  space,  was 
Lmax  =  5  throughout  the  study.  The  basis  sets 
were  20sl8  pl3dll/10gl0E  for  zinc  and 
23s22pl6dl4fl2gl2h  for  cadmium.  The  well-tem¬ 
pered  exponents  were  taken  from  the  work  of 
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Huzinaga  and  Klobukowski  [26].  Table  I  shows  the 
well-tempered  basis  set  employed  in  the  SCF  and 
MBPT  calculations  on  the  neutral  zinc  atom  and 
singly  ionized  zinc  ion. 

All-electron  relativistic  MBPT  calculations  were 
also  performed  that  included  the  frequency-inde¬ 
pendent  Breit  interaction  in  the  first  and  second 
orders  of  perturbation  theory.  These  calculations 
were  based  on  the  no-pair  Dirac-Coulomb-Breit 
Hamiltonian,  The  speed  of  light  was  taken 

to  be  137.0359895  a.u.  The  nonrelativistic  limit  was 
simulated  by  setting  the  speed  of  light  to  10^  a.u. 
The  nuclei  were  again  modeled  as  spheres  of  uni¬ 
form  proton  charge. 


Results  and  Discussion 

Table  II  displays  the  DF  SCF  energies  Escf/ 
second-order  Dirac-Coulomb  correlation  energies, 
and  first-  and  second-order  Breit  interaction 
energies,  and  for  ground-state  zinc,  Zn^, 
and  Zn^'^  in  increasing  order  of  partial-wave  ex¬ 
pansion.  In  each  entry,  the  order  of  the  partial-wave 

TABLE  I _ 

Well-tempered  exponents  of  the 
20s18p13d11f10g10h  G-spinor  basis  set 
employed  in  the  DF  SCF  and  RMBPT  calculations 
ofZn°  and  Zn^L 


Symmetry 


Exponent 

Sl/2 

Pl/2,3/2 

^3/2, 5/2 

^5/2, 7/2 

97/2,9/2 

^9/2,11/2 

4250042.1 

X 

663740.70 

X 

144800.18 

X 

X 

38457.241 

X 

X 

11633.475 

X 

X 

3957.4984 

X 

X 

1508.8936 

X 

X 

624.94155 

X 

X 

X 

272.06944 

X 

X 

X 

X 

121.77099 

X 

X 

X 

X 

X 

X 

54.721784 

X 

X 

X 

X 

X 

X 

24.885485 

X 

X 

X 

X 

X 

X 

1 1 .536923 

X 

X 

X 

X 

X 

X 

5.2194206 

X 

X 

X 

X 

X 

X 

2.3421989 

X 

X 

X 

X 

X 

X 

1.0625147 

X 

X 

X 

X 

X 

X 

0.46920953 

X 

X 

X 

X 

X 

X 

0.18025741 

X 

X 

X 

X 

X 

X 

0.08726504 

X 

X 

X 

X 

X 

X 

0.04021766 

X 

X 

X 

expansion,  the  DF  SCF  energy,  and  the  relativistic 
MBPT  energies  are  given  in  the  first  row.  The 
values  in  parentheses  in  the  second  row  are  the 
nonrelativistic  energies  computed  by  setting  c  = 
10\ 

In  the  third  column  of  Table  II,  we  present  the 
closed-shell  DF  SCF  energies  for  the  ground 
^So(3d^°4s^)  state  of  Zn®  as  well  as  the  open-shell 
DF  SCF  energies  for  the  ground  state 

of  Zn^^.  The  fourth  column  of  the  table  gives  the 
second-order  Dirac-Coulomb  correlation  energies 
E^^  in  increasing  order  of  partial-wave  expansion. 
The  MBPT  calculations  were  also  performed  with 
the  frequency-independent  Breit  interaction  in¬ 
cluded.  The  Breit  interaction  is  introduced  into  the 
no-pair  DC  Hamiltonian,  by  adding  the 

frequency-independent  Breit  interaction,  Eq.  (4),  to 
the  instantaneous  Coulomb  interaction  1  /r^^.  In  the 
fifth  column  we  present  the  first-order  Breit  in¬ 
teraction  energies,  B^^\  In  the  sixth  column  the 
second-order  correlation  corrections  to  the  Breit 
interaction,  B^^\  are  given  in  increasing  order  of 
partial-wave  expansion.  The  B^^^  are  computed  as 
the  difference  between  the  second-order  correla¬ 
tion  correction  evaluated  with  the  inclusion  of  B^ 
in  the  effective  electron-electron  interaction  and 
the  second-order  DC  correlation  correction,  E§^. 

in  correlation  energies  is  the  relativistic 
many-body  shift  that  arises  from  incorporation  of 
the  frequency-independent  Breit  interaction  into 
the  effective  two-body  interaction. 

The  DF  energies,  —1794.61114  a.u.,  for  the 
ground  state  Zn®  atom,  computed  with  a  basis 
of  20sl8pl3rf  G  spinors  agree  well  with  the  corre¬ 
sponding  numerical  DF  limit-1794.62  a.u.  obtained 
with  the  numerical  finite  difference  DF  program  of 
Desclaux  [27].  The  difference  between  relativistic 
DF  SCF  and  its  nonrelativistic  limit  for  Zn°,  i.e., 
the  relativistic  energy  lowering  at  the  SCF  level,  is 
16.76676  a.u. 

Table  II  also  shows  the  convergence  patterns  of 
the  perturbation  series  in  increasing  partial-wave 
expansion.  Because  large  G-spinor  basis  sets  were 
employed  to  saturate  the  occupied  and  virtual 
spaces  in  each  symmetry,  the  computed  correla¬ 
tion  energies  have  converged  to  at  least  four  sig¬ 
nificant  figures  in  each  order  of  the  partial-wave 
expansion.  The  convergence  rates  of  the  second- 
order  Coulomb  correlation  correction  with  respect 
to  partial-wave  expansion  are  approximately  the 
same  for  relativistic  and  nonrelativistic  MBPT.  The 
difference  between  the  second-order  Coulomb  cor¬ 
relation  correction  at  c  ==  137.0359895  and  its 
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TABLE  II 


DF  SCF  and  MBPT  energies  (a.u.)  of  the  ground-state  Zn® 
increasing  partial-wave  expansion.^ 

CSq)  and  Zn"^^ 

(^S^/g)  in  basis  sets  of 

^max 

^SCF 

B(1) 

B(2) 

^tot 

2  Zn® 

-1794.61114 

(-1777.84438) 

-0.96905 

(-0.95893) 

0.76145 

-0.02037 

-1794.83911 

(-1778.80331) 

-1794.32478 

(-1777.56381) 

-0.93013 

(-0.92134) 

0.76134 

-0.02036 

-1794.51393 

(-1778.48515) 

O 

C 

N 

CO 

-1794.61114 

(-1777.84438) 

-1.42251 

(-1.41104) 

0.76145 

-0.02342 

-1795.29562 

(-1779.25542) 

Zn+1 

-1794.32478 

(-1777.56381) 

- 1 .36893 
(-1.35949) 

0.76134 

-0.02341 

-1794.95578 

(-1778.92330) 

4  Zn° 

-1794.61114 

(-1777.84438) 

- 1 .56782 
(-1.55679) 

0.76145 

-0.02447 

-1795.44198 

(-1779.40117) 

Zn+^ 

-1794.32478 

(-1777.56381) 

-1.51246 

(-1.50351) 

0.76134 

-0.02446 

-1795.10036 

(-1779.06732) 

5  Zn° 

-1794.61114 

(-1777.84438) 

- 1 .62873 
(-1.61799) 

0.76145 

-0.02502 

-1795.50344 

(-1779.46237) 

Zn+^ 

-1794.32478 

(-1777.56381) 

- 1 .57272 
(-1.56418) 

0.76134 

-0.02501 

-1795.16117 

(-1779.12799) 

®The  values  in  parentheses  are  nonrelativistic  energies  computed  by  setting  c  =  TO"^. 


nonrelativistic  limit  (c  =  10^)  is  the  relativistic  DC 
many-body  shift  that  arises  from  the  use  of  rela¬ 
tivistic  single-particle  states  and  the  instantaneous 
Coulomb  two-body  interaction.  The  relativistic  DC 
many-body  shifts  are  relatively  small  for  Zn^  and 
Zn^-',  0.01074  a.u.  (0.292  eV)  for  Zn^  and  0.00854 
a.u.  (0.232  eV)  for  Zn^"^,  evaluated  at  =  5. 
However,  the  effect  on  the  IE  of  Zn^  of  the  many- 
body  shifts  is  significant  because  these  values  are 
noticeably  different  between  Zn^  and  Zn^"^.  For 
heavier  systems,  Au  and  Hg,  the  relativistic  DC 
many-body  shifts  are  an  order  of  magnitude  larger 
than  that  in  Zn  [5,  15]. 

We  have  summarized  in  Table  III  the  IE  for  the 
ground  state  of  the  zinc  atom  obtained  in 


TABLE  III _ 

Computed  ionization  energy  (eV)  of 
Zn°  in  increasing  partial-wave  expansion. 


^max 

Ionization  energy  (eV) 

Relativistic 

Nonrelativistic® 

2 

8.849 

8.658 

3 

9.248 

9.037 

4 

9.296 

9.085 

5 

9.314 

9.099 

Experiment*^ 

9.39 

®  Nonrelativistic  limit  (c  =  10^^). 
^  Experiment  (Ref  [28]). 


partial-wave  expansions  up  to  ^  5.  These  lEs 
were  computed  by  subtracting  the  total  energy  of 
Zn®,  from  the  last  column  of  Table  II,  from  that  of 
Zn^"^  in  each  order  of  the  partial-wave  expansion. 
The  table  illustrates  the  convergence  pattern  of  the 
computed  IE  of  Zn®  in  increasing  partial-wave 
expansion.  For  comparison,  we  also  list  in  the  last 
row  the  experimental  IE  [28].  As  the  order  of 
partial-wave  expansion  increases,  the  computed 
IE  smoothly  converges  to  approximately  9.32  eV. 
The  experimental  value  is  9.39  eV  [28].  The  IE, 
9.314  eV,  computed  by  relativistic  MBPT  with 
the  partial-wave  expansion  =  5,  has  con¬ 
verged  to  within  0.01  eV  of  the  extrapolated  value 
(Lmax  ^  iri  the  partial-wave  expansion.  The 
residual  discrepancy  of  0.07  eV  between  theory 
and  experiment  is  due  primarily  to  the  truncation 
of  MBPT  at  second  order.  The  all-electron  third- 
order  correlation  corrections  for  Zn®  and  Zn^  are 
much  more  computationally  demanding  than  the 
second-order,  and  are  therefore  neglected  in  the 
present  study.  The  IE  computed  at  the  nonrela¬ 
tivistic  limit,  9.099  eV,  underestimates  the  experi¬ 
mental  value  by  0.3  eV  because  of  the  neglect  of 
relativistic  effects. 

When  one  compares  experimental  lEs  of  transi¬ 
tion  metals  to  those  computed  by  nonrelativistic 
quantum  chemical  methods,  one  must  correct  for 
relativistic  contributions.  Martin  and  Hay  [19]  have 
calculated  relativistic  contributions  to  the  experi- 
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mental  ionization  energies  of  transition  metals  at 
the  DF  level,  assuming  that  electron  correlation 
does  not  change  the  relativistic  correction. 
Raghavachari  and  Trucks  [20]  modified  the  experi¬ 
mental  IBs  of  transition  metal  atoms  Sc  through  Zn 
by  subtracting  the  relativistic  corrections  com¬ 
puted  by  Martin  and  Hay  [19]  from  the  experi¬ 
mental  values.  Their  corrected  ''nonrelativistic"  IE 
for  Zn®  is  9.23  eV,  in  good  agreement  with  their 
estimate,  9.13  eV,  computed  by  quadratic  configu¬ 
ration  interaction  with  single  and  double  substitu¬ 
tions  and  perturbative  triples  (QCISD(T)).  The  cor¬ 
rected  nonrelativistic  IE,  however,  does  not  take 
into  account  the  relativistic  many-body  shift.  When 
this  correction  is  accounted  for,  the  agreement 
between  them  improves  dramatically  (see  below). 

Table  IV  displays  the  contribution  at  each  order 
of  perturbation  theory  to  the  ionization  energy  of 
Zn®  computed  at  =  5.  These  contributions 
were  computed  by  subtracting  the  energy  of  Zn® 
from  that  of  Zn^  ^  in  each  order  of  perturbation 
theory.  Relativistic  and  nonrelativistic  results  are 
given,  respectively,  in  the  first  and  second  rows. 
The  third  row  gives,  for  each  order,  the  difference 
between  the  relativistic  and  nonrelativistic  contri¬ 
butions  given,  respectively,  in  the  first  and  second 
rows.  The  second  column  gives  the  zero-order 
contribution,  AEg^^P,  to  the  ionization  energy,  rela¬ 
tivistic  and  nonrelativistic,  and  their  difference. 
The  relativistic  zero-order  contribution  7.7923  eV, 
the  so-called  ASCF  approximation  to  the  IE,  was 
computed  by  subtracting  the  DF  SCF  energy  of 
Zn®  from  that  of  Zn^"^.  The  nonrelativistic  zero- 
order  contribution,  7.6347  eV,  was  computed  by 
subtracting  the  nonrelativistic  SCF  energy  of  Zn® 
from  that  of  Zn^"^.  Both  relativistic  and  nonrela¬ 
tivistic  ASCF  approaches  significantly  underesti¬ 
mate  the  ionization  energy.  Numerous  nonrela¬ 


tivistic  and  relativistic  SCF  calculations  have 
demonstrated  that  single-configuration  SCF  yields 
IE  which  are  significantly  lower  than  experimental, 
indicating  that  electron  correlation  effects  are  im¬ 
portant.  When  the  Coulomb  correlation  correction, 
AEg^  =  1.5241  eV,  the  difference  in  the  second- 
order  DC  correlation  energies  between  Zn^'^  and 
Zn®  is  taken  into  account,  the  computed  IE  dra¬ 
matically  improves.  The  effects  of  Breit  interaction 
corrections  on  the  computed  IE,  —0.00299  and 
0.00027  eV,  respectively,  for  AB^^^  and  AB^^^  on 
the  valence-shell  electron  IE  of  Zn®,  are  negligible. 
The  Breit  interaction  is  a  short-range  interaction 
that  primarily  affects  the  inner-shell 
electrons  [5,  6]. 

Referring  to  the  third  row  of  Table  IV,  the 
largest  relativistic  correction  to  the  nonrelativistic 
contribution  to  the  ionization  energy,  0.1576  eV, 
comes  from  the  relativistic  DF  SCF.  In  an  earlier 
study  [19],  Martin  and  Hay  estimated  this  correc¬ 
tion  to  be  0.16  eV,  in  agreement  with  our  result. 
The  effect  of  electron  correlation  on  the  relativistic 
corrections,  i.e.,  the  relativistic  many-body  shift, 
has  not  hitherto  been  available  for  the  transition 
metals.  The  difference  in  relativistic  many-body 
shifts  between  Zn^^  and  Zn®  (0.0599  eV)  given  in 
the  third  row  of  Table  IV,  which  directly  con¬ 
tributes  to  the  IE,  is  approximately  |  of  the  rela¬ 
tivistic  SCF  contribution  (0.1576  eV).  The  differen¬ 
tial  effect  of  relativistic  DC  correlation  on  the  cal¬ 
culated  relativistic  corrections  to  the  ionization  en¬ 
ergy  is  thus  nonnegligible.  The  overall  relativistic 
correction  to  the  IE  of  the  zinc  atom  is  0.215  eV. 
When  this  correction  is  subtracted  from  the  experi¬ 
mental  IE,  one  obtains  9.17  eV  for  the  corrected 
nonrelativistic  IE  for  Zn,  in  excellent  agreement 
with  the  estimated  nonrelativistic  IE  of  9.13  eV 
obtained  by  Raghavachari  and  Trucks  [20].  The 


TABLE  IV _ 

Contributions  from  each  order  of  perturbation  theory  to  the  ionization  energy  (in  eV)  of 
ground  state  Zn®  at  =  5. 


E(Zn  +  ^)-£(Zn°) 

A^scf 

Ae<2) 

Total  (IE‘’) 

Relativistic 

7.7923 

1 .5241 

-0.00299 

0.00027 

9.314 

Nonrelativistic 

7.6347 

1 .4642 

0.0 

0.0 

9.099 

□iff.  (Rel  -  Nonrel) 

0.1576 

0.0599 

-0.00299 

0.00027 

0.215 

(0.16)^ 


®  Martin  and  Hay  (Ref.  [19]). 
^  Ionization  energy. 
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residual  discrepancy  of  ~  0.04  eV  is  primarily  due 
to  the  truncation  of  partial  wave  to  =  3  in  the 
nonrelativistic  QCISD(T)  calculations. 

Table  V  displays  the  DF  SCF  energies  Fscf/ 
second-order  DC  correlation  energies  and 

first-  and  second-order  Breit  interaction  energies, 
and  for  the  ground  ^Sq  state  of  the  cad¬ 
mium  atom  and  its  singly  ionized  Cd^"^  ion  ob¬ 
tained  at  =  5.  The  relativistic  many-body 
shift,  B^^\  increases  dramatically  from  Zn  to  Cd.  In 
going  from  Zn  to  Cd,  the  first-  and  second-order 
Breit  interaction  energies  increase  four-  to  fivefold, 
and  the  Breit  interaction  results  in  a  significant 
modification  of  relativistic  many-body  effects. 
However,  B^^^  and  B^^^  are  nearly  identical  for  the 
neutral  atoms  and  their  singly  ionized  ions,  and 
thus  the  valence-shell  electron  IE  is  almost  entirely 
accounted  for  by  the  differences  in  DF  energies, 
Escf/  srid  the  Coulomb  correlation  energies 
Such  is  not  the  case  with  the  elements  of  the  fifth 
and  higher  rows,  nor  with  highly  ionized  species 
[5,  6].  In  highly  ionized  ions,  B^^^  is  comparable  in 
magnitude  to  the  second-order  Coulomb  correla¬ 
tion  correction.  The  differences  in  B^^\  B^^\  and 
^DC  between  an  ion  and  +  tend  to 

affect  the  IE  by  similar  amounts  [5,  6]. 

As  with  Zn  the  large  basis  sets  result  in  com¬ 
puted  correlation  energies  that  have  converged  to 
at  least  four  significant  figures  in  each  order  of  the 
partial-wave  expansion.  The  convergence  rates  of 
the  second-order  Coulomb  correlation  correction 
with  respect  to  partial-wave  expansion  are  approx¬ 
imately  the  same  for  relativistic  and  nonrelativistic 
MBPT,  The  relativistic  DC  mnay-body  shifts  are 
relatively  small  for  Cd®  and  Cd^"^,  i.e.,  0.03530  a.u. 
(=  0.961  eV)  for  Cd®  and  0.02984  a.u.  (=  0.812  eV) 
for  Cd^'^  evaluated  at  =  5.  However,  as  with 
zinc,  the  effect  on  the  IE  of  Cd®  of  the  many-body 
shifts  is  significant  because  these  values  are  notice¬ 
ably  different  between  Cd®  and  Cd^"^. 


Table  VI  displays  the  contribution  from  each 
order  of  perturbation  theory  to  the  ionization  en¬ 
ergy  of  Cd®  computed  at  =  5.  The  table  was 
constructed  in  a  similar  manner  to  Table  IV.  Rela¬ 
tivistic  and  nonrelativistic  results  are  given  in  the 
first  and  second  rows,  and  the  third  row  gives  the 
difference  between  relativistic  and  nonrelativistic 
contributions.  The  second  column  gives  the  zero- 
order  contribution,  AEg^F/  to  the  ionization  energy 
computed  both  relativistically  and  nonrelativisti- 
cally  and  their  difference.  The  ASCF  approxima¬ 
tion  for  IE,  7.3544  eV,  was  computed  by  subtract¬ 
ing  the  DF  SCF  energy  of  Cd®  from  that  of  Cd^"^. 
The  nonrelativistic  zero-order  contribution  6.9264 
eV  was  computed  by  subtracting  the  nonrelativis¬ 
tic  SCF  energy  of  Cd®  from  that  of  Cd^"^.  As  in  the 
case  of  Zn,  both  relativistic  and  nonrelativistic 
ASCF  approaches  underestimate  the  ionization  en¬ 
ergy.  When  the  Coulomb  correlation  correction 
AE^^  =  1.7309  eV,  the  difference  in  the  second- 
order  DC  correlation  energies  between  Cd^"^  and 
Cd®,  is  taken  into  account,  the  computed  IE  im¬ 
proves.  The  differential  effect  of  relativistic  DC 
correlation  on  the  calculated  relativistic  corrections 
to  the  ionization  energy  is  nonnegligible.  On  the 
other  hand,  the  effects  of  Breit  interaction  correc¬ 
tions  on  the  computed  IE,  -0.00490  and  0.00027 
eV,  respectively,  for  AB^^^  and  AB^^^,  on  the 
valence-shell  electron  IE  of  Cu®  are  still  negligible 
in  Cd.  The  IE  at  the  nonrelativistic  limit,  8.509  eV, 
underestimates  the  experimental  value  by  0.48  eV 
because  of  the  neglect  of  relativistic  effects.  Refer¬ 
ring  to  the  third  row  of  Table  VI,  the  largest 
relativistic  correction  to  the  nonrelativistic  contri¬ 
bution  to  the  ionization  energy,  0.4280  eV,  comes 
from  the  relativistic  DF  SCF.  The  difference  in 
relativistic  many-body  shifts  between  Cd^^  and 
Cd®  (0.1486  eV)  given  in  the  third  row  of  Table  VI, 
which  directly  contributes  to  the  IE,  is  approxi¬ 
mately  I  of  the  relativistic  SCF  contribution  (0.4280 


TABLE  V _ 

DF  SCF  and  MBPT  energies  (a.u.)  of  the  ground-state  Cd®  CSq)  and  Cd"^^  (^^1/2)  computed  with 
partial-wave  expansion,  =  5.® 


^SCF 

B(1) 

e(2) 

^tot 

0 

Q. 

0 

-5593.32183 

-2.57354 

3.84216 

-0.08161 

-5592.13482 

(-5465.09882) 

(-2.53824) 

(-5467.63706) 

Cd-"^ 

-5593.05156 

-2.50993 

3.84198 

-0.08160 

-5591.80111 

(-5464.84428) 

(-2.48009) 

(-5467.32437) 

^The  values  In  parentheses  are  nonrelativistic  energies  computed  by  setting  0  =  10"^. 
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TABLE  VI 


Contributions  from  each  order  of  perturbation  theory  to  the  ionization  energy  of  the  ground  state 
Cd°  at  =  5  (in  eV). 


E(Cd-"^)-£(Cd°) 

^^SCF 

AB<^> 

Ae<2) 

Total  (lE^) 

Relativistic 

7.3544 

1 .7309 

-0.0049 

0.00027 

9.081 

Nonrelativistic 

6.9264 

1 .5823 

0.0 

0.0 

8.509 

□iff.  (Rel  -  Nonre!) 

0.4280 

0.1486 

-0.0049 

0.00027 

0.572 

®  Ionization  energy 


eV).  The  differential  effect  of  relativistic  DC  corre¬ 
lation  on  the  calculated  relativistic  corrections  to 
the  ionization  energy  is  thus  nonnegligible  and 
larger  than  that  in  Zn.  The  overall  relativistic  cor¬ 
rection  to  the  IE  of  the  cadmium  atom  is  0.572  eV. 

We  have  summarized  in  Table  VII  the  com¬ 
puted  lEs  for  Zn  and  Cd.  In  the  third  column,  the 
lEs  computed  in  previous  nonrelativistic  and  rela¬ 
tivistic  correlated  calculations  are  shown.  The  ex¬ 
perimental  lEs  are  given  in  the  last  column.  The 
lEs  computed  in  this  study  agree  with  experiment. 
The  discrepancies  between  the  computed  and  ob¬ 
served  lEs  are  within  0,1  eV  for  both  Zn  and  Cd. 
The  bulk  of  the  error  is  due  to  truncation  of  the 
perturbation  expansion  at  second  order.  Vijayaku- 
mar  and  Gopinathan  [29]  employed  a  relativistic 
correlated  local  density  functional  (RCH)  method 
to  calculate  the  lEs  of  atoms  He  through  Zn.  The 
computed  IE  of  Zn  by  the  RCH  method  is  8.41  eV, 
underestimating  the  experimental  value  by  as 
much  as  1  eV.  Guo  and  Whitehead  [30]  computed 
the  IE  of  Cd  by  generalized-exchange  local-spin- 
density-functional  method  with  relativistic  correc¬ 
tions  of  the  mass  velocity  and  Darwin  terms.  The 


TABLE  VII _ 

Ionization  energies  of  Zn  and  Cd  (eV). 


Ionization  energy 

Atom 

This  work 

Other  work 

Experiment® 

Zn 

9.314 

8.41,'"  ais*" 

9.39 

Cd 

9.081 

8.72^ 

8.99 

®  Moore’s  table  (Ref.  [28]). 

^Vijayakumar  and  Gopinathan;  relativitsic  and  correlated 
local-density  H  method  (Ref.  [29]). 

Raghavachari  and  Trucks;  nonrelativistic  QCISD(T)  (Ref. 
[20]). 

Guo  and  Whitehead;  QR-LSD-GX-SIC-GWB  method  (Ref. 
[30]). 


density  functional  method  produces  an  IE  for 
Cd  (8.72  eV)  in  closer  agreement  with  experiment 
(8.99  eV)  than  does  Vijayakumar's  RCH  estimate 
for  Zn. 

ACKNOWLEDGMENT 

This  work  was  supported  in  part  by  the  Na¬ 
tional  Science  Foundation. 


References 

1.  Y.  Ishikawa  and  U.  Kaldor,  in  Computational  Chemistry: 
Reviezos  of  Current  Trends,  Vol.  I,  J.  Leszczynski,  Ed.  (World 
Scientific,  Singapore,  1996). 

2.  W.  R.  Johnson  and  J.  Sapirstein,  Phys.  Rev.  Lett.  57,  1126 
(1986);  W.  R.  Johnson,  M.  Idrees,  and  J.  Sapirstein,  Phys. 
Rev.  A  35,  3218  (1987);  W.  R.  Johnson,  S.  A.  Blundell,  and  J. 
Sapirstein,  Phys.  Rev.  A  37,  307  (1988);  Ibid.  42,  1087  (1990); 
S.  A.  Blundell,  J.  Sapirstein,  and  W.  R.  Johnson,  Phys.  Rev. 
D  45,  1602  (1992);  S.  A.  Blundell,  W.  R.  Johnson,  and  J. 
Sapirstein,  Phys.  Rev.  A  37,  2764  (1988);  Ibid.  38,  2699 
(1988);  Ibid.  42,  1087  (1990);  Ibid.  42,  3751  (1990). 

3.  H.  M.  Quiney,  I.  P.  Grant,  and  S.  Wilson,  Phys.  Scr.  36,  460 
(1987);  H.  M.  Quiney,  1.  P.  Grant,  and  S.  Wilson,  in  Many- 
Body  Methods  in  Quantum  Chemistry  (Lecture  Notes  in 
Chemistry  52),  U.  Kaldor,  Ed.  (Springer,  Berlin,  1989);  H.  M. 
Quiney,  I.  P.  Grant,  and  S.  Wilson,  J.  Phys.  B  23,  L271 
(1990). 

4.  K.  Koc  and  J.  Migdalek,  J.  Phys.  B  23,  L5  (1990). 

5.  Y.  Ishikawa,  Phys.  Rev.  A  42,  1142  (1990);  Y.  Ishikawa  and 
H.  M.  Quiney,  Phys.  Rev.  A  47,  1732  (1993);  Y.  Ishikawa 
and  K.  Koc,  Phys.  Rev.  A  50,  4733  (1994). 

6.  Y.  Ishikawa  and  K.  Koc,  Phys.  Rev.  A  53,  3966  (1996);  Phys. 
Rev.  A,  in  press. 

7.  M.  H.  Chen,  K,  T.  Cheng,  and  W.  Johnson,  Phys.  Rev.  A  47, 
3692  (1993). 

8.  D.  R.  Beck  and  Z.  Cai,  Phys.  Rev.  A  37,  4481  (1988);  D.  R. 
Beck,  Ibid.  37,  1847  (1988);  Ibid.  45,  1399  (1992). 

9.  B.  Hess,  Phys.  Rev.  A  33,  3742  (1986);  G.  Jansen  and  B.  A. 
Hess,  Phys.  Rev.  A  39,  6016  (1989);  R.  Samzow,  B.  A.  Hess, 
and  G.  Jansen,  J.  Chem.  Phys.  96,  1227  (1992);  A  Pizlo,  G. 
Jansen,  and  B.  A.  Hess,  J.  Chem.  Phys.  98,  3945  (1993). 

10.  T.  Kagawa,  Y.  Honda,  and  S.  Kiyokawa,  Phys.  Rev.  A  44, 
7092  (1991). 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


553 


ISHIKAWA  AND  KOC 


11.  K.  Koc,  Y.  Ishikawa,  and  T.  Kagawa,  Chem.  Phys.  Lett.  231, 
407  (1994);  K.  Koc,  Y.  Ishikawa,  T.  Kagawa,  and  Y.-K,  Kim, 
Chem.  Phys.  Lett.  263,  338  (1996). 

12.  P.  J.  C.  Aerts  and  W.  Nieuwpoort,  Chem.  Phys.  Lett.  125,  83 
(1986);  Int.  J.  Quant.  Chem.  Symp.  19,  267  (1986);  O.  Visser, 
L.  Visscher,  P.  J.  C.  Aerts,  and  W.  C.  Nieuwpoort,  Theor. 
Chim.  Acta  81,  405  (1992). 

13.  E.  Lindroth,  Phys.  Rev.  A  37,  316  (1988);  E.  Lindroth  and  S. 
Salomonson,  Phys.  Rev.  A  41,  4659  (1990);  E.  Lindroth,  H. 
Persson,  S.  Salomonson,  and  A-M.  Martensson-Pendrill, 
Phys.  Rev.  A  45,  1493  (1992). 

14.  H.  Sekino  and  R.  J.  Bartlett,  Int.  J.  Quant.  Chem.  Symp.  24, 
241  (1990). 

15.  E.  Eliav,  U.  Kaldor,  and  Y.  Ishikawa,  Phys.  Rev.  A  49,  1724 
(1994);  Ibid.  50,  1121  (1994);  Ibid.  52,  291  (1995);  Phys.  Rev. 
Lett.  74,  1079  (1995). 

16.  S.  A.  Blundell,  W.  R.  Johnson,  Z.  W.  Liu,  and  J.  Sapirstein, 
Phys.  Rev.  A  39,  3768  (1989);  40,  2233  (1989);  S.  A.  Blundell, 
W.  R.  Johnson,  and  J.  Sapirstein,  Phys.  Rev.  A  43,  3407 
(1991). 

17.  K.  Koc  and  Y.  Ishikawa,  Phys.  Rev.  A  49,  794  (1994). 

18.  B.  O.  Roos,  P.  Linse,  P.  E.  M.  Siegbahn,  and  M.  R.  A. 
Blomberg,  Chem.  Phys.  66,  197  (1982);  K.  Wolinski  and  P. 
Pulay,  J.  Chem.  Phys.  90,  3647  (1989);  K.  Hirao,  Chem.  Phys. 
Lett.  196  397  (1992);  Ibid.  201,  59  (1993);  R.  B.  Murphy  and 
R.  P.  Messmer,  J.  Chem.  Phys.  97,  4170  (1992);  I.  Shavitt, 


presented  at  the  1992  Sanibel  Symposium,  St.  Augustine, 
Florida,  March  1992. 

19.  R.  L.  Martin  and  P.  J.  Hay,  J.  Chem.  Phys.  75,  4539  (1981). 

20.  K.  Raghavachari  and  G.  W.  Trucks,  J.  Chem.  Phys.  91,  2457 
(1989). 

21.  1.  P.  Grant,  in  Relativistic,  Quantum  Electrodynamic,  and 
Weak  Interaction  Effects  in  Atoms,  Proceedings  of  the  Conference 
on  Relativistic,  Quantum  Eletrodynamic,  and  Weak  Interaction 
Effects  in  Atoms,  W.  R.  Johnson,  P.  Mohr,  and  J.  Sucher,  Eds. 
AIP  Conf.  Proc.  No.  189  (AIP,  New  York,  1989),  p.  209. 

22.  J.  Sucher,  Phys.  Rev.  A  22,  348  (1980);  Phys.  Scripta  36,  271 
(1987);  J.  Phys.  B  21,  L585  (1988). 

23.  M.  H.  Mittleman,  Phys.  Rev.  A  24,  1167  (1981);  Ibid.  4,  893 
(1971);  Ibid.  5,  2395  (1972). 

24.  C.  Moller  and  M.  S.  Plesset,  Phys.  Rev.  46,  618  (1934). 

25.  P.  O.  Lowdin,  Adv.  Chem.  Phys.  2,  207  (1959). 

26.  S.  Huzinaga  and  M.  Klobukowski,  J.  Mol.  Struct.  Theochem. 
167,  1  (1988);  S.  Huzinaga,  M.  Klobukowski,  and  H.  Tate- 
waki.  Can.  J.  Chem.  63,  1812  (1985);  S.  Huzinaga  and  M. 
Klobukowski,  Chem.  Phys.  Lett.  212,  260  (1993). 

27.  J.  P.  Desdaux,  Comput.  Phys.  Commun.  9,  31  (1975). 

28.  C.  E.  Moore,  Atomic  Energy  Levels,  NSRDS-NBS  No.  35  (U.S. 
Government  Printing  Office,  Washington,  DC  ,  1971). 

29.  M.  Vijayakumar  and  M.  S.  Gopinathan,  J.  Chem.  Phys.  97, 
6639  (1992). 

30.  Y.  Guo  and  M.  A.  Whitehead,  Phys.  Rev.  A  38,  3166  (1988). 


554 


VOL  65,  NO.  5 


Hyperfine  Structure  Constants  of 
(d+  s)^  States  in  La  I  and  the  Zr  II  and 
Hf  II  Isoelectronic  Sequences 


DONALD  R.  BECK 

Physics  Department,  Michigan  Technological  University,  Houghton,  Michigan  49931 
Received  2  March  1997;  revised  18  June  1997;  accepted  19  June  1997 


ABSTRACT:  Accurate  results  for  hyperfine  structure  (HFS)  constants  of  (d  +  s)” 
states  often  require  the  inclusion  of  both  correlation  and  relativistic  effects.  Without 
these,  Dirac-Fock  HFS  constants  may  have  the  wrong  sign,  the  energies  of  uppermost 
levels  may  be  wrong  by  over  1  eV.  Here,  we  demonstrate  that  these  problems  persist 
well  into  an  isoelectronic  sequence,  and  identify  the  principle  configurations  needed  to 
achieve  accurate  results.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  555-564,  1997 


Introduction 

Transition-metal  atoms  play  important  roles  in 
catalysis,  as  impurities  in  plasma  fusion  de¬ 
vices,  and  as  deep-level  traps  in  semiconductors. 
Previously,  properties  of  (d  4-  s)"  states  of  these 
species,  which  form  many  of  the  ground  and 
low-lying  excited  states,  have  been  rather  poorly 
understood.  Positions  of  d”  levels  can  be  in  error 
over  1  eV,  independent  particle  hyperfine  struc¬ 
ture  (HFS)  constants  can  have  the  wrong  sign,  and 
little  ab  initio  work  has  been  done  on  their  transi¬ 
tion  probabilities. 

Project  grant  sponsor:  Division  of  Chemical  Sciences,  Office 
of  Energy  Research,  U.S.  Department  of  Energy. 

Project  grant  number:  DE-FG02-92ER14282. 

International  Journal  of  Quantum  Chemistry,  Vol.  65,  555-564  (1997) 
©  1997  John  Wiley  &  Sons,  Inc. 


Our  HFS  work  on  these  states,  which  began  in 
1992  [1],  has  established  the  need  to  accurately 
position  the  levels  relative  to  those  from  d" 

and  in  order  to  account  for  the  HFS  con¬ 

stants  for  the  latter  two  configurations.  We  may 
interpret  this  as  being  due  to  the  former  normally 
having  large  HFS  constants  arising  from  the  con¬ 
tact  operator  contributing  at  the  zeroth-order  (in¬ 
dependent  particle)  level.  Of  course,  in  addition  to 
positioning  the  levels  correctly,  the  usual  core  po¬ 
larization  effects  must  be  included. 

Since  relativistic  effects  associated  with  d,  s 
electron  interchanges  may  be  several  tenths  of  an 
electron  volt  [2],  they  can  have  an  important  im¬ 
pact  on  level  positions  and  so  must  be  included  in 
any  theoretical  treatment.  It  is  also  necessary  to 
treat  d",  d”“^s,  and  d”“^s^  levels  on  an  equal 
footing,  so  our  theory  must  be  a  multireference 
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one.  Furthermore,  when  obtaining  correlated  wave 
functions  for  these  levels,  one  can  be  involved 
with  configurations  possessing  a  few  hundreds  of 
eigenvectors,  each  constructed  from  several  hun¬ 
dreds  of  Slater  determinants.  This  calls  for  some 
innovative  ways  of  reducing  computational  costs. 


Methodology 

The  wave  functions  are  generated  from  the 
DiraC“Coulomb  Hamiltonian,  with  the  Breit  con¬ 
tribution,  optionally,  added  as  a  perturbation. 
Multiconfigurational  Dirac-Fock  (MCDF)  solu¬ 
tions  obtained  from  Desclaux's  program  [3]  are 
used  as  zeroth-order  functions.  There  are  gener¬ 
ated  from  just  (d  +  sY  levels.  Correlation  is  intro¬ 
duced  through  relativistic  configuration  interac¬ 
tion  (RCI);  normally  a  first-order  form  is  chosen, 
i.e.,  those  configurations  generated  by  making  sin¬ 
gle  or  double  excitations  from  outermost  subshells 
of  MCDF  configurations.  The  question  of  what  to 
use  as  one-electron  functions  for  unoccupied  (vir¬ 
tual)  subshells  is  an  important  one,  as  poor  choices 
may  lead  to  (partial)  variational  collapse  into  the 
''positron  sea."  This  can  be  avoided  if  the  major 
and  minor  components  of  these  functions  are 
properly  constrained  (e.g.,  not  allowed  to  vary 
separately).  In  our  case,  experience  has  demon¬ 
strated  that  use  of  relativistic  screened  (Z*)  hydro- 
genic  functions  avoid  this  problem,  while  at  the 
same  time  providing  an  adequately  converged  se¬ 
ries,  if  Z*  is  chosen  during  the  RCI  process  to 
minimize  the  energy  for  the  root  of  interest. 

Since  much  of  our  work  deals  with  excited 
states,  we  prefer  to  work  with  moderately  sized 
(<  7000)  energy  matrices,  which  we  diagonalize 
using  Weber  and  co-workers'  [4]  variant  of  the 
Davidson  algorithm,  to  which  we  have  added  some 
improvements  [5].  We  thus  want  to  use  N  electron 
functions  which  are  eigenstates  of  /^,  and  par¬ 
ity.  Formerly,  we  used  to  generate  eigenstates  of 

by  direct  diagonalization  of  that  matrix,  but 
some  of  the  correlation  configurations  can  involve 
matrices  of  order  several  thousand,  and  we  may 
need  a  few  hundred  eigenvectors.  In  our  HFS 
studies,  we  have  avoided  this  difficulty  by  split¬ 
ting  the  configuration  into  two  separate  parts,  each 
of  which  we  make  eigenstates  of  and  and 
their  "paste"  the  parts  back  together  using  stan¬ 


dard  vector  coupling  theory  (sums  of  the  product 
of  the  two  section  functions  and  a  single  3/  symbol 
are  involved).  The  procedure  is  very  fast  and  capa¬ 
ble  of  dealing  with  large  problems. 

To  further  restrict  the  size  of  the  energy  matrix, 
we  have  automated  a  suggestion  first  made  by 
Bunge  [6]  in  a  nonrelativistic  study  of  the  carbon 
atom.  This  recognizes  that  the  N  parents  (eigen¬ 
vectors)  of  a  single  configuration,  generate  matrix 
elements  (with  the  zeroth-order  function)  involv¬ 
ing  M  radial  integrals,  and  that  M  is  less  than  N, 
usually.  In  fact,  in  the  transition  metals  and  rare 
earths,  M  may  be  10-1000  times  smaller  than  N. 
Since  the  theory  is  first  order  in  "form,"  a  rotation 
of  the  original  set  of  parents  can  be  done,  such  that 
in  the  new  set,  there  are  only  M  parents  which 
have  a  nonzero  interaction  with  the  reference  func¬ 
tion.  The  other  N  -  M  functions  can  then  be  dis¬ 
carded,  consistent  with  a  first-order  (in  form)  the¬ 
ory.  Certain  adjustments  are  made  for  the  fact  that 
our  theory  is  relativistic,  and  multireference,  but 
the  approximation  is  both  a  good  one  (errors  less 
than  200  cm~^)  and  highly  efficient.  Further  de¬ 
tails  of  the  methodology  can  be  found  in  our  Zr  II 
paper  [7]. 

COMPUTATIONAL  STRATEGY 

Our  calculations  begin  by  doing  RCI  calcula¬ 
tions  built  from  single  and  double  excitations  from 
the  valence  (e.g.,  4rf,5s  in  Zr  II)  subshells  into 
virtuals  of  symmetries  s,  p,  rf,  g  (one  radial  for 
each  type).  These  excitations  are  significant  con¬ 
tributors  to  energy  differences,  and  sensitive  to 
orthogonalization  order  (if  core  excitations  are  done 
first,  it  is  difficult  to  recover  energy  loss  from 
valence  virtuals  due  to  orthogonalization  to  core 
virtuals).  Now  the  4rf  and  5s  radial  functions  can 
vary  substantially  between  (4d)^,  (4rf)^5s  and 
4d(5s)^  configurations.  To  account  for  this,  we  do 
the  MCDF  calculation  for  the  root  of  most  interest 
(e.g.,  (AdYj  =  I  was  chosen  in  Zr  II,  as  this  was 
the  subject  of  a  recent  measurement  [7]),  and  use 
"Brillouin"~type  excitations,  viz.  4d  ^  vd,  5s  vs 
(and  eventually  4p  ^  vp)  to  improve  the  descrip¬ 
tion  of  the  other  roots. 

After  the  first  set  of  valence  virtuals  is  deter¬ 
mined,  a  second  set,  associated  with  these  same 
single  and  double  valence  excitations  is  added; 
this  normally  provides  sufficient  radial  saturation 
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of  the  valence  space.  The  larger  triple  excitations 
are  also  added  in  at  this  time. 

In  the  next  calculational  layer,  we  begin  to  make 
single  and  double  excitations  form  the  shallow 
core,  viz.  the  4  s  and  4p  subshells,  adding  in  a 
third  set  of  virtuals,  with  larger  effective  Z*  (de¬ 
termined  during  the  RCI  process).  Important  exci¬ 
tations  include  4ip  ^  vp  -\-  vf,  and  4s  5s  -1-  z;s, 
Ad  vd.  Decisions  on  what  to  include  are  based 
on  the  criteria  that  we  are  seeking  energy  contribu¬ 
tions  which  are  differentially  large  or  make  sub¬ 
stantial  contributions  to  HFS.  These  single  excita¬ 
tions  generally  fall  into  both  categories. 

At  this  stage,  errors  in  relative  energy  differ¬ 
ences  were  still  too  large,  on  the  order  of  a  few 
tenths  of  an  electron  volt.  We  may  reasonably 
expect  the  next  most  important  corrections  to  come 
from  the  classes  of  excitations:  Ap{Ad  -h  5s)  , 

but  in  fact  these  have  a  differentially  modest  ef¬ 
fect.  Next,  we  looked  at  double  excitations  from 
4p^.  The  most  important  of  these,  differentially, 
turn  out  to  be  4p^  ^  4rf^  -h  Advd,  which  we  may 
ascribe  to  (1)  the  fact  that  /^  ->  (/  -h  1)^  are  among 
the  largest  [8],  and  (2)  Pauli  exclusion  effects  are 
involved,  i.e.,  the  Ad  subshells  are  already  partially 
occupied,  and  this  tends  to  reduce  the  contribution 
in  varying  amounts,  depending  on  the  occupancy 
of  the  Ad  subshell  for  the  level  of  interest.  This 
finding  is  valid  for  all  transition  metal  spectra 
studied  to  date,  and  was  first  noted  in  the  Zr  II 
work  [7]. 

By  using  intermediate  normalization  = 

1,  where  O  is  the  zeroth-order  function,  and  ^  is 
the  RCI  function,  it  is  possible  to  assign  energy 
contributions  to  each  correlation  vector.  These  are 
collected  in  Table  I,  for  Zr  II  for  all  configurations 
contributing  0.025  eV  or  more  to  at  least  one  root. 
The  excitations  present  in  Table  I  reduce  the  aver¬ 
age  energy  difference  error  to  0.075  eV  600 
cm"^).  On  simpler  systems,  e.g..  La  II,  we  have 
reduced  the  error  to  0.02  eV  or  160  cm“^  [9]. 


Results 

Zr  11  ISOELECTROMC  SEQUENCE 

In  this  sequence,  we  avoid  Y  I  because  of  the 
interpenetrating  manifolds  5s^6s  and  Ad5s6s  [10] 
which  would  increase  the  number  of  reference 
functions  needed.  In  a  relativistic  treatment  such 


as  this,  each  /  is  a  separate  problem.  We  chose  to 
study  /  =  I  because  it  had  the  greatest  number  of 
reference  functions  (10),  thus  making  it  challeng¬ 
ing,  and  because  several  /  ==  |  levels  were  just 
being  measured  for  Zr  II  [11].  Results  for  Zr  II 
have  already  appeared  [7,  11],  but  we  reproduce 
them  here  in  Table  II,  as  this  species  can  provide  a 
benchmark  for  the  other  sequence  members  for 
which  no  HFS  measurements  exist.  It  can  be  seen 
that  the  average  error  for  the  RCI  A  results  is 
9.2%,  in  contrast  to  the  Dirac-Fock  (DF)  values, 
for  which  two  of  the  three  results  have  the  wrong 
sign.  We  also  note  that  we  are  in  agreement  with 
the  corrections  of  Kiess  [12]  to  the  Moore  [10] 
energy  levels. 

In  Table  III  we  present  results  for  the  Nb  III 
member  of  the  /  =  f  isoelectronic  sequence.  For 
the  five  known  energy  differences  [10],  the  theoret¬ 
ical  error  is  0.050  eV  (410  cm~^);  note  that  we  have 
predicted  the  positions  of  four  new  levels,  not 
available  from  experiment  [10, 13].  Though  there  is 
greater  agreement  between  the  DF  and  RCI  results 
for  A  than  in  Zr  II,  there  are  still  two  levels 
exhibiting  a  sign  reversal.  The  one  and  N  electron 
basis  sets  used  to  generate  the  results  are  the  same 
as  that  used  for  Zr  II,  with  the  obvious  changes  in 
radial  functions  (MCDF  and  virtuals,  viz.  Z*). 

In  Table  IV  are  presented  the  results  for  Mo  IV 
]  =  \  using  the  same  1  and  N  electron  basis  sets 
(radial  sets  suitably  adjusted).  These  calculations 
along  isoelectronic  sequences  were  considerably 
eased  by  the  fact  that  almost  the  entire  data  sets 
(several  thousand  lines)  could  be  reused  from  one 
member  to  the  next.  Here,  the  average  error  for  the 
eight  known  [10,  14]  energy  levels  is  somewhat 
larger:  0.107  eV  or  860  cm"\  and  just  one  new 
level  has  been  identified.  We  may  note  that  there 
is  still  one  value  of  A  for  which  the  DF  and  RCI 
results  have  different  signs. 

THE  Hf  11  ISOELECTRONIC  SEQUENCE 

Discussion  of  La  I  is  postponed  to  the  next 
section,  as  it  is  a  thoroughly  measured  system  and 
we  have  done  calculations  for  several  /'s.  The  first 
member  of  the  series  considered  here  is  Hf  II, 
whose  results  are  presented  in  Table  V.  The  aver¬ 
age  error  [10]  for  eight  energy  differences  is  0.030 
eV  ('-'  200  cm^O.  One  new  level  is  predicted,  and 
three  of  the  A's  show  large  differences  (sign  or 
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TABLE  I _ 

Energy  contributions  in  electron  volts  for  Zr  II,  J  =  |.  All  signs  reversed. 


Roots 

4d^5s  4d®+  4d5s®  4d®  + 

4d®  +4d^  4d5s®  +4d^  4d®  4d^5s  4d^5s  4d^5s  4d®  4d^5s 


Basis 

4p 

V 

V 

V 

(4c/  +  5s)3 

-0.195 

- 1 .041 

-0.385 

-0.140 

0.010 

0.002 

0.993 

0.738 

0.014 

0.008 

4c/vp2 

0.269 

0.304 

0.166 

0.473 

0.056 

0.005 

0.003 

0.040 

0.022 

0.005 

Advd^ 

0.339 

0.011 

0.228 

0.002 

0.192 

0.002 

0.216 

-0.003 

0.161 

0.001 

Advf^ 

0.571 

0.054 

0.395 

0.010 

0.351 

0.000 

0.338 

-0.010 

0.244 

0.001 

Advg^ 

0.069 

0.008 

0.044 

0.000 

0.021 

0.000 

0.021 

-0.002 

0.014 

0.000 

Advsvd 

0.000 

0.061 

0.006 

0.000 

0.000 

0.047 

0.000 

0.044 

0.000 

0.048 

Advpvf 

0.090 

0.383 

0.102 

0.000 

0.019 

0.196 

-0.031 

0.214 

0.041 

0.192 

Ad^vs 

0.000 

0.004 

0.009 

0.000 

0.000 

0.027 

0.048 

0.018 

0.001 

0.022 

Ad^  vd 

0.024 

0.063 

0.066 

0.036 

0.091 

0.016 

0.075 

0.006 

0.084 

0.004 

Ad^vg 

0.068 

0.086 

0.051 

0.000 

0.034 

0.003 

0.039 

0.003 

0.014 

0.009 

5s  vp^ 

0.000 

0.014 

0.000 

0.097 

0.000 

0.034 

0.000 

0.009 

0.000 

0.000 

5s  vd^ 

0.000 

0.050 

0.000 

0.002 

0.000 

0.062 

0.000 

0.062 

0.000 

0.037 

5svf^ 

0.000 

0.164 

0.000 

0.016 

0.000 

0.172 

0.000 

0.127 

0.000 

0.058 

5s  vs  vd 

0.000 

0.000 

0.000 

0.033 

0.000 

0.000 

0.000 

0.003 

0.000 

0.000 

5s  vp  vf 

0.000 

0.000 

0.000 

0.227 

0.000 

0.000 

0.000 

-0.006 

0.000 

0.014 

Ad  5s  vs 

0.004 

0.000 

-0.002 

0.162 

0.000 

0.000 

0.001 

0.035 

0.001 

0.000 

Ad5svd 

0.000 

0.561 

0.010 

0.047 

0.000 

0.316 

-0.017 

0.364 

0.000 

0.295 

Ad5svg 

0.012 

0.000 

-0.001 

0.032 

0.000 

0.000 

0.000 

0.028 

0.000 

0.015 

5s^vd 

0.000 

0.000 

0.000 

0.434 

0.000 

0.000 

0.000 

-0.010 

0.000 

0.000 

Ap^vfAd^ 

0.911 

0.034 

1.003 

0.000 

1.109 

0.021 

1.010 

0.021 

1.166 

0.020 

Ap^vfAd^5s 

0.022 

0.853 

0.020 

0.054 

0.020 

0.852 

0.023 

0.805 

0.019 

0.899 

Ap^vfAd5s^ 

0.000 

0.007 

0.000 

0.505 

0.000 

0.006 

0.000 

0.007 

0.000 

0.006 

Ap^vpAd^ 

0.012 

0.040 

0.016 

0.000 

0.030 

0.042 

0.021 

0.056 

0.035 

0.084 

Ap^vpAd^5s 

0.000 

0.067 

0.002 

0.092 

0.004 

0.067 

0.004 

0.081 

0.005 

0.079 

Ap^vpAd5s^ 

0.000 

0.000 

0.000 

0.334 

0.000 

0.000 

0.000 

0.001 

0.000 

0.004 

AsvdAd^ 

0.049 

0.014 

0.036 

0.000 

0.042 

0.005 

0.032 

0.002 

0.022 

0.006 

AsvdAd^5s 

0.025 

0.056 

0.034 

0.015 

0.026 

0.041 

0.001 

0.020 

0.026 

0.018 

Ap'^Ad^ 

0.746 

0.000 

0.630 

0.000 

0.576 

0.000 

0.621 

0.000 

0.505 

0.000 

Ap^Ad^5s 

0.000 

0.757 

0.001 

0.000 

0.000 

0.754 

-0.002 

0.749 

0.000 

0.680 

Ap'^Ad^5s^ 

0.005 

0.000 

0.002 

0.876 

0.001 

0.000 

0.001 

-0.001 

0.001 

0.000 

Ap^Ad^vdvf 

0.062 

0.011 

0.070 

0.000 

0.070 

0.012 

0.074 

0.012 

0.075 

0.012 

Ap^  Ad  5svdvf 

0.000 

0.054 

0.000 

0.024 

0.000 

0.054 

0.000 

0.053 

0.000 

0.055 

Ap^5s^vdvf 

0.000 

0.000 

0.000 

0.029 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

As  vs  Ad  5s^ 

0.000 

0.000 

0,000 

0.056 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

AsAd^5s 

0.001 

0.062 

0.004 

0.004 

0.005 

0.061 

0.004 

0.045 

0.006 

0.016 

AsAd"^ 

0.041 

-0.005 

0.063 

0.000 

0.057 

0.002 

0.042 

0.003 

0.016 

0.002 

Ap^vpvd5s^ 

0.000 

0.000 

0.000 

0.040 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

Total 

3.136 

2.721 

2.576 

3.516 

2.718 

2.846 

3.520 

3.553 

2.473 

2.623 
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TABLE  II 


Fine  and  hyperfine  structure  of  Zr  II,  J  = 

=  |(cf  +  s)® 

states. 

Level® 

Energy  (cm 

A  dVIHz)" 

B  (MHz)/ 0(6)'’ 

e  (MHz)'’ 

Config. 

SLJ 

Expt.® 

Theory^ 

Theory^  Expt. 

Theory^ 

Expt. 

4cf® 

27  699.96 

29066 

-190.0 

224.0 

4cf3 

20  080.30 

20  710 

111.5  109.768 

(-76.0) 

-138.4 

(-33.9) 

33.716 

4of5s^ 

14  298.64 

14996 

-114.7 

-82.9 

4d® 

13428.64 

14387 

-66.8 

165.1 

4c/® 

V 

9  742.80 

10072 

134.1  110.177 

(-56.4) 

-133.3 

(-156.2) 

34.713 

4of®5s 

‘'p 

7  736.02 

8  209 

-534.6 

159.5 

4c/®  5s 

6  1 1 1 .70 

6  788 

134.3 

-132.2 

4c/®  5s 

4  248.30 

5  084 

190.3  199.250 

(375.1) 

-28.8 

(-83.8) 

7.706 

4c/® 

2  572.21 

2  950 

-298.0 

-57.3 

4c/®  5s 

"F 

0.000 

0 

273.8 

60.5 

°  Labeling  of  levels  is  from  Moore  [10]  as  corrected  by  Kiess  [12],  Experimental  energy  is  also  drawn  from  these  sources. 

From  previous  work  [11]. 

^  Dirac-Fock  results  are  given  in  parentheses. 


TABLE 
Fine  ar 
J  =  |(o 

:  III 

TABLE 
Fine  an 
J  =  f(d 

;  IV 

id  hyperfine  structure  of  Nb  Mi, 
f  +  s)®  states.® 

d  hyperfine  structure  of  Mo  IV, 

+  s)^  states.^ 

Level 

Energy  (cm  '') 

A  (MHz) 
Theory 

B  (MHz)  / 
Qib) 
Theory'' 

Level 

Energy  (cm  ^) 

A  (MHz) 
Theory" 

B  (MHz)  / 
Qib) 
Theory" 

Config. 

SLJ 

Expl.® 

Theory‘s 

Config. 

SLJ 

Expt." 

Theory^ 

4c/ 5s® 

71  816 

937.7 

355.9 

4c/5s® 

®D 

146  242 

-342 

501 

(710.0) 

(326.5) 

(-264) 

(456) 

4c/®  5s 

44  448 

-1012.2 

-314.5 

4c/®  5s 

®P 

81  052 

81  858 

423 

-446 

(-965.6) 

(-314.8) 

(387) 

(-436) 

4Gf®5s 

36536 

37111 

-1355.6 

-114.0 

4c/®  5s 

®D 

74  009 

74  862 

493 

-170 

(-1526.7) 

(-132.3) 

(510) 

(-180) 

4c/®  5s 

V 

34807 

35360 

2401 .4 

255.7 

4c/®  5s 

"P 

72163 

73  065 

-871 

350 

(2563.0) 

(251 .9) 

(-919) 

(345) 

4c/® 

33  870 

943.7 

371 .4 

4c/®  5s 

Y 

60  893 

60  794 

447 

143 

(660.3) 

(406.4) 

(511) 

(138) 

4c/®  5s 

V 

25221 

25  061 

-1149.3 

103.2 

4d® 

®D 

39  231 

41  548 

-272 

524 

(-1361.0) 

(100.8) 

(-247) 

(559) 

4c/® 

13  602 

289.6 

-42.78 

4C/® 

®D  +®P 

17107 

17  857 

-106 

10.9 

(427.4) 

(-8.66) 

(-141) 

(37.6) 

4c/® 

^P 

10912 

11411 

-47.82 

-248.5 

4c/® 

®P+®D 

14176 

14  920 

-86.0 

-388 

(378.8) 

(-271.2) 

(-162) 

(-419) 

4c/® 

8608 

8  864 

-267.7 

-215.8 

4C/® 

Y 

10  331 

10738 

48.7 

-315 

(310.4) 

(-241.2) 

(-112) 

(-340) 

4c/® 

"P 

0.0 

0.0 

1128.0 

-104.3 

4c/® 

Y 

0.0 

0.0 

-379 

-151 

(659.4) 

(-114.9) 

(-246) 

(-161) 

92.90,  Z  =  41,  /  = 

f,  M/  =  6.167. 

95.94,  Z  = 

42,  /  = 

f,  A/=  -0.9133. 

From  Iglesias  [13]. 

°  From  Fernandez  et  al. 

[14]. 

^  Uses  same  configurational  basis  set  as  Zr  II,  J 

_  3 

“  2  • 

^  Uses  same  configurational  basis  set  as  Zr  II,  J 

3 

“  2- 

^Dirac-Fock  results  are  given  in  parentheses. 

^  Dirac-Fock  results  are  given  in  parentheses. 
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TABLE  V _ 

Fine  and  hyperfine  structure  of  Hf  II, 
J  =  I (d  +  s)^  states.® 


Level 

Energy  (cm 

A  (MHz) 
Theory*’ 

B  (MHz)  / 
0(0) 
Theory*’ 

Config. 

SLJ 

Expt* 

Theory^ 

5d^ 

45  798 

113.1 

476.0 

(104.0) 

(533.0) 

5d® 

37324 

37198 

-60.0 

-99.9 

(48.6) 

(35.5) 

30  594 

30  479 

11 A 

-337.0 

(81.7) 

(-361.0) 

5d^ 

27  285 

26  958 

-121.1 

-290.0 

(43.0) 

(-308.0) 

5d^ 

V 

18897 

18  395 

140.8 

-162.0 

(107.8) 

(-182.0) 

5d^  6s 

^P 

17  830 

17  478 

-388.0 

-214.0 

(-496.0) 

(-261 .0) 

5d^  6s 

14359 

14446 

-224.0 

-170.0 

(-258.0) 

(-284.0) 

5d^6s 

V 

12  920 

12815 

638.0 

320.0 

(686.0) 

(312.0) 

5d^6s 

V 

3644 

3336 

-343.0 

193.0 

(-445.0) 

(121.0) 

5d6s^ 

0 

0 

1.88 

342.0 

(117.3) 

(458.0) 

178.49, 

Z  =  72,  /  : 

=  ^Ji|  =  0.6^. 

^  From  Moore  [10]. 

Uses  same  configurational  basis  set  as  Zr  II,  J  =  |. 
^  Dirac-Fock  results  are  given  in  parentheses. 


order  of  magnitude)  between  the  DF  and  RCI 
results.  Most  of  our  tables  also  contain  predictions 
for  the  electric  quadruple  constant  B,  divided  by 
the  quadrupole  moment,  Q.  One  can  either  extract 
B,  if  Q  is  known  [15],  or  if  a  measurement  of  B  is 
available,  as  in  Zr  II  [11],  a  determination  of  Q 
may  be  made. 

In  Table  VI,  we  present  our  results  for  the  Ta  III 
member  of  this  sequence.  Since  we  were  unable  to 
find  any  experiment  work  on  energy  levels,  our 
results  constitute  predictions  for  the  relative  posi¬ 
tions  of  nine  levels.  In  this  case,  there  are  five  A's 
exhibiting  substantial  differences  between  DF  and 
RCI  results. 

Table  VII  contains  our  results  for  the  W  IV 
member  of  this  sequence.  Again,  there  appears  to 
be  no  available  experiment,  so  nine  new  levels  are 
predicted.  Three  A's  exhibit  sign  changes  between 
the  DF  and  RCI  value.  Studies  in  the  Zr  II  and  Flf 
II  sequences  were  terminated  once  we  reached 


TABLE  VI _ 

Fine  and  hyperfine  structure  of  Ta  III, 
J=  |(d  +  s)^  states.® 


Level 

Energy  (cm  "') 

B  (MHz)  / 
A  (MHz)  Q{b) 

Config. 

SLJ 

Expt.^  Theory‘s 

Theory^  Theory^ 

5d^ 

42  299 

478.9  679.1 

(606.7)  (756.7) 

5d^6s 

^P 

38  054 

-1140.0  -347.0 

(-1917.0)  (-581.0) 

5d^6s  +  Sd^^D 

30  834 

-18.7  545.0 

(678.0)  (687.0) 

5d®  +  50^63 
+5d6s^ 

24  836 

-415.3  -220.7 

(469.3)  (-365.5) 

50^68 

4p  ^2p 

19  683 

308.6  445.9 

(357.7)  (474.0) 

5d^ 

4p  ^2p 

16  363 

-165.1  -235.3 

(240.4)  (-128.0) 

5d^ 

‘'P+^D 

13  402 

-618.5  -295.2 

(337.0)  (-470.0) 

5d^ 

^P  +*P  +''f 

7  814 

-134.6  -279.6 

(357.0)  (-413.0) 

5d^6s 

"F 

5  541 

-1961.7  110.3 

(-2433.1)  (120.7) 

5d® 

V 

0 

926.7  -299.4 

(663.2)  (-338.0) 

®/A  =  180.95,  Z  =  73,  /  =  |,  /A;  =  2.36. 

^  Uses  same  configuration  basis  set  as  Zr  II,  J  =  |. 
^  Dirac-Fock  results  are  given  in  parentheses. 


species  of  lesser  technological  interest  (in  our  judg¬ 
ment);  dramatic  differences  between  at  least  a  few 
RCI  and  DF  A  values  are  likely  to  persist  beyond 
the  ions  reported  here. 

RESULTS  FOR  La  I 

La  I  has  been  well  studied  experimentally,  for 
both  energy  differences  [16]  and  HFS  constants 
[17].  In  addition  to  reporting  results  for  /  =  f ,  we 
also  did  calculations  for  /  =  f  and  |  as  the  FIFS 
constants  (B's),  exhibit  some  substantial  second- 
order  (from  A)  contributions. 

In  Table  VIII  our  results  for  /  =  f  are  presented 
and  compared  to  experiment.  For  eight  energy 
differences,  the  average  error  is  only  0.028  eV  (227 
cm”^),  and  one  new  level  is  predicted.  The  largest 
errors  are  associated  with  the  uppermost  levels, 
which  may  suggest  the  presence  of  significant  sec¬ 
ond-order  electrostatic  effects  (see  Discussion  sec- 
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TABLE  VII _ 

Fine  and  hyperfine  structure  of  W IV, 
J  =  |(d  +  s)^  states.® 


Level 

Energy  (cm  '') 

A  (MHz) 

Config. 

SLJ 

Expl.** 

Theory^ 

Theory^ 

5d6s^ 

89111 

274.0 

(316.0) 

5d^6s 

66  789 

-863.0 

(-1018.0) 

5d^6s 

53  377 

-601.0 

(-732.0) 

5d^6s 

'•p 

48  886 

1458.0 

(1551.0) 

5d^ 

47  207 

299.0 

(282.0) 

5d^6s 

V 

32  565 

-1058.0 

(-1172.0) 

5d® 

26313 

-51.5 

(145.0) 

5d^ 

''P+^D 

19  578 

-36.4 

(158.0) 

5d® 

4p^2p 

11  113 

-68.4 

(160.0) 

5Gf® 

0 

458.0 

(309.0) 

=  183.85,  Z  =  74,  /=^,  ^,  =  0.117. 

^  Uses  same  configurational  basis  set  as  Zr  II,  J  =  f . 
^  DiraC“Fock  results  are  given  in  parentheses. 


tion).  The  average  error  for  the  six  measured  ^'s 
is  10.3%,  with  the  maximum  error  19.0%.  While 
there  are  no  sign  changes  (DF  vs.  RCI)  in  these 
bottom  six  A's,  the  top  two  of  them  show  large 
differences,  and  for  two  of  the  unmeasured  A's 
there  are  sign  changes.  For  the  six  B's,  the  average 
error  is  32%,  and  the  RCI  values  represent  at  best 
only  a  marginal  improvement  over  DF  values.  For 
the  smallest  measured  B  (  —  6.00,  we  estimate  a 
12.4%  correction  from  second-order  HFS  effects 
(see  below). 

The  average  energy  difference  errors  for  /  =  f 
and  /  =  I ,  shown  in  Table  IX,  are  0.17  and  0.13  eV, 
respectively  (1350  and  1020  cm”0.  This  is  indica¬ 
tive  that  the  basis  set  of  Table  I,  so  useful  for 
/  =  I ,  needs  improvement  for  these  J's,  Our  more 
accurate  studies  on  La  II  [9],  Nb  II  [18],  and  Cs  II 
[19]  contain  suggestions  of  what  might  be  added. 
Comparing  the  A's,  on  the  other  hand,  with  obser¬ 
vation  [17]  yields  an  average  error  of  11%  for  /  =  f 


and  14%  for  /  =  |,  which  is  comparable  to  our 
/  =  I  accuracy. 

Both  Tables  VIII  and  IX  contain  results  for  B's, 
the  electric  quadrupole  constants;  those  for  the 
5d^6s^G's  involve  large  second  order  HFS  effects 
which  we  discuss  in  the  next  section. 

ELECTRIC  OLADRIJPOLE  COrVSTANTS  FOR 
La  /,  J=|,f 

When  \B/A\  <  1  and  fine  structure  (FS)  split¬ 
tings  are  small,  second-order  contributions  from  A 
may  be  important;  experiment  measures  net  HFS, 
not  individual  poles  (M^,  £2).  Lower-order  poles 
(M^)  must  be  treated  in  high  enough  order  so 
errors  are  smaller  than  those  desired  in  the  pole  of 
interest.  Based  on  the  experimental  work  [17],  the 
four  levels  with  the  largest  discrepancies  [  vs. 
^corrected  1  ^  showu  in  Table  X.  Only  the 

last  two  are  of  interest  here,  as  we  don't  work  on 
/  =  f .  In  Table  X,  the  B^bs  values  are  deduced 
from  experiment  without  any  second-order  correc¬ 
tions  as  determined  semi-empirically  by  fitting 
experiment  [17].  This  does  not  produce  an  entirely 
satisfactory  fit  to  the  directly  observed  frequencies 
[17]. 

The  second-order  correction  may  be  expressed 
as  in  [17]  as 

where  is  the  relativistic  magnetic  dipole 

operator,  and  the  denominator  is  the  FS  difference. 
In  Ref.  [17]  this  was  initially  evaluated  at  the 
MCDF  level,  but  that  yielded  too  small  corrections. 
Here  we  show  that  the  RCI  second-order  contribu¬ 
tion  is  adequate  to  explain  observation  without 
having  to  introduce  extra  semi-empirically  deter¬ 
mined  parameters. 

In  the  spirit  of  Ref.  [20],  we  may  write  the 
numerator  as 

W;;,(MHz) 

X 
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where 

Here  fjui  is  the  magnetic  dipole  moment  [15], 
is  the  Qth  component  [Q  =  /  —  /']  of  the  relativis¬ 
tic  magnetic  dipole  component  and  {  }  is  a  6; 
symbol.  The  constant  factor  is  a  product  of  c  and  a 
factor  to  convert  atomic  units  to  megahertz.  In 
evaluating  Ajy  we  take  nonorthonormality  effects 
fully  into  account  [21].  Our  RCI  result  for 
^G7/2,  ^G9/2)  is  -504.8  MHz, 

whereas  our  MCDF  value  is  +  343.7  MHz. 

Our  RCI  results  for  the  electric  quadrupole 
constants,  using  experimental  [15]  Q's  are 
B[5d^6s^G7/2]  =  112.6  MHz  and  B[5d^6s^G9/2] 
=  108.3  MHz.  These  results  are  obtained  from  the 


Zr  II  type  bases,  with  3p  vp  excitations  added. 
These  results  are  in  good  agreement  with  the 
semi-empirical  B  (corrected)  values  [17]  which  are, 
respectively,  111.512  and  127.571  MHz.  These  val¬ 
ues  appear  in  Table  IX. 

It  is  useful  to  look  at  what  is  happening  in 
detail  which  we  do  in  Table  XL  By  columns,  these 
tables  contain  (1)  the  experimental  energy  differ¬ 
ences  (transitions  between  HFS  levels),  (2)  exper¬ 
iment  with  the  magnetic  dipole  contribution 
removed,  using  the  observed  A  [labeled  A  ~ 
8(  •  J>],  (3)  the  £2  contribution  obtained  from 

our  RCI  B  using  first-order  perturbation  theory, 
Brci,  (4)  the  second-order  contributions  using 
our  RCI  A,  labeled  -  dA^^\  and  (5)  a  final  column 
summing  columns  (3)  and  (4),  X-ph-  Experimental/ 
theoretical  discrepancies  are  obtained  by  compar¬ 
ing  columns  (2)  and  (5).  It  can  be  seen  that  the 


TABLE  V 
Fine  and  li 

III 

lyperfine  structure  of 

La  1,  J  =  I  (d  +  s)®  states. 

Level 

Energy  (cm 

A  (MHz) 

B  (MHz)'' 

Config. 

SLJ 

Expt,* 

Theory‘s 

Theory^ 

Expt.'^ 

Theory^ 

Expt." 

27680 

212.6 

36.4 

(175.1) 

(38.8) 

Sd® 

21  037 

21  614 

-64.0 

-14.1 

(81.9) 

(-1.03) 

5d® 

18  038 

18  640 

190.2 

-24.3 

(145.3) 

(-23.3) 

5d® 

V 

16735 

16  753 

-231 .8 

-17.7 

(68.1) 

(-22.8) 

5d® 

V 

12  431 

12  361 

391.8 

445.0 

-9.87 

-16.07 

(176.15) 

(-11.3) 

5d®6s 

9719 

9  726 

-530.7 

-655.0 

-24.2 

-33.54 

(-188.96) 

(-24.0) 

5d®6s 

8446 

8  692 

-422.9 

-423.0 

-4.45 

-6.00® 

(-321.39) 

(-16.2) 

5d®  6s 

4p 

7  491 

7  556 

842.6 

930.0 

25.0 

34.72 

(844.26) 

(22.9) 

5d®  6s 

"F 

2  668 

2  436 

-414.7 

-480.0 

10.7 

16.34 

(-398.55) 

(9.26) 

5d6s® 

0 

0 

129.7 

141.0 

29.9 

44.78 

(111.23) 

(31.8) 

^  Experimental  energy  Is  taken  from  Martin  et  al.  [16]. 

^  This  work.  The  values  in  parentheses  are  MCDF  values  taken  from  same  reference  as  footnote 
^  From  Childs  and  Nielsen  [17]. 
is  from  Raghavan  [15]. 

®  Estimated  second-order  HFS  contribution  ~  12.4%  (see  text). 
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TABLE  IX _ 

Fine  and  hyperfine  structure  of  I,  J  =  |,  |(c/  +  s)^  states. 


Level  Energy  (cm  A  (MHz)  B  (MHz)°^ 

Config.  SLJ  Expt.^  Theory^  Theory^  Expt.^  Theory^  Expt.^ 


5d^ 

%I2 

18449 

20  920 

5c/® 

"Gy/a 

13  529 

15  282 

5c/® 

%/2 

9  744 

10121 

5c/®  6s 

6466 

7  493 

5c/®  6s 

^7/2 

4  558 

5  659 

5c/®  6s 

%I2 

0 

0 

5c/® 

"H9/2 

14194 

15  704 

5c/® 

^9/2 

13019 

14  357 

5c/® 

4c 

^9/2 

9626 

9968 

5c/®6s 

'^912 

5798 

6694 

5c/®  6s 

4c 

^9/2 

0 

0 

50.8 

(187.0) 

28.5 

137.7 

(182.8) 

35.1 

-18.2 

(190.9) 

-19 

-19.3 

-20.9 

-253.7 

(-189.7) 

-292 

113 

112 

-170.7 

-197 

37.5 

40.3 

(81.9) 

411.3 

(507.9) 

463 

17.9 

16.1 

104 

47.2 

(125) 

69 

34.3 

(131) 

-53 

-64 

-20.6 

-27.4 

(82) 

471 

560 

108 

128 

(432) 

440 

490 

24.2 

31.5 

(451) 

®  Experimental  energy  is  taken  from  Martin  et  al.  [16]. 

^  This  work;  RCI.  The  values  in  parentheses  are  MCDF  values. 

^  From  Childs  and  Nielsen  [17].  All  6’s  except  those  for  5d^^F  are  obtained  using  second-order  HFS  effects. 


agreement  is  quite  good;  and  that  the  second-order 
effects  associated  with  can  be  an  appreciable 
fraction  of  the  first-order  £2  effects. 


Discussion 

A  cautious  error  estimate  for  unmeasured  A's 
might  be  20-25%  for  RCI  values,  except  for  those 
below  50  MHz  in  magnitude.  Errors  for  these 


might  be  50-100%.  These  estimates  are  based,  in 
part,  in  how  A  changes  as  the  calculation  pro¬ 
ceeds.  Although  we  have  not  emphasized,  in  de¬ 
tail,  the  importance  of  the  relative  positioning  of 
d”~^s  levels  in  this  work,  this  is  more  thoroughly 
explored  in  the  work  on  La  II  [9]  and  Cs  II  [19]. 

In  general,  uppermost  levels  exhibit  large  errors 
in  energy  positions  and  HFS  constants.  This  is 
because  these  levels  are  closer  in  energy  to  ''Ryd¬ 
berg'"  excited  states  than  the  lowest  levels.  Excita- 


TABLE  X _ 

Experimental  [17]  results  for  HFS  constants  [6’s]  exhibiting  significant  second-order  effects. 


Level  (cm  “ Label  A  (MHz)  e^bs  (MHz)  (MHz) 


3  010.002 

5c/®  6s 

S5t2 

300.563 

10.873 

7.800 

9183.797 

CO 

CO 

CVJ 

in 

^5/2 

876.319 

-2.772 

-5.404 

9919.821 

5c/®  6s 

G9/2 

559.812 

202.638 

127.571 

9960.904 

5c/®  6s 

G7/2 

-292.267 

67.537 

111.512 
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TABLE  XI _ 

La  I  5cf^  6s^G  electric  quadrupole  results  (MHz)  including  second-order  effects.^ 


Transition 


r  p' 

^^obs 

^^obs  ^^obs  1  *  J) 

^RCI 

^Th 

7-6 

-2016.965 

j  =  lAobs  =  -292.267  MHz 
28.904 

46.44 

-12.9 

33.5 

6-5 

-1746.669 

6.933 

11.13 

0 

11.13 

5-4 

-1468.190 

-6.855 

-11.13 

7.80 

-3.33 

4-3 

-1182.851 

-13.783 

-22.08 

11.3 

-10.78 

3-2 

-891.992 

-15.191 

-24.37 

11.4 

-13.0 

2-1 

-596.977 

-12.443 

-20.3 

9.05 

-11.3 

1-0 

”299.185 

-6.918 

-10.4 

4.95 

-5.44 

8-7 

4555.752 

J  =  |Aobs  =  559.812MHz 
77.256 

43.04 

31.6 

76.64 

7-6 

3943.948 

25.264 

14.07 

12.9 

27.0 

6-5 

3349.169 

-9.703 

-5.29 

0 

-5.29 

5-4 

2768.920 

-30.14 

-16.84 

-7.80 

-24.64 

4-3 

2200.726 

-38.522 

-21.5 

-11.3 

-32.8 

^  Entries  are  explained  in  text.  Column  2  constitutes  the  experimental  prediction  and  the  last  column  constitutes  the  theoretical 
prediction. 


tions  from  the  core  artificially  pull  all  the  (rf  +  s)” 
levels  away  from  these  Rydberg  levels,  because 
they  aren't  given  equivalent  core  correlation. 
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ABSTRACT:  Haberlen  and  Rosch  (HR)  demonstrated  [Chem.  Phys.  Lett.  199,  491 
(1992)]  the  feasibility  of  performing  scalar-relativistic,  density  functional  theory  (DFT) 
linear  combination  of  Gaussian-type  orbitals -fitting  function  (LCGTO-FF)  calculations  on 
clusters  of  atoms  using  an  ""incomplete"  Douglas-Kroll  transformation.  Some  of  the 
approximations  used  in  their  multiatom  calculations  have  not  yet  been  fully  explored  for 
isolated  atoms,  especially  the  neglect  of  matrix  elements  involving  vector  products  of  the 
momentum  and  the  use  of  fitting  functions.  In  this  investigation,  scalar-relativistic 
LCGTO  calculations  (without  fitting  functions)  were  carried  out  at  four  levels  of 
approximation,  including  that  employed  by  HR,  for  four  atoms:  Ce,  Au,  Pb,  and  Pu.  The 
results  of  these  calculations  suggest  that  the  HR  approximation  should  be  applicable  to 
atoms  through  the  light  actinides,  so  long  as  spin-orbit  effects  are  unimportant.  A 
comparison  of  very  large  basis-set  results  with  results  obtained  using  ordinary-size  basis 
sets  indicates  that  the  valence  (bonding)  states  from  scalar-relativistic  LCGTO-FF 
calculations  should  be  no  more  sensitive  to  the  orbital  basis-set  size  than  are  their 
nonrelativistic  counterparts.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  565-574, 
1997 


Introduction 


The  linear  combinations  of  Gaussian-type  or- 
bitals-fitting  function  (LCGTO-FF)  technique 
is  now  used  routinely  to  perform  all-electron,  full- 
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potential,  density  functional  theory  (DFT)  elec¬ 
tronic  structure  calculations  on  a  wide  range  of 
systems,  including  isolated  clusters  of  atoms  [1-3], 
ID  periodic  polymer  chains  [4-6],  2D  periodic 
ultrathin  films  [7-9],  and  3D  periodic  crystalline 
solids  [10-12].  This  ability  to  treat  localized 
(molecular)  and  periodically  extended  (crystalline) 
systems  on  an  equal  footing  is  one  of  the  primary 
advantages  of  the  LCGTO-FF  method  over  other 
existing  DFT  electronic  structure  techniques.  For 
example,  the  various  linearized  techniques  used  in 
solid-state  physics  (FLAPW,  FLMTO,  etc.)  have 
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not  yet  been  demonstrated  to  be  useful  tools  for 
studying  isolated  clusters  of  atoms.  Thus,  the 
LCGTO-FF  method  is  one  of  the  few  all-electron, 
full-potential  methods  that  can  be  used  to  bridge 
the  gap  between  quantum  chemistry  and  solid- 
state  physics.  To  this  extent,  the  LCGTO-FF  tech¬ 
nique  may  be  viewed  as  a  ''universal"  band-struc¬ 
ture  technique. 

One  limitation  of  the  LCGTO-FF  method  has 
been  the  difficulty  associated  with  applying  any 
fixed-basis  method  to  heavy-atom  systems,  due  to 
the  lack  of  a  stable  method  for  incorporating  rela¬ 
tivistic  corrections  into  such  calculations  [13], 
Haberlen  and  Rosch  [14-16]  (HR)  demonstrated 
the  feasibility  of  performing  scalar-relativistic 
LCGTO-FF  calculations  on  clusters  containing 
heavy  atoms  using  an  "incomplete"  Douglas- 
Kroll  [17]  (DK)  transformation  (the  HR  approx¬ 
imation  hereafter)  that  requires  only  a  minimal 
increase  in  computational  effort  relative  to  the  cor¬ 
responding  nonrelativistic  calculation.  The  HR 
scalar-relativistic  LCGTO-FF  method  (LCGTO-DF 
in  their  notation)  has  since  been  successfully  ap¬ 
plied  to  a  number  of  heavy-atom  systems  [18-26], 
This  development  is  of  great  significance  because 
it  substantially  increases  the  range  of  applicability 
for  LCGTO-FF  calculations. 

Three  aspects  of  the  HR  scalar-relativistic 
LCGTO-FF  method  have  not  yet  been  fully  ex¬ 
plored  for  isolated  atoms:  (1)  the  effect  of  neglect¬ 
ing  all  matrix  elements  involving  vector  products 
of  the  momentum;  (2)  the  extent  of  sensitivity  to 
the  orbital  basis-set  size;  and  (3)  the  effect  of  using 
charge  and  exchange- correlation  (XC)  fitting  func¬ 
tions  in  a  scalar-relativistic  calculation.  In  the  cur¬ 
rent  investigation,  scalar-relativistic  LCGTO  calcu¬ 
lations  (without  fitting  functions)  are  carried  out 
for  four  atoms  (Ce,  Au,  Pb,  and  Pu)  in  an  attempt 
to  resolve  these  issues.  In  the  next  section,  the 
scalar-relativistic  LCGTO  method  is  reviewed  and 
the  four  approximations  considered  here  are  de¬ 
fined,  including  the  HR  approximation.  Results  are 
presented  in  the  third  section. 


Scalar-Relativistic  LCGTO  DFT  Method 

The  formulation  of  scalar-relativistic  DFT  used 
by  HR  [14]  begins  with  the  four-component 
Dirac-Kohn-Sham  (DKS)  equations  [27]: 

^DfCS'/'i  =  [(cot  •  P  +  pmc^)  +  l/r,  =  6;!/^,.,  (1) 


where 

^eff  =  +  W,  +  V,,  (2) 

is  the  effective  one-electron  potential,  composed  of 
the  nuclear  potential  the  classical  electronic 
Coulomb  potential  and  the  DFT  XC  potential 
The  eigenvalues  of  the  DKS  equations  are 
unbounded,  above  and  below,  since  they  include 
both  electron  and  positron  degrees  of  freedom.  For 
this  reason,  any  attempt  to  solve  the  DKS  equa¬ 
tions  variationally  with  a  truncated  basis  set  will 
lead  to  the  so-called  variational  collapse  problem, 
unless  some  specially  designed  collection  of  basis 
functions  is  used  [13].  In  principle,  the  electron 
and  positron  degrees  of  freedom  in  the  DKS  equa¬ 
tions  can  be  decoupled  exactly  to  produce  two  sets 
of  two-component  equations  whose  eigenvalues 
are  bounded  [28].  The  resulting  second-order 
equations  are,  however,  nonlinear  in  and  are 
not  amenable  to  solution  with  a  matrix  method. 
Both  of  these  difficulties  can  be  circumvented  by 
performing  some  unitary  transformation  on  the 
DKS  equations  that  approximately  decouples  the 
electron  and  positron  degrees  of  freedom  while 
maintaining  linearity.  This  is  the  approach  fol¬ 
lowed  by  HR  [14-16]. 

REDUCTION  TO  TWO-COMPONENT  FORM 

The  approximate  decoupling  of  the  DKS  equa¬ 
tions  into  two  sets  of  two-component  equations 
begins  with  a  free-particle  Foldy-Wouthuysen  [29] 
(FW)  transformation  using  the  unitary  transforma¬ 
tion  operator 

Uo  =  A^{1  +  l3Rf),  (3) 

where 


LUy,  1  llt-U 

(4) 

”  [  2E^  ' 

Rf  =  •  p. 

(5) 

Kp  =  c/[Ep  +  mc^), 

(6) 

Ep  =  c(p^  + 

(7) 

This  transformation  yields  an  alternative  four- 
component  equation 

(8) 
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where  and  are  even  and  odd  operators  with 
respect  to  commutation  with 
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(9) 

=  /3Ap[  Rf%f  -  Ap.  (10) 

Note  that,  up  to  this  point,  no  approximations 
have  been  made  beyond  those  inherent  in  the 
original  DKS  equation. 

It  can  be  demonstrated  that  the  even  (odd) 
operators  do  not  (do)  couple  the  positive  and  neg¬ 
ative  kinetic-energy  states  of  the  free-particle  DKS 
equation  [30].  Equation  (8)  can  then  be  reduced  to 
an  approximate  two-component  form  by  simply 
dropping  the  odd  operator  to  get 

h^fpp4>i  —  (11) 


tor  W  in  momentum-space  yields 


Wp.p'  =  Ap(Rp  - 


with 


d'  = 


v/(prp') 

£  -h  E  , 


(14) 


(15) 


where  z;^yy(p,p0  is  the  momentum-space  represen¬ 
tation  of  v^jrjr  and  is  now  assumed  to  take  on  its 
two-  or  four-component  representation  as  needed. 
Applying  to  hf\  expanding  the  resulting  oper¬ 
ator  to  second-order  in  W,  and  dropping  all 
higher-order  odd  operators  yields  the  second-order 
DK  equation  [17,  31,  32]: 


where  is  the  two-component  equivalent  of 

Rf, 

Rf  =  Kpcr  •  p.  (12) 

This  approximation  is  known  as  the  free-particle 
projector  (FPP)  approximation  because  it  can  also 
be  obtained  by  surrounding  free-par¬ 

ticle  projection  operators,  thereby  forcing  the  elec¬ 
tronic  wave  functions  in  the  presence  of  to  be 
constructed  from  a  superposition  of  positive-en¬ 
ergy  free-particle  states  [30].  The  FPP  approxima¬ 
tion  should  be  reasonable  if  the  effective  potential 
is  only  weakly  dependent  on  the  position,  a  re¬ 
quirement  that  is  clearly  not  satisfied  near  a  nu¬ 
cleus. 

The  primary  shortcoming  of  the  FPP  approxi¬ 
mation  is  its  total  neglect  of  the  mixing  of  positive 
and  negative  energy  free-particle  states  induced  by 
the  presence  of  the  odd  operator  in  hf\  An 
improved  two-component  equation  can  be  ob¬ 
tained  by  performing  another  unitary  transforma¬ 
tion  on  that  approximately  eliminates  To 
this  end,  Douglas  and  Kroll  [17]  observed  that  the 
operator 

111  =  (1  +  +  W  (13) 

will  be  unitary  if  W  is  anti-Hermitian.  W  can  then 
be  selected  such  that  the  transformation  elimi¬ 
nates  from  to  first-order  in  W.  Carrying  out 
this  procedure  and  expressing  the  integral  opera- 


This  approximation  is  often  referred  to  as  the  ex¬ 
ternal-field  projector  (EFP)  approximation  because 
it  effectively  projects  out  the  electronic  solutions  to 
the  DKS  equation  for  a  particle  moving  in  the 
external-field 

SCALAR-RELATIVITY 

The  approximate  two-component  equations  pre¬ 
sented  above  [Eqs.  (11)  and  (16)]  are  fully  relativis¬ 
tic,  in  the  sense  that  they  include  mass-velocity, 
Darwin,  and  spin-orbit  coupling  corrections.  A 
scalar-relativistic  formulation  is  desired  here  to 
allow  a  direct  comparison  with  previous  work.  In 
the  FPP  equation  [Eq.  (11)],  the  spin-orbit  correc¬ 
tions  are  all  contained  in  the  term  A^R^vR^A^. 
Using  the  standard  properties  of  the  Pauli  matrices 
( O' ),  this  term  can  be  rewritten  as 

ApRjjVRpAp 

=  ApKp(p  •  +  zo*  •  p  X  t;p)Kp  Ap.  (17) 

The  term  involving  p  •  i;p  is  scalar-relativistic, 
whereas  the  term  involving  or  •  p  X  z?p  is  a 
spin-orbit  coupling  term.  Thus,  the  scalar-relativ¬ 
istic  FPP  approximation  does  not  require  matrix 
elements  of  p  X  vp. 

Separating  out  the  spin-orbit  coupling  terms  for 
the  EFP  equation  is  somewhat  more  complicated 
because  of  the  second-order  terms  in  Eq.  (16).  The 
basic  properties  of  the  external-field  correction 
terms  can  all  be  illustrated  with  the  operator  W^. 
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Since  the  momentum-space  operators  in  W  com¬ 
mute,  except  for  v’ ,  can  be  expressed  as 

W^  =  (A^R/R^A^XA/A^) 

-  ( p^K^)(  ApV'Ap) 

+  ( ApV'ApX  ApRpV'R^,  Ap) 

-(ApRpi;'RpAp)(p%2)''(ApV'i?pAp). 

(18) 

The  second  term  in  this  expression  is  scalar-relativ¬ 
istic  as  written.  The  first  and  third  terms  can  be 
easily  split  into  Darwin  and  spin-orbit  coupling 
terms  using  Eq.  (17).  The  last  term  is  more  compli¬ 
cated  because  it  behaves  like 

(A^R^v'R^A/ 

=  [^p^p(p-»'P  +  20-  •  p  X  l;'p)KpAp]^ 

(19) 

Inspection  of  the  rhs  of  Eq.  (19)  reveals  that  the 
squares  of  the  individual  factors  in  the  parentheses 
are  both  scalar-relativistic,  while  their  cross-prod¬ 
ucts  are  spin- orbit  coupling  terms.  Thus,  even  the 
scalar-relativistic  version  of  the  EFP  approxima¬ 
tion  requires  matrix  elements  of  p  X  vp.  Through¬ 
out  the  remainder  of  this  work,  it  will  be  assumed 
that  all  spin-orbit  coupling  terms  in  hflp  and 
are  neglected. 

INCOMPLETE  TRANSFORMATIONS 

The  FPP  and  EFP  equations  both  involve  the 
operators  E^,  and  K^.  Although  these  opera¬ 
tors  are  easy  to  work  with  in  momentum-space, 
analytical  evaluation  of  the  GTO  matrix  elements 
for  these  operators  has  thus  far  proven  to  be  im¬ 
practical.  This  difficulty  can  be  avoided  by  trans¬ 
forming  the  GTO  basis  set  to  an  approximate  mo¬ 
mentum-space  representation  by  diagonalizing  the 
nonrelativistic  kinetic-energy  (KE)  matrix  [33]. 
First,  the  matrix  elements  of  p  •  izp  and  p  X  izp  (if 
needed)  are  evaluated  along  with  the  usual  nonrel¬ 
ativistic  matrix  elements.  Next,  the  nonrelativistic 
KE  matrix  is  diagonalized  to  obtain  approximate 
eigenfunctions  of  and  the  matrices  are  trans¬ 
formed  to  this  approximate  momentum  space.  The 
operators  E^,  and  can  then  be  constructed 
from  the  approximate  eigenvalues.  These  basic 
components  are  used  to  form  the  complicated  ma¬ 


trices,  like  which  are  then  back- 

transformed  to  the  original  basis  set. 

The  approximate  momentum-space  representa¬ 
tion  becomes  exact  in  the  limit  of  a  complete  basis. 
Conversely,  as  the  basis  set  is  reduced,  the  approx¬ 
imate  momentum-space  representation  can  be¬ 
come  a  source  of  serious  error,  Hess  [31]  tested 
this  approximation  by  calculating  relativistic 
Hartree-Fock  total  and  one-electron  energies  for 
Br  (Z  =  35)  and  Ag  (Z  =  47)  using  fairly  large 
primitive  basis  sets  that  were  contracted  to  obtain 
a  wide  range  of  basis-set  sizes.  Hess  concluded 
that  the  results  were  sufficiently  sensitive  to  war¬ 
rant  using  the  uncontracted  basis  sets  whenever 
possible.  (Jansen  and  Hess  [32]  carried  out  similar 
tests  on  Au  [Z  =  79]  with  contracted  basis  sets 
ranging  from  13s9p6dlf  to  17sl2p8d4/,  but  only 
studied  the  total  energy  variation  in  detail)  The 
requirement  of  a  large  basis  set  to  ensure  a  good 
momentum-space  representation  conflicts  with  the 
obvious  need  to  use  rather  heavily  contracted  or¬ 
bital  basis  sets  for  heavy  atoms  to  reduce  the 
computational  demands  of  such  systems. 

HR  overcame  this  paradox  by  only  applying  the 
relativistic  transformations  associated  with  the  FPP 
and  EFP  approximations  to  the  one-electron  por¬ 
tions  of  the  DKS  equations,  i.e.,  the  kinetic  energy 
and  nuclear  potential.  These  incomplete  transfor¬ 
mations  allow  all  of  the  relativistic  matrix  ele¬ 
ments  to  be  initially  evaluated  using  the  primitive 
GTO  basis  set  and  then  transformed  to  a  much 
smaller  contracted  basis  set  prior  to  beginning  the 
iterative  SCF  cycle  [15].  The  more  numerous  two- 
electron  integrals  associated  with  then  only  need 
to  be  evaluated  for  the  smaller  contracted  basis  set. 
These  nuclear-only  versions  of  the  FPP  and  EFP 
approximations  will  be  denoted  as  the  FPPn  and 
EFPn  approximations,  respectively.  HR  further  re¬ 
duced  the  complexity  of  the  EFPn  approximation 
by  dropping  all  of  the  terms  involving  p  X  i;p;  this 
then  is  the  HR  approximation. 

COMPUTATIONAL  DETAILS 

To  test  the  approximations  discussed,  an  exist¬ 
ing  program  for  performing  nonrelativistic  LCGTO 
DFT  calculations  on  atoms,  without  the  use  of 
fitting  functions,  (ATOM  [34])  has  been  rewritten 
as  a  scalar-relativistic  code  (SATOM).  In  SATOM, 
each  term  in  the  effective  one-electron  potential 
(Vjj,  v^,  and  can  be  treated  nonrelativistically 
(NR)  or  with  one  of  three  scalar-relativistic  approx¬ 
imations,  FPP,  EFP  without  vector  product  terms. 
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or  EFP  with  vector  product  terms.  The  HR  approx¬ 
imation  then  corresponds  to  treating  and 
nonrelativistically  and  treating  with  the  EFP 
approximation  without  vector  product  terms. 

Four  atoms,  spanning  a  wide  range  of  atomic 
numbers,  were  studied  here:  Ce  (Z  =  58),  Au  (Z 
=  79),  Pb  (Z  =  82),  and  Pu  (Z  =  94).  In  each  case, 
LCGTO  results  have  been  obtained  nonrelativisti¬ 
cally  and  with  four  scalar-relativistic  approxima¬ 
tions:  full  EFP,  EFPn,  HR,  and  FPPn.  For  Ce  and 
Pb,  all  calculations  were  carried  out  using  the  Xa 
LDA  model  (a  =  0.7)  to  allow  direct  comparison 
with  previous  LCGTO-FF  and  numerical  results 
[16,  35].  For  Au  and  Pu,  the  Hedin-Lundqvist  [36] 
(HL)  model  was  used  to  allow  a  comparison  with 
numerical  results  obtained  from  a  code  written  by 
Wills  [37]. 

The  present  LCGTO  calculations  for  Ce  em¬ 
ployed  a  high-quality  21sl6j!:?ll(i9/  basis  set  de¬ 
scribed  by  Haberlen  [16].  Unfortunately,  no  com¬ 
parable  quality  basis  sets  have  been  published 
for  Au,  Pb,  and  Pu.  For  this  reason,  large 
30s24pl8dl2/  even-tempered  basis  sets  were  used 
for  the  three  larger  atoms.  The  (minimum:maxi- 
mum)  exponents  of  the  s,  p,  d,  and  /  parts  of  the 
large  basis  sets  for  Au  and  Pb  were  (0.02:1  X  10^), 


(0.01:1  X  10^),  (0.05:2  X  10^),  and  (0.1:2  X  10^),  re¬ 
spectively.  For  Pu,  the  limits  for  the  d  basis  set 
were  changed  to  (0.02:1  X  10^).  To  study  the  ef¬ 
fects  of  basis-set  size  on  the  results,  the  calcula¬ 
tions  for  Au  and  Pb  were  repeated  with  smaller 
basis  sets.  The  small  basis  used  for  Au  was  a 
20sl5plld6/  basis  derived  from  Gropen's  [38] 
19sl4pl0d5/  basis  by  adding  one  exponent  for 
each  symmetry:  s,  0.18;  p,  0.20;  d,  0.12;  and  /,  1.4. 
The  small  basis  for  Pb  was  the  20sl7plld6/  basis 
employed  by  Knappe  and  Rosch  [35]  in  their  pro¬ 
totype  scalar-relativistic  LCGTO-FF  calculations. 


Results 

The  rich  basis-set  results  for  the  NR  and  scalar- 
relativistic  one-electron  energies  and  total  energies 
for  Ce,  Au,  Pb,  and  Pu  are  compared  with  results 
from  numerical  calculations  in  Tables  I-IV.  In  each 
case,  the  NR  results  are  in  excellent  agreement, 
with  the  deepest  core  levels  differing  by  no  more 
than  0.5  eV  and  the  total  energies  differing  by  no 
more  than  0.3  Ry,  demonstrating  the  high  quality 
of  the  large  basis  sets.  A  careful  inspection  of 
Tables  I~IV  also  reveals  that,  in  each  case,  the  EFP 


TABLE  I _ 

Numerical  and  LCGTO  results  for  the  negatives  of  the  nonrelativistic  one-electron  eigenvalues  (In  eV)  and  total 
energies  (in  Ry)  are  compared  for  paramagnetic  Ce.  Scalar-relativistic  corrections  to  the  nonrelativistic 
results  are  listed  for  various  approximations.  The  numerical  results  are  from  [16].  The  LCGTO  results  were 
calculated  here  with  a  21s16p11d9f  basis.  All  calculations  used  the  Xa  model  with  a  =  0.7. 


Nonrelativistic  Scalar-relativistic  corrections 


Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

1s 

38286.20 

38285.95 

1932.93 

1866.18 

1838.04 

1835.17 

2723.95 

2s 

5956.84 

5956.84 

485.09 

477.75 

474.55 

473.61 

565.31 

3s 

1281.39 

1281.41 

105.83 

104.38 

103.83 

103.54 

121.63 

4s 

256.50 

256.52 

23.40 

23.11 

23.01 

22.92 

26.99 

5s 

36.54 

36.57 

3.45 

3.42 

3.41 

3.39 

4.04 

6s 

2.99 

3.00 

0.23 

0.22 

0.22 

0.22 

0.26 

2p 

5637.49 

5637.49 

133.70 

133.25 

130.98 

136.38 

131.86 

3p 

1142.79 

1142.81 

31.26 

31.30 

30.94 

31.95 

30.70 

4p 

204.75 

204.78 

5.51 

5.53 

5.48 

5.68 

5.29 

5p 

22.38 

22.42 

0.16 

0.19 

0.18 

0.21 

0.08 

3d 

883.62 

883.62 

-12.49 

-12.64 

-12.75 

-12.96 

-14.12 

4d 

113.76 

113.77 

-3.71 

-3.69 

-3.69 

-3.75 

-4.07 

5d 

3.08 

3.07 

-0.64 

-0.62 

-0.62 

-0.62 

-0.69 

4f 

8.58 

8.61 

-3.53 

-3.49 

-3.47 

-3.51 

-3.70 

E 

17134.59 

17134.55 

581.69 

571 .60 

568.89 

572.54 

725.03 
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TABLE  II _ 

Numerical  and  LCGTO  results  for  the  negatives  of  the  nonrelativistic  one-electron  eigenvalues  (in  eV)  and  total 
energies  (in  Ry)  are  compared  for  paramagnetic  Au.  Scalar-relativistic  corrections  to  the  nonrelativistic  results 
are  listed  for  various  approximations.  The  LCGTO  results  were  calculated  here  with  a  30s24p18cf12f  basis. 

All  calculations  used  the  HL  model. 


Nonrelativistic  Scalar-relativistic  corrections 


Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

1s 

73021 .73 

73021.79 

7617.36 

7288.75 

7200.23 

7185.95 

11173.46 

2s 

12187.55 

12187.62 

2034.09 

1993.22 

1982.11 

1977.25 

2428.50 

3s 

2852.37 

2852.44 

504.21 

495.39 

493.16 

491.68 

589.78 

4s 

600.78 

600.81 

127.28 

125.13 

124.64 

124.21 

148.83 

5s 

84.79 

84.79 

23.95 

23.54 

23.46 

23.36 

28.23 

6s 

4.51 

4.51 

1.67 

1.64 

1.63 

1.64 

2.00 

2p 

11720.50 

11720.58 

572.48 

573.25 

567.06 

604.16 

589.10 

3p 

2631.48 

2631 .58 

166.33 

167.40 

166.14 

174.19 

170.19 

4p 

505.55 

505.60 

39.34 

39.70 

39.47 

41.37 

40.23 

5p 

54.55 

54.55 

5.50 

5.59 

5.56 

5.88 

5.62 

3d 

2217.99 

2218.08 

-16.42 

-17.31 

-17.91 

-18.78 

-22.61 

4d 

330.12 

330.17 

-4.70 

-4.83 

-4.88 

-5.11 

-6.07 

5d 

8.37 

8.37 

-1.17 

-1.16 

-1.16 

-1.17 

-1.31 

4f 

94.89 

94.89 

-13.22 

-13.18 

-13.14 

-13.33 

-13.96 

E 

35720.73 

35720.65 

2280.97 

2237.68 

2229.21 

2253.88 

2944.25 

TABLE  III _ 

Numerical  and  LCGTO  results  for  the  negatives  of  the  nonrelativistic  one-electron  eigenvalues  (in  eV)  and  total 
energies  (in  Ry)  are  compared  for  paramagnetic  Pb.  Scalar-relativistic  corrections  to  the  nonrelativistic  results 
are  listed  for  various  approximations.  The  numerical  results  are  from  [35].  The  LCGTO  results  were  calculated 
here  with  a  30s24p18d12f  basis.  All  calculations  used  the  Xa  model  with  a  =  0.7. 


Nonrelativistic  Scalar-relativistic  corrections 


Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

1S 

78976.75 

78976.27 

8997.12 

8640.22 

8540.96 

8523.55 

13354.48 

2s 

13312.12 

13312.11 

2421.16 

2377.64 

2365.02 

2358.97 

2913.73 

3s 

3173.88 

3173.94 

604.04 

594.37 

591 .82 

589.92 

71 1 .34 

4s 

701 .38 

701.40 

153.40 

150.97 

150.40 

149.79 

180.62 

5s 

113.91 

113.91 

29.86 

29.34 

29.25 

29.07 

35.49 

6s 

8.93 

8.93 

2.59 

2.54 

2.52 

2.50 

3.17 

2p 

12824.30 

12824.31 

668.90 

677.78 

670.84 

717.56 

699.65 

3p 

2940.72 

2940.82 

195.58 

198.37 

196.93 

207.12 

202.18 

4p 

598.97 

599.03 

45.18 

46.29 

46.02 

48.44 

46.86 

5p 

79.45 

79.45 

5.80 

5.97 

5.94 

6.35 

5.86 

6p 

3.13 

3.13 

-0.15 

-0.14 

-0.14 

-0.13 

-0.20 

3d 

2503.90 

2503.99 

-20.25 

-21.07 

-21.71 

-22.82 

-27.58 

4d 

409.46 

409.51 

-7.67 

-7.79 

-7.86 

-8.20 

-9.57 

5d 

23.86 

23.86 

-3.27 

-3.27 

-3.26 

-3.33 

-3.66 

4f 

152.37 

152.36 

-18.88 

-18.91 

-18.85 

-19.18 

-20.16 

E 

39055.40 

39055.29 

2705.40 

2653.31 

2643.97 

2675.09 

3513.16 
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TABLE  IV _ 

Numerical  and  LCGTO  results  for  the  negatives  of  the  nonrelativistic  one-electron  eigenvalues  (in  eV)  and  total 
energies  (in  Ry)  are  compared  for  paramagnetic  Pu.  Scaiar-relativistic  corrections  to  the  nonrelativistic  results 
are  listed  for  various  approximations.  The  LCGTO  results  were  calculated  here  with  a  30s24p18c/12f  basis. 

Ail  calculations  used  the  HL  model. 


Nonrelativistic  Scalar-relativistic  corrections 


Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

1s 

104982.08 

104982.29 

17168.80 

16343.22 

16165.23 

16128.81 

26240.88 

2s 

18284.96 

18285.20 

4725.81 

4615.31 

4591 .57 

4578.44 

5820.87 

3s 

4635.88 

4636.15 

1212.89 

1187.79 

1182.78 

1178.49 

1456.94 

4s 

1177.46 

1177.62 

329.99 

323.28 

322.05 

320.59 

395.44 

5s 

253.73 

253.77 

83.30 

81.56 

81.27 

80.82 

100.07 

6s 

35.23 

35.24 

14.02 

13.65 

13.60 

13.50 

17.10 

7s 

3.42 

3.42 

0.70 

0.67 

0.67 

0.66 

0.86 

2p 

17707.84 

17708.08 

1242.85 

1257.35 

1247.14 

1356.24 

1322.06 

3p 

4351.35 

4351 .64 

378.01 

383.73 

381 .35 

405.97 

396.19 

4p 

1045.15 

1045.35 

97.28 

98.98 

98.47 

104.78 

101.40 

5p 

201 .82 

201 .88 

20.77 

21.19 

21.09 

22.58 

21.44 

6p 

21.20 

21.20 

1.37 

1.43 

1.42 

1.63 

1.22 

3d 

3818.07 

3818.34 

-22.69 

-25.11 

-26.25 

-28.45 

-37.85 

4d 

797.72 

797.90 

-9.62 

-10.11 

-10.27 

-11.03 

-14.03 

5d 

110.79 

110.82 

-4.91 

-5.01 

-5.02 

-5.21 

-6.10 

4f 

453.88 

453.95 

-37.42 

-37.54 

-37.49 

-38.32 

-40.67 

8.43 

8.41 

-6.20 

-6.22 

-6.20 

-6.29 

-6.77 

E 

54004.24 

54003.98 

5106.93 

4999.42 

4983.10 

5056.02 

6828.79 

approximation  accounts  for  roughly  98%  of  the 
scalar-relativistic  correction  to  the  total  energy  and 
produces  corrections  to  the  core-state  energies  that 
are  within  5%  of  the  numerical  results,  with  the 
exception  of  the  Pu  3d  energy.  More  importantly, 
the  EFP  approximation  yields  energies  for  the  all- 
important  valence  states  that  are  within  a  few 
hundredths  of  an  eV  of  the  values  found  in  the 
numerical  calculations.  Using  the  simpler  EFPn 
approximation  in  place  of  the  EFP  approximation 
produces  a  modest  deterioration  in  the  one-elec¬ 
tron  results  that  is  concentrated  in  the  chemically 
inert  core  states.  On  the  other  hand,  the  FPPn 
approximation  is  clearly  inadequate  for  all  four  of 
the  atoms  considered.  These  observations  are  all  in 
agreement  with  the  results  of  earlier  work  on  Ce 
[16]  and  Pb  [35]. 

Analysis  of  the  results  obtained  with  the  HR 
approximation  is  not  so  straightforward.  For  sim¬ 
plicity,  focus  on  the  Pu  results  in  Table  IV,  since 
they  exhibit  the  trends  most  clearly.  The  HR  ap¬ 
proximation  has  only  a  small  effect  on  the  rela¬ 
tivistic  corrections  to  the  total  energy  and  the  s 
and  /  one-electron  energies.  (In  fact,  the  shift  in 
the  total  energy  due  to  the  HR  approximation 


cancels  some  of  the  error  already  existing  in  the 
more  rigorous  EFP  approximation.)  For  the  d-state 
corrections,  the  HR  approximation  results  in  sig¬ 
nificantly  larger  percentage  errors  than  does  the 
EFPn  approximation,  but  those  errors  represent 
only  a  small  fraction  of  the  orbital  energies  them¬ 
selves.  The  most  serious  evidence  for  a  potential 
problem  with  the  HR  approximation  is  in  the  rela¬ 
tivistic  corrections  for  the  p  states,  for  which  the 
HR  approximation  gives  even  poorer  results  than 
does  the  simple  FPPn  approximation.  To  under¬ 
stand  this,  consider  the  vector  product  terms  that 
are  neglected  in  the  HR  approximation.  For  a 
spherical  potential, 

i  dv 

V  X  vp  =  --—L.  (20) 

r  dr 

The  radial  factors  in  Eq.  (20)  blow  up  near  the 
origin  and,  hence,  will  primarily  affect  states  with 
small  I  values.  At  the  same  time,  L  will  be  zero  for 
s  states.  Thus,  the  vector  product  terms,  and  the 
HR  approximation,  will  primarily  affect  the  p 
states,  as  observed.  The  HR  approximation  does 
not  appear  to  have  a  significant  effect  on  any  of 
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the  chemically  active  states  for  the  four  atoms 
under  consideration.  Given  the  similarity  between 
the  vector  product  terms  being  dropped  in  the  HR 
approximation  and  the  lowest-order  spin-orbit 
coupling  term  in  Eq.  (17),  the  HR  approximation 
should  be  valid  for  any  system  in  which  spin-orbit 
coupling  is  unimportant.  At  the  same  time,  if 
spin-orbit  coupling  is  required  for  a  given  system, 
there  will  be  no  advantage  to  using  the  HR  ap¬ 
proximation  anyway. 

The  small  basis  set  results  for  Au  and  Pb  are 
given  in  Tables  V  and  VI.  Comparison  of  the  large 
and  small  basis-set  results  for  these  two  atoms 
reveals  that  the  change  in  the  basis  set  has  a 
noticeable  impact  on  the  NR  one-electron  energies; 
note,  in  particular,  the  energy  of  the  5p  state  for 
Au,  which  is  shifted  by  about  0.4  eV.  It  is  also  clear 
that  the  relativistic  corrections  to  the  total  energy 
and  the  one-electron  eigenvalues  of  the  core  states 
are  sensitive  to  the  basis-set  size.  Nevertheless,  for 
the  chemically  active  valence  states,  there  is  rather 
good  agreement  between  the  relativistic  correc¬ 
tions  obtained  with  the  large  and  small  basis  sets. 
This  suggests  that  the  physical  properties  obtained 
with  scalar-relativistic  LCGTO  calculations  should 


be  no  more  sensitive  to  the  orbital  basis  set  size 
than  are  their  NR  counterparts. 

In  Table  VII,  the  current  small  basis  LCGTO 
results  for  Pb  are  compared  with  previous 
LCGTO-FF  calculations  using  the  same  orbital  ba¬ 
sis  set  and  LDA  model  [35].  The  LCGTO-FF  work 
did  not  consider  the  full  EFP  approximation  used 
here  or  the  HR  approximation;  hence,  results  are 
only  compared  for  the  NR,  EFPn,  and  FPPn  ap¬ 
proximations.  There  is  very  good  agreement  be¬ 
tween  the  NR  results,  indicating  that  a  good  fitting 
function  basis  set  was  used  in  the  earlier  work. 
The  differences  between  the  scalar-relativistic 
LCGTO  and  LCGTO-FF  results  are  significantly 
larger.  Those  differences  are,  however,  once  again 
concentrated  in  the  chemically  inert  core  states  and 
should  not  pose  any  difficulties  for  multiatom  cal¬ 
culations. 

The  three  questions  posed  in  the  Introduction 
can  now  be  answered  as  follows: 

1.  The  neglect  of  matrix  elements  involving 
vector  products  of  the  momentum  should  not 
present  any  problems  for  systems  in  which 
the  spin-orbit  coupling  is  unimportant. 


TABLE  V _ 

Numerical  and  LCGTO  results  for  the  negatives  of  the  nonrelativistic  one-electron  eigenvalues  (in  eV)  and  total 
energies  (in  Ry)  are  compared  for  paramagnetic  Au.  Scalar-relativistic  corrections  to  the  nonrelativistic  results 
are  listed  for  various  approximations.  The  LCGTO  results  were  calculated  here  with  a  20s15p11d6f  basis.  All 
calculations  used  the  HL  model. 


Nonrelativistic  Scalar-relativistic  corrections 


Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

1s 

73021 .73 

73022.30 

7617.36 

7281.25 

7192.75 

7178.54 

11117.51 

2s 

12187.55 

12188.15 

2034.09 

1991.51 

1980.41 

1975.56 

2419.90 

3s 

2852.37 

2852.98 

504.21 

494.85 

492.62 

491.15 

587.68 

4s 

600.78 

601.40 

127.28 

124.87 

124.38 

123.95 

148.15 

5S 

84.79 

85.26 

23.95 

23.49 

23.41 

23.31 

28.10 

6s 

4.51 

4.53 

1.67 

1.65 

1.64 

1.64 

2.00 

2p 

11720.50 

11721.21 

572.48 

573.21 

567.02 

603.99 

589.17 

3p 

2631 .48 

2632.12 

166.33 

167.29 

166.03 

174.04 

170.10 

4p 

505.55 

506.17 

39.34 

39.57 

39.34 

41.23 

40,09 

5p 

54.55 

54.95 

5.50 

5.54 

5.51 

5.83 

5.56 

3d 

2217.99 

2218.55 

-16.42 

-17.47 

-18.02 

-18.89 

-22.68 

4d 

330.12 

330.54 

-4.70 

-4.94 

-4.99 

-5.22 

-6.19 

5d 

8.37 

8.39 

-1.17 

-1.19 

-1.18 

-1.19 

-1.33 

4f 

94.89 

95.19 

-13.22 

-13.31 

-13.26 

-13.46 

-14.10 

E 

35720.73 

35719.90 

2280.97 

2236.13 

2227.66 

2252.24 

2933.86 
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TABLE  VI _ 

Numerical  and  LCGTO  results  for  the  negatives  of  the  nonrelativistic  one-electron  eigenvalues  (in  eV)  and  total 
energies  (in  Ry)  are  compared  for  paramagnetic  Pb.  Scalar-relativistic  corrections  to  the  nonrelativistic  results 
are  listed  for  various  approximations.  The  numerical  results  are  from  [35].  The  LCGTO  results  were  calculated 
here  with  a  20s17p1 1d6f  basis.  All  calculations  used  the  Xa  model  with  ol  =  0.7. 


Nonrelativistic  Scalar-relativistic  corrections 


Num 

LCGTO 

Num 

EFP 

EFPn 

HR 

FPPn 

1s 

78976.75 

78976.79 

8997.12 

8631 .51 

8532.30 

8515.90 

13281.58 

2s 

13312.12 

13312.67 

2421.16 

2376.22 

2363.60 

2357.86 

2903.21 

3s 

3173.88 

3174.49 

604.04 

593.93 

591.38 

589.58 

708.83 

4s 

701 .38 

701.85 

153.40 

150.86 

150.29 

149.71 

179.97 

5s 

113.91 

114.25 

29.86 

29.37 

29.27 

29.10 

35.42 

6s 

8.93 

8.97 

2.59 

2.54 

2.54 

2.50 

3.17 

2p 

12824.30 

12824.82 

668.90 

675.69 

668.75 

713.45 

695.88 

3p 

2940.72 

2941 .33 

195.58 

197.84 

196.40 

206.07 

201 .22 

4p 

598.97 

599.44 

45.18 

46.17 

45.89 

48.18 

46.63 

5p 

79.45 

79.76 

5.80 

5.96 

5.93 

6,32 

5.84 

6p 

3.13 

3.12 

-0.15 

-0.14 

-0.14 

-0.13 

-0.20 

3d 

2503.90 

2504.49 

-20.25 

-20.86 

-21.50 

-22.54 

-27.20 

4d 

409.46 

409.79 

-7.67 

-7.74 

-7.80 

-8.12 

-9.47 

5d 

23.86 

23.92 

-3.27 

-3.23 

-3.23 

-3.29 

-3.61 

4f 

152.37 

152.61 

-18.88 

-18.88 

-18.83 

-19.13 

-20.11 

E 

39055.40 

39054.66 

2705.40 

2649.61 

2640.28 

2670.00 

3496.23 

TABLE  VII _ 

LCGTO  and  LCGTO-FF  results  for  the  negatives  of  the  nonrelativistic  one-electron  eigenvalues  (in  eV)  and  total 
energies  (in  Ry)  are  compared  for  paramagnetic  Pb.  Scalar-relativistic  corrections  to  the  nonrelativistic  results 
are  listed  for  two  approximations.  The  LCGTO-FF  results  are  from  [35]  and  are  indicated  by  FF.  All 
calculations  used  the  20s17p11cf6f  basis  and  the  Xa  model  with  a  =  0.7. 


Nonrelativistic  Scalar-relativistic  corrections 


LCGTO 

LCGTO-FF 

EFPn 

EFPn-FF 

FPPn 

FPPn-FF 

Is 

78976.79 

78977.31 

8532.30 

8531.04 

13281.58 

13279.32 

2s 

13312.67 

13312.75 

2363.60 

2363.22 

2903.21 

2902.52 

3s 

3174.49 

3174.50 

591 .38 

591 .27 

708.83 

708.62 

4s 

701 .85 

701 .84 

150.29 

150.26 

179.97 

179.91 

5s 

114.25 

114.25 

29.27 

29.26 

35.42 

35.39 

6s 

8.97 

8.97 

2.54 

2.53 

3.17 

3.16 

2p 

12824,82 

12825.03 

668.75 

671 .07 

695.88 

700.14 

3p 

2941.33 

2941 .38 

196.40 

197.03 

201.22 

202.36 

4p 

599.44 

599.45 

45.89 

46.05 

46.63 

46.91 

5p 

79.76 

79.75 

5.93 

5.87 

5.84 

5.90 

6p 

3.12 

3.12 

-0.14 

-0.15 

-0.20 

-0.20 

3d 

2504.49 

2504.50 

-21.50 

-21.60 

-27.20 

-27.38 

4d 

409.79 

409.79 

-7.80 

-7.83 

-9.47 

-9.52 

5d 

23.92 

23.92 

-3.23 

-3.24 

-3.61 

-3.62 

4f 

152.61 

152.61 

-18.83 

”18.85 

-20.11 

-20.14 

E 

39054.66 

39054.74 

2640.28 

2641 .92 

3496.23 

3499.20 
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2.  The  valence  (bonding)  states  from  scalar-rela¬ 
tivistic  LCGTO  calculations  should  be  no 
more  sensitive  to  orbital  basis  set  size  than 
are  their  NR  counterparts. 

3.  Although  the  use  of  fitting  functions  has  a 
somewhat  larger  impact  on  scalar-relativistic 
calculations  than  on  NR  calculations,  this  ef¬ 
fect  is  confined  to  the  deepest  core  levels  and 
should  not  affect  the  properties  that  are  of 
interest  in  most  multiatom  calculations. 

ACKNOWLEDGME]\TS 

Helpful  communications  with  N.  Rosch  are 
gratefully  acknowledged. }.  M.  Wills  is  thanked  for 
providing  a  copy  of  his  numerical  atom  code.  This 
work  was  supported  by  the  U.S.  Department  of 
Energy. 


References 

1.  B.  I.  Dunlap,  J.  W.  D.  Connolly,  and  J.  R.  Sabin,  J.  Chem. 
Phys.  71,  3396  (1979);  Ibid.,  J.  Chem.  Phys.  71,  4493  (1979). 

2.  B.  I.  Dunlap  and  N.  Rosch,  Adv.  Quantum  Chem.  21,  317 
(1990). 

3.  J.  L.  Ballester  and  B.  I.  Dunlap,  Phys.  Rev.  A  45,  7985  (1992). 

4.  J.  W.  Mintmire  and  C.  T.  White,  Phys.  Rev.  Lett.  50,  101 
(1983);  Ibid.,  Phys.  Rev.  B  27,  1447  (1983). 

5.  J.  W.  Mintmire  and  C.  T.  White,  Phys.  Rev.  B  28,  3283 
(1983). 

6.  J.  W.  Mintmire,  Phys.  Rev.  B  39,  13350  (1989). 

7.  J.  W.  Mintmire,  J.  R.  Sabin,  and  S.  B.  Trickey,  Phys.  Rev.  B 
26,  1743  (1982). 

8.  J.  C.  Boettger,  Internat.  J.  Quantum  Chem.  Symp.  27,  147 
(1993);  also  see  J.  C.  Boettger  and  S.  B.  Trickey,  Phys.  Rev.  B 
32,  1356  (1985). 

9.  U.  Birkenheuer,  J.  C.  Boettger,  and  N.  Rosch,  J.  Chem.  Phys. 
100,  6826  (1994). 

10.  J.  C.  Boettger,  Int.  J.  Quantum  Chem.  Symp.  29,  197  (1995). 

11.  J.  C.  Boettger  and  S.  B.  Trickey,  Phys.  Rev.  B  51,  15623 
(1995). 


12.  J.  C.  Boettger,  Phys.  Rev.  B  55,  750  (1997). 

13.  W.  Kutzelnigg,  Int.  J.  Quantum  Chem.  25,  107  (1984). 

14.  N.  Rosch  and  O.  D.  Haberlen,  J.  Chem.  Phys.  96,  6322 
(1992). 

15.  O.  D.  Haberlen  and  N.  Rosch,  Chem.  Phys.  Lett.  199,  491 
(1992). 

16.  O.  D.  Haberlen,  PhD  Thesis  (Technische  Universitat,  Mu¬ 
nich,  1993). 

17.  M.  Douglas  and  N.  M.  Kroll,  Ann.  Phys.  82,  89  (1974). 

18.  O.  D.  Haberlen,  S.  Chung,  and  N.  Rosch,  Int.  J.  Quantum 
Chem.  Symp.  28,  595  (1994). 

19.  O.  D.  Haberlen,  H.  Schmidbaur,  and  N.  Rosch,  J.  Am. 
Chem.  Soc.  116,  8241  (1994). 

20.  S.  Chung,  S.  Kruger,  G.  Pacchioni,  and  N.  Rosch,  J.  Chem. 
Phys.  102,  3695  (1995). 

21.  S.  Chung,  S.  Kruger,  S.  P.  Ruzankin,  G.  Pacchioni,  and  N. 
Rosch,  Chem.  Phys.  Lett.  248,  109  (1996). 

22.  A.  L.  Yakovlev,  K.  M.  Neyman,  G.  M.  Zhidomirov,  and  N. 
Rosch,  J.  Phys.  Chem.  100,  3482  (1996). 

23.  V.  A.  Nasluzov  and  N.  Rosch,  Chem.  Phys.  210,  413  (1996). 

24.  S,  Chung,  S.  Kruger,  H.  Schmidbaur,  and  N.  Rosch,  Inorg. 
Chem.  35,  5387  (1996). 

25.  U.  Heiz,  A.  Vayloyan,  E.  Schumacher,  C.  Yeretzian,  M. 
Stener,  P.  Gisdakis,  and  N.  Rosch,  ].  Chem.  Phys.  105,  5574 
(1996). 

26.  M.  Mayer,  O.  D.  Haberlen,  and  N.  Rosch,  Phys.  Rev.  A  54, 
4775  (1996). 

27.  M,  V.  Ramana  and  A.  K.  Rajagopal,  Adv.  Chem.  Phys.  54, 
231  (1983). 

28.  See,  e.g.,  Eq.  (3)  of  K.  G.  Dyall,  f.  Chem.  Phys.  100,  2118 
(1994). 

29.  L.  L.  Foldy  and  S.  A.  Wouthuysen,  Phys.  Rev.  78,  29  (1950). 

30.  I.  Sucher,  Phys.  Rev.  A  22,  348  (1980). 

31.  B.  A.  Hess,  Phys.  Rev.  A  33,  3742  (1986). 

32.  G.  Jansen  and  B.  A,  Hess,  Phys.  Rev.  A  39,  6016  (1989). 

33.  B.  A.  Hess,  R.  J.  Buenker,  and  P.  Chandra,  Int.  J.  Quantum 
Chem.  29,  737  (1986). 

34.  J.  C.  Boettger,  unpublished. 

35.  P.  Knappe  and  N.  Rosch,  J.  Chem.  Phys.  92,  1153  (1990). 

36.  L.  Hedin  and  B.  1.  Lundqvist,  J.  Phys.  C  4,  2064  (1971). 

37.  J.  M.  Wills,  unpublished, 

38.  O.  Gropen,  J.  Comp.  Chem.  8,  982  (1987). 


574 


VOL  65,  NO.  5 


Correlated  Relativistic  Caicuiation  of  the 
Giant  Resonance  in  the 
Absorption  Spectrum 


S.  KOTOCHIGOVA/  H.  LEVINE/  I.  TUPITSYN^' 

^  National  Institute  of  Standards  and  Technology,  Gaithersburg,  Maryland  20899 
^  Chemistry  Institute,  St.  Petersburg  University,  198  904  Stary  Peterhojf,  St.  Petersburg,  Russia 

Received  29  March  1997;  revised  18  June  1997;  accepted  19  June  1997 


ABSTRACT:  The  aim  of  the  present  study  is  to  extend  the  multiconfiguration 
Dirac-Fock  approach  to  include  spin  polarization  in  order  to  study  magnetically  ordered 
systems.  Routines  handling  time-dependent  ionization  phenomena  adapted  to  multilevel 
and  multichannel  autoionization  processes  have  been  developed  and  used  to  describe  the 
giant  resonance  in  the  Gd^'*'  absorption  spectrum.  In  our  calculation  of  the  complete 
energy  structure  of  4d^4/^  configuration  it  is  discovered  that  the  majority  of  the 
oscillator  strength  associated  with  the  4t?-4/  transitions  is  limited  to  a  relatively  narrow 
spectral  width,  creating  a  giant  resonance.  Our  calculations  take  account  of  the  most 
important  correlation  effects,  and  the  agreement  with  the  experimentally  observed  giant 
resonance  in  gadolinium  is  quite  good.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65: 
575-584,  1997 


In  this  study,  we  demonstrate  the  applicability 
of  the  spin-polarized  multiconfiguration  Di¬ 
rac-Fock  (SPMCDF)  formalism  for  an  accurate  de¬ 
scription  of  magnetic  dichroism  in  rare-earth 
atoms.  The  motivation  for  applying  the  SPMCDF 
method  is  our  desire  to  use  the  exact  form  of  the 
exchange  potential  essential  for  a  correct  calcula¬ 
tion  of  the  core  spin  polarization  and  ferromag¬ 
netism  in  general.  As  far  as  we  know,  our  work  is 
the  first  attempt  to  apply  the  Dirac-Fock  formal¬ 
ism  in  combination  with  multiconfiguration  effects 
[1]  to  describe  magnetic  dichroism  on  an  atomic 
level.  A  multiconfiguration  description  enables  one 

Correspondence  to:  S.  Kotochigova. 

International  Journal  of  Quantum  Chemistry,  Vol.  65,  575-584  (1997) 
©  1997  John  Wiley  &  Sons,  Inc. 


to  introduce  the  multiple  structure  of  relativistic 
terms  with  different  total  angular  momenta.  More¬ 
over,  this  approach  is  designed  to  treat  spin  polar¬ 
ization  and  relativistic  effects  on  the  same  footing 
by  including  the  exchange  potential  dependence 
on  the  orientation  of  the  electron  spin.  Magnetic 
dichroism  in  its  circular  and  linear  forms  is  caused 
by  the  reduced  symmetry,  compared  to  the  para¬ 
magnetic  state,  due  to  the  simultaneous  effects  of 
spin-orbit  and  spin  polarization. 

Besides  being  valuable  in  the  study  of  ferro¬ 
magnetic  materials,  magnetic  dichroism  also  gives 
spectroscopic  information  about  the  electronic 
structure  of  free  atoms  and  ions  embedded  in  a 


CCC  0020-7608  /  97  /  050575-1 0 


KOTOCHIGOVA,  LEVINE,  AND  TUPITSYN 


crystal.  We  have  found  that  it  is  instructive  to 
begin  the  study  of  magnetic  dichroism  assuming  a 
free  atom  or  ion.  Moreover,  there  is  experimental 
evidence  that  the  total  absorption  cross  section  of 
rare-earth  metals  for  photon  energies  near  the  Ad 
thresholds  [2]  is  nearly  identical  with  the  corre¬ 
sponding  quantity  measured  in  the  gas  phase  [3]. 
This  motivates  one  to  consider  magnetic  dichroism 
in  an  atomic  model. 

A  number  of  relativistic  theories  have  been  de¬ 
veloped  recently  to  describe  magnetic  dichroism  in 
magnetic  solids  [4-8]  using  the  local  density  ap¬ 
proximation.  In  particular,  the  L2  3  edges  of  solid 
Gd  have  been  studied  within  a  multiple  scattering 
formalism  which  includes  finite  lifetime  effects  for 
the  excited  states  [9]. 

Here  we  consider  a  free  Gd^"^  ion,  which  has 
been  obtained  by  removing  the  three  valence  elec¬ 
trons  (5d6s^)  from  the  Gd  atom.  The  remaining 
ion  has  a  high-spin  4/^  state  as  the  ground  config¬ 
uration.  Spin  polarization  of  the  4/  orbital  and  as 
well  as  a  large  spin-orbit  splitting  of  the  4^3/2  and 
4^5/2  core  hole  states  are  essential  ingredients  for 
predicting  strong  magnetic  dichroism  in  the  Gd^'^ 
spectrum.  We  wish  to  describe  theoretically  the 
transitions  in  Gd^^  from  the  ground  Ad^^Af'^^Sy/2 
to  the  upper  Ad^Af^  states.  Experiment  [10,  11] 
suggests  that  these  transitions  create  the  key  fea¬ 
tures  in  the  rare-earth  atomic  spectra:  giant  reso¬ 
nances.  We  will  show  that  indeed  the  Ad~Af  tran¬ 
sitions  of  core  electrons  in  the  Gd^"^  ion  display 
resonance  profiles  similar  to  those  obtained  experi¬ 
mentally.  Hence,  for  both  rare-earth  atoms  and 
metals  the  strong  overlap  of  the  4/  and  Ad  orbitals 
makes  the  giant  resonances  an  important  proving 
ground  for  many-electron  effects. 

A  decade  ago  considerable  progress  was  made 
toward  understanding  the  essence  of  giant  reso¬ 
nances  in  absorption  spectra  of  rare-earth  ele¬ 
ments.  A  compilation  of  review  articles  can  be 
found  [12].  In  many  cases,  relativistic  time-depen¬ 
dent  local-density  approximation  (RTDLDA)  [13] 
as  well  as  random-phase  approximation  with  ex¬ 
change  (RPAE)  [14]  and  many-body  perturbation 
theory  (MBPT)  [15]  were  applied  successfully. 

We  focus  particular  attention  on  reproducing 
the  Gd^"^  total  cross  section  in  the  140-  to  180-eV 
energy  range.  We  want  to  demonstrate  the  forma¬ 
tion  of  the  giant  resonance  from  the  series  of  asym¬ 
metric  absorption  lines,  all  determined  by 
Ad^^Af'^-Ad^Af^  excitation,  or  otherwise  mainly  by 


Ad'^Ap-Ad'^Af'^sf, Ad^^Af^el  autoionizing  decays. 
The  SPMCDF  matrix  elements  and  the  resonance 
energies  are  used  to  solve  time-dependent  Floquet 
equations  simulating  the  Gd^"^  absorption  cross 
section.  This  approach  is  a  generalization  of  the 
theories  of  Fano  [16]  and  Mies  [17]. 

The  study  is  organized  as  follows.  In  the  first 
section  a  description  of  the  SPMCDF  theory  is 
given.  For  the  sake  of  completeness,  a  brief  deriva¬ 
tion  of  the  relativistic  formalism  used  to  calculate 
autoionization  and  photoionization  transition  ma¬ 
trix  elements  is  presented  in  the  second  section. 
The  formalism  of  the  time-dependent  Floquet  the¬ 
ory  describing  the  appearance  of  the  Fano-type 
resonances  in  absorption  spectra  is  given  in  the 
third  section.  The  absorption  cross  section  and  for 
circular  and  linear  magnetic  dichroism  in  Gd^^ 
spectrum  are  presented  in  the  fourth  section  to¬ 
gether  with  a  comparison  with  experimental  re¬ 
sults  [11]. 


Spin-Polarized  Dirac-Fock  Theory 

The  theory  is  modeled  on  the  basis  of  the 
Dirac-Fock  approximation  with  an  exact  descrip¬ 
tion  of  the  exchange  potential.  We  have  con¬ 
structed  the  central-field  spin-polarized  Dirac-Fock 
(SPDF)  equations  by  selecting  all  relativistic  atomic 
orbitals  in  two  groups  with  opposite  directions  of 
electron  spin.  The  total  energies  have  been  taken 
as  the  center  of  gravity  of  relativistic  terms.  Aver¬ 
aging  energies  helps  to  break  the  //  coupling  and 
partly  implement  the  LS  coupling.  We  also  use  the 
fractional  occupation  of  open  relativistic  subor¬ 
bitals,  so  that  the  SPDF  solutions  can  be  reduced 
to  the  spin-polarized  Hartree-Fock  solutions  in  the 
limit  of  c  ^  00. 

In  the  spin-polarized  relativistic  approximation, 
the  radial  parts  P(r)  and  Q(r)  of  wave  function 
become  dependent  on  both  the  projections  of  the 
total  angular  momentum  (with  quantum  num¬ 
ber  m)  and  the  electron  spin  /x.  The  solutions  of 
these  equations  can  be  identified  by  the  mixture  of 
the  quantum  numbers  m  and  /x,  so  it  is  hard  to 
distinguish  solutions  related  to  a  particular  spin 
projection.  To  avoid  construction  of  a  large  num¬ 
ber  of  equations  and  to  focus  more  on  the  spin- 
polarization  effects,  we  assumed  the  radial  func¬ 
tions  P(r)  and  Q(r)  to  be  independent  of  the 
quantum  number  m.  This  is  justified  because  the 
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m  splitting  is  much  smaller  than  that  of  spin  polar- 
ization. 

The  spin-polarization  interaction  mixes  states 
with  different  total  angular  momenta  j  and  creates 
new  states,  described  by  the  linear  combinations  of 
ji  =  l  ^  \  and  j2  =  I  -  \  [20]  for  a  given  jjl.  We 
obtain  from  two  relativistic  orbitals,  and  j2,  two 
linear  combinations  by  a  unitary  transformation. 
Coefficients  in  the  linear  combinations  are  found 
from  the  minimization  of  the  total  energy. 

In  this  approximation,  the  atomic  wave  function 
ipir)  is  a  four-component  Dirac  spinor 

^  r\iQA(r)x-K.m^i')  +  *QA('')^K+i,m(r) ) 

(1) 


with  A  =  (k,  jx),  and  A  =  (  -  k  —  1,  /a);  Xk, ^ 
two-component  spin-angular  function  (i.e.,  a  Pauli 
spinor).  and  are  the  large  and  small  compo¬ 
nents  of  the  wave  function,  respectively. 

The  quantum  numbers  k  (with  k  =  ±  1,  ± 
2, . . . )  are  related  to  the  orbital  and  total  angular 
momentum  quantum  numbers  I  and  j  by 

i  -\k\-  /  =  ;■  +  Sj2,  =  k/\k\.  (2) 


The  set  =  1, 2, 3, . . .  is  sometimes  denoted  as 
Pi/2/ ^3/2//5/2/ •  •  •  -1, -2,  ~3, ...  corre¬ 

sponds  to  Si/2/  Vz/lf  *  *  * 

The  trial  functions  are  then  constructed  from 
the  wave  functions  ipix)  so  that  they  belong  to  a 
given  configuration  “•  (n^/^ The 

former  of  these  means  that  the  set  of  one-electron 
functions  (1)  is  made  up  of  functions  belonging  to 
the  spin-polarized  subshells,  where 
numbers  of  electrons  with  a  specified  direction  of 
spin,  belonging  to  the  subshells  1, . . . , 

Once  the  class  of  functions  to  be  varied  is  cho¬ 
sen,  the  subsequent  calculations  are  performed  in 
two  stages.  In  the  first  stage,  /^-dependent  expres¬ 
sions  for  the  total  energy  are  constructed  for  the 
centers  of  gravity  of  relativistic  configurations.  The 
second  stage  of  the  calculation  involves  a  search 
for  the  extremals  of  the  energy  functional,  which 
leads  to  a  system  of  coupled  differential  equations 


dr 


dr  r  ^  \2c  cr  )  ^  r  ^ 


dr 


•K  -  1 


z-y. 


cr 


Qa 


y 


dr 


K  —  1 


-Qa-|^  + 


^A  Z  -  y. 


cr 


- -XP 


(3) 


where  we  have 


^A  “  ^AA'^XvA' 

A' 

-h  +  8^^)  Y,  “^AA' 

A  s>0  ^A 

X/  -  i  L  L  -  r  L 


^  X*  A  s  ‘?A 


A'^.A 


2cqA 


1 


gAA 


^AA 


XQ  =  _  £  +  r  E 


A'#  A  s  ^A 


A'#  A 


2c(?a 


yAA-rf^[P^P^  +  QMdr', 

J  y 


(4) 


and  Z  is  the  nuclear  charge.  Here,  atomic  units  are 
used,  i.e.,  h,  Planck's  constant,  the  electron  mass, 
the  charge  are  numerically  equal  to  1.  Indices  A 
and  A  label  different  atomic  subshells  in  the  SPDF 
approximation.  The  parameters  s  and  A  are  the 
Lagrange  multipliers  when  the  normalization  and 
orthogonalization  conditions  are  taken  into  ac¬ 
count.  The  functions  and  are  defined  by  the 
electron  orbital  [18].  To  solve  Eq.  (3)  we  apply  the 
self-consistent  field  approximation.  The  ideas  un¬ 
derlying  the  SPDF  self-consistent  field  theory  are 
similar  to  those  in  the  Dirac-Fock  (DF)  theory.  A 
numerical  solution  was  performed  by  transform¬ 
ing  the  differential  equations  to  a  logarithmic  scale 
p  =  In  r.  A  uniform  grid  in  p  is  taken  between 
Pmin  and  corresponding  with  =  0.1  X 

e-*/Z  and  =  80  a.u. 

Since  the  interaction  between  an  atom  and  the 
electromagnetic  field  is  represented  by  a  one-par¬ 
ticle  operator,  the  corresponding  matrix  element 
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can  be  written  in  the  one-electron  form 


=  <tv'in,l'i'A> 

1 

r 

/  Xk' ,  m' 

^  ^'Qa'  X~-  k’ ,  m' 

in,l 

Pa  Xk  ,  m 

X~  K,m  j 

+ 

^A'  X~  k'~1,  m' 

^  ^QkXK'  +  \,m' 

in,l 

Pa  X~~  K~i,m 
^*Qa  XK+l,m 

+  1 

X-  — 

^  ^QkXK'  +  l,m' 

inj 

Pa  Xk,  m 
^Qa  X-  K,m  ^ 

+  1 

^  Pa,  V  ,  , 

^Qa!  X-k' ,  m' 

inj 

Pa  X-  K—1,  m  \ 
^*Qa  XK+l,m  1 

where  we  have  =  jf,((x)/c,  The  terms  in 

the  above  expression  can  be  written  as  radial  and 
angular  parts,  for  example. 


PxXk'.m' 

^'Qa'  X-k' ,  m' 


inj 


Pa  Xk  ,  m 
^*Qa  X—  K,  m 


(2;a  +  1) 


X  E  <i?A(r)l;J^A'(r)>.  (6) 

y=  ±\ 


where  are  Clebsch-Gordan  coefficients; 

R(y)  is  the  radial  part  of  a  wave  function,  defined 
through  the  large  P^ir)  and  small  Qa(^)  compo¬ 
nents  of  the  wave  function 


(R^iy)\j,\RAy)) 


iPA(r) 

\QA(r) 


if  7  =  1 
if  7=  ~1, 


(7) 


with  the  condition  that  (l^  +  +  k)  is  even,  and 

is  a  spherical  Bessel  function. 

The  total  matrix  element  can  be  represented  by 
a  linear  combination  of  the  matrix  elements  in  Eq. 
(5). 

Vab  =  E  EcaCa'I’aa'/  (8) 

A  A'  m 


where  the  matrix  elements  i?aa'  generated  by  a 
one-electron  dipole  transition  between  states  A,  A 
and  A,  A,  included  as  components  in  the  initial 


and  final  levels,  respectively.  The  coefficients 
and  Ca  reflect  the  multiconfiguration  character  of 
terms  and  are  determined  by  solving  the  multicon¬ 
figuration  problem  with  the  Dirac-Fock  basis  set 
(MCDF)  [1].  In  SPMCDF,  the  Dirac-Fock  spin- 
polarized  orbitals  in  Eq.  (1)  were  used  as  the  basis 
functions  in  Eqs.  (5)  and  (8).  The  dipole  selection 
rules  for  the  different  polarizations  of  external 
field  are  determined  by  the  Clebsch-Gordan  coef¬ 
ficients. 


Transitions  to  the  Continuum 

The  formalism  for  the  calculation  of  photoion¬ 
ization  matrix  elements  is  similar  to  that  between 
discrete  levels.  For  the  description  of  the  contin¬ 
uum  state  we  have  limited  ourselves  to  a  single 
configuration  as  the  correlation  interaction  be¬ 
tween  the  continuum  states  is  assumed  to  be  negli¬ 
gible.  The  outer  (free)  electron  with  a  well-defined 
orbital  momentum  and  kinetic  energy  is  consid¬ 
ered  to  be  moving  in  a  core  potential.  The  core 
potential  is  found  by  a  SPDF  calculation  of  the 
core  orbitals  in  either  the  ground  or  excited  state. 
A  numerical  integration  leads  to  a  continuum 
two-component  wave  function  cf)  with  the  correct 
phase  behavior.  We  matched  the  wave  function  (f> 
to  Coulomb  functions  between  10  and  20  a.u.,  with 
a  charge  appropriate  for  the  remaining  ion  to  nor¬ 
malize  the  wave  functions.  The  Coulomb  regular  F 
and  irregular  G  functions  were  calculated  using 
continued  fractions  [19]. 

The  many-body  photoionization  matrix  ele¬ 
ments  of  Eq.  (8),  assuming  a  single  configuration 
for  the  continuum,  reduces  to 

’'4,^/ =  E  Eca^'a,^;/  (9) 

A  m 

where  a  labels  the  initial  discrete  state,  and  i 
labels  the  core  orbital.  Here  the  label  £  combines 
both  the  kinetic  energy  and  the  orbital  angular 
momentum  of  the  free  electron.  The  one-electron 
matrix  element  is  described  by  Eq.  (5),  but 
now  calculated  with  continuum  wave  functions. 

The  autoionization  matrix  elements  describe  the 
transition  between  an  autoionizing  atomic  state  a 
and  an  ion,  which  need  not  be  in  the  ground 
configuration,  plus  a  free  electron.  This  process 
involves  separating  two  electrons  in  the  outer  shell 
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and  is  mediated  by  the  Coulomb  repulsion  be¬ 
tween  those  electrons.  Within  the  Dirac-Fock 
framework  each  electron  can  be  assigned  an  inde¬ 
pendent  wave  function.  Calculating  the  many-elec- 
tron  autoionization  matrix  elements  therefore  re¬ 
duces  to  the  evaluation  of  two-electron  matrix 
elements.  If  the  electrons  participating  in  the  tran¬ 
sition  are  labeled  A  and  B  for  the  atomic  state  a 
and  C  and  D  for  the  continuum  configuration, 
where  D  defines  the  escaping  electron,  then  the 
matrix  element  is  given  by 

(ABHAiJCD) 

=  ]/(2Ja  +  1)(2/b  +  l)(2;c  +  +  1) 

y  {2v  +  1) 

S,C,D) 

mg  m^/  mg,  a 

V  rH^nA-^c)  cHms-rtio)  m 

where  the  c^^'s  describe  the  many-electron  Dirac- 
Fock  wave  function  in  terms  of  Slater  determi¬ 
nants.  The  radial  integrals  have  the  usual  form 

R"(A,  B,C,  D) 

=  (  dr^f  dr2(P^(ri)Pc(r^)  +  Q^(rj)Qc(r^)) 
•'o  •'o 

X  ^+1  (^b(^2)^d(^2)  +  Qb(^2^Qd(^2^^- 

(11) 

The  configuration  interaction  (Cl)  matrix  element 
for  autoionization  is  then  subsequently  found  as  a 
linear  combination  of  these  two  electron  matrix 
elements  with  coefficients  given  by  the  Cl  calcula¬ 
tion  for  the  a  level. 


Calculation  of  Absorption 
Cross  Sections 


correlated  to  and  interacts  with  its  own  contin¬ 
uum.  The  total  wave  function  ^  of  such  a  system 
evolving  under  influence  of  a  light  field  of  fre¬ 
quency  CO  is  represented  as  a  linear  combination  of 
atomic  wave  functions,  as  calculated  in  the  two 
preceding  sections,  which  are  either  discrete  i(/}^  or 
continuum  wave  functions.  Note  that,  in  this 
section  the  subscripts  A:,  n  =  0, 1, 2, . . .  label  dis¬ 
crete  states,  while  the  subscript  i  labels  the  various 
continua,  i.e.,  the  various  core  orbitals,  and  s  is 
the  kinetic  energy  of  the  free  electron.  Therefore, 
we  have 


^  =  CoCOiAoe'"'  +  E 

A:#0 

+  XI  j  ds  (12) 

continua 


where  the  coefficients  c  describe  the  light-induced 
mixing  of  states.  The  dynamics  of  the  wave  func¬ 
tion  is  given  by 


i—  -  (13) 

at 

where  includes  the  atomic  and  relativistic 
atom-field  interaction 

Ho  -  Ea,A^.(0,  (14) 

J 

A  jit)  is  the  relativistic  vector  potential  of  the  ex¬ 
ternal  field,  tty  is  a  Pauli  matrix,  and  the  sum  j  is 
taken  over  all  electrons.  Inserting  Eq.  (12)  into  Eq. 
(13),  integrating  over  all  electron  coordinates  r, 
and  using  the  rotating  wave  or  resonant  approxi¬ 
mation  to  the  time-dependent  differential  equa¬ 
tions,  the  coefficients  c  are  given  by 

(^c„(0  -  E„c„(f) 

=  T.  Ck(t)Vi,n+  E  f 

k=^n  /e  continua 

(15) 


The  nature  of  giant  resonances  is  complicated 
due  to  the  simultaneous  presence  of  a  multiplet 
structure  and  discrete  and  continuum  contribu¬ 
tions.  Any  model  for  the  frequency  dependence  of 
the  resonances  must  have  a  discrete  initial  state  iJjq 
and  a  number  of  final  states  embedded  in  the 
autoionizing  region.  Moreover,  each  final  state  is 


d  ^ 

dt  ^ 

where  the  index  n  denotes  either  the  initial  or  final 
state  and  the  energies  E  are  the  atomic  energies 
except  for  the  initial  state  where  Eq  equals  the 
initial  atomic  energy  plus  the  photon  energy  co. 
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The  matrix  elements  V  denote  the  coupling 
strength  between  the  various  levels.  In  fact,  the 
lead  to  field-induced  photoionization  (A:  =  0) 
or  autoionization  (A:  >  0). 

For  the  transition  between  the  initial  and  upper 
state  it  follows  that  we  have  „  ==  „F, 

where  the  Ihe  dipole  transition  matrix 

elements  and  F  the  field  strength.  Similarly,  for 
the  field-induced  photoionization,  we  have  X4/,jt=o 
=  where  the  d^^  Q  denote  dipole  transi¬ 

tion  matrix  elements  into  the  continuum.  On  the 
other  hand,  the  nonzero  autoionizing  matrix  ele¬ 
ments  (for  A:  >  0)  are  independent  of  the  laser 
field  and  arise  from  CL  Moreover,  the  matrix  ele¬ 
ments  for  k,n  >  0  can  in  principle  be  nonzero. 
This  is  due  to  the  configuration  interaction  be¬ 
tween  two  excited  states.  However,  our  level 
structure  calculation  has  already  taken  those  ef¬ 
fects  into  account  and  thus  we  have  =  0  for 
A:,  n  >  0.  Finally,  as  correlation  between  continua  is 
weak,  the  matrix  elements  14/^  are  negligibly 
small  and  therefore  have  been  omitted  from  Eq, 
(15). 

The  major  difficulty  in  solving  Eq.  (15)  by 
straightforward  integration  over  time  t  is  caused 
by  the  kinetic  energy  s.  In  fact,  Eq.  (15)  forms  an 
infinite  set  of  coupled  differential  equations,  which 
are  complicated  by  the  fact  that  the  matrix  ele¬ 
ments  Vgi  have  a  weak  dependence  on  e  such 
that  any  introduction  of  an  energy  cut-off  will  lead 
to  a  theory  with  poor  convergence  characteristics. 
Therefore  we  applied  the  Floquet  approach  which 
transforms  the  differential  equations  into  a  set  of 
integral  equations  with  the  property  that  the  inte¬ 
gration  over  s  can  be  solved  analytically.  In  the 
Floquet  method  the  Laplace  transformation 


Cp(p)  ^  f  dtc  iOe^^,  (16) 

•'o 


where  p  is  either  k  or  reduces  Eq.  (15)  to  the 
form. 


[ip  -  E„]c„ip)  - 

k^n 

^  «  zb^(O), 

i  e  continua 

[ip  -  =  Ki(0),  (17) 

k 


by 


Co(0)  =  1;  q(0)  =  c^,(0)  =  0  (18) 


for  A:  >  0.  In  deriving  Eq.  (17),  it  is  assumed  that 
the  autoionizing  and  Cl  matrix  elements  V  are 
time-independent. 

Equation  (17)  can  be  rewritten  by  eliminating 
The  second  equation  of  Eq.  (17)  gives 


c.i(p) 


Wp)Vk,si 

ip  -  E., 


(19) 


and  is  inserted  into  the  first  equation  of  Eq.  (17). 
The  integral  over  e  can  now  be  solved  with  Dirac's 
formula  to  give 


ip  -  E./ 


=  p  _  y  . 

^  ^  k,  Si  si,nf 


(20) 


where  is  a  matrix  element  at  the  point  s  = 
-F^hw  +  ^Pf  where  we  have  ■  -h  e,  and 

Fjj^  j  is  the  atomic  energy  at  threshold.  Because 
„  has  weak  energy  dependence,  the  principal 
part  P  is  small  and  will  be  neglected  henceforth. 
Moreover,  this  weak  dependence  allows  one  to 
replace  «  with  its  value  at  resonance  when 
E^i  =  E^j.  Therefore  we  get,  for  the  discrete  levels, 

(ip-E„  +  i7r  D  \Ki,n\^]cn(-P') 

'  i  e  continua  ’ 

-  L  (Vkn  -  i'^  E  Vk,eiKi.„]  =  *C„(0). 

k=^n^  /e  continua  ' 

(21) 


For  each  p  Eq.  (21)  forms  a  finite  set  of  algebraic 
equations  of  form  (ip  -  A)c(p)  =  ic(0)  over  the 
discrete  levels  n,  where  -A  denotes  the  matrix 
formed  by  the  terms  in  Eq.  (21),  which  are  inde¬ 
pendent  of  p.  In  general,  the  solutions  of  the 
algebraic  equations  are  given  by 


c„(p)  =  'ZiCk^OXip  -  A)J, 

k 


(ip  -  A)J 


Dnk(ip'> 

D 


(22) 


where  c„(0)  and  c^iiO)  describe  the  population  am-  where  D^^„(ip^  D  are  the  minors  and  the 
plitudes  of  all  levels  at  time  f  =  0,  and  are  given  determinant  of  the  matrix  (ip  -  A),  respectively. 
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The  inverse  Laplace  transformation  of  Eq.  (22)  gives 
us  the  time-dependent  solutions  of  Eq.  (21)  and 
after  some  algebra  we  find 


c„0)  =  iL 

k 


riy  :/=  /c(  ~ 


(23) 


From  the  population  P„(f)  =  of  a  level,  the 

partial  photoabsorbtion  cross  section  is  found  as 


cr^((o) 


1 

(x)C 


(24) 


where  r  is  the  pulse  duration  and  F  the  field 
strength.  The  method  was  originally  developed  for 
strong  field  applications  [21]. 


Results 

The  absorption  cross  section  of  Gd^^  as  a  func¬ 
tion  of  the  photon  energy  is  calculated  using  the 
resonant  energies  and  the  dipole  matrix  elements 
of  transitions,  from  the  ground  state  of  the 
configuration  to  the  states  of  the  4d^4/^  configura¬ 
tion,  which  lie  in  the  range  of  photon  energies 
between  139  and  180  eV.  The  Gd^^  ground  state  is 
a  high-spin  state  with  seven  aligned  4/  electrons 
and  zero  orbital  angular  momentum,  i.e., 

The  absorption  selection  rule  A  /  =  ±1,0  permits 
transitions  to  states  with  total  angular  momentum 

r'  —  5  7  9 

J  2'  2'  2' 

The  MCDF  method  [1]  was  applied  to  calculate 
initial  and  final  states  in  the  intermediate-coupling 
regime.  The  Cl  procedure  included  7  relativistic 
configurations,  belonging  to  the  4d^‘^4/^  configu¬ 
ration  and  describing  the  initial  states  and  14  rela¬ 
tivistic  configurations  describing  the  final  states  of 
the  4:d^4:f^  configuration  where  seven  and  eight 
electrons  were  divided  in  all  possible  ways  over 
the  4/5/2  and  the  4/7/2  relativistic  orbitals.  Table  I 
presents  the  resonant  energies  of  the  strongest 
lines  in  the  Gd^^  spectrum  which  contribute  to  the 
giant  resonance  and  correspond  to  transitions  from 
the  lowest  level  of  the  4rf^°4/^  configuration.  Other 
lines  (530  lines  in  the  range  between  140  and  160 
eV)  have  strengths  smaller  by  a  factor  of  2  or  more. 

The  total  absorption  cross  section  is  summed 
from  three  partial  cross  sections  found  for  transi¬ 
tions  to  states  with  total  angular  momenta  quan¬ 
tum  numbers  /'  =  f ,  f  of  the  4d^4/^  configura¬ 
tion.  Because  there  is  no  interference  between 


TABLE  I _ 

Resonant  energies  of  absorption  lines  used  to 
construct  the  giant  resonance  In  the  spectrum. 


J'  of  final  state 

Resonant  energy  in  eV 

9 

2 

140.444 

7 

2 

141.856 

9 

2 

144.424 

7 

2 

144.537 

5 

2 

144.622 

9 

2 

144.787 

5 

2 

153.990 

5 

2 

154.028 

5 

2 

154.092 

5 

2 

154.129 

7 

2 

154.972 

7 

2 

155.052 

channels  belonging  to  different  /',  we  solved  the 
multichannel  equations  (21)  for  each  J'  separately. 

For  complex  atomic  systems,  most  of  the  au- 
toionizing  states  have  different  core  states  and 
therefore  decay  to  different  continua.  This  plays  an 
important  role  in  the  formation  of  the  absorption 
spectrum  of  the  Gd^^  ion. 

Figure  1  presents  the  theoretical  results  for  the 
absorption  resonances  induced  by  transitions  to 
the  three  levels  with  /'  ==  f,  while  Figure  2  shows 
the  absorption  lines  of  transitions  from  the  ground 
state  to  states  with  /'  =  f  (four  levels)  and  =  f 
states  (five  levels).  The  later  transitions  comprise 
considerably  more  oscillator  strength  than  the  /' 
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FIGURE  1,  MCDF  absorption  cross  section  for  J  =  | 
to  J'  =  I  resonant  transitions  between  the  ground  state 
and  states  of  the  4af®4f®  configuration. 
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FIGURE  2.  MCDF  absorption  cross  section  for  J  =  | 
to  =  hi  resonant  transitions  between  the  ground 
state  and  states  of  the  4d^4f  ®  configuration. 


=  I  lines.  The  main  reason  for  the  significant 
differences  in  the  cross  sections  and  its  strong 
dependence  on  /'  can  be  found  in  the  multiconfig- 
urational  distribution  of  the  oscillator  strengths 
among  the  levels  of  the  4d^4/^  configuration. 

In  the  calculation,  the  full  set  of  final  states  with 
the  same  total  angular  momentum  J'  are  mixed; 
the  results  are  plotted  in  Figure  3.  The  cross  sec¬ 
tions  turn  out  to  be  given  by  the  sums  of  the 
individual  lines  (cross  sections  are  plotted  in  Figs. 
1  and  2)  to  within  1%.  The  total  cross  section  in 
Figure  3  determines  the  observable  giant  reso¬ 
nance.  The  maximum  of  this  resonance  lies  at  a 
higher  photon  energy  (6.5  eV  higher)  than  the 
experimental  peak  resonance  at  148.86  eV  [10,  11]. 
Figure  3  also  displays  several  weak  absorption 


lines  preceding  the  giant  resonance  in  the  range 
from  140  to  146  eV.  These  satellites  contain  a  small 
fraction  of  the  oscillator  strengths,  but  they  are 
very  characteristic  features  of  the  absorption  spec¬ 
trum,  observed  in  core  transitions  of  atomic 
gadolinium  [10]  and  in  solid  gadolinium  [11]. 

In  order  to  compare  the  theoretical  profile  of 
Figure  3  (solid  line)  with  the  experimental  reso¬ 
nance  from  [11],  the  theoretical  cross  section  was 
shifted  toward  lower  photon  energies  by  6.5  eV. 
The  result  is  presented  in  Figure  4  together  with 
the  experimental  profile.  Since  the  experimental 
absorption  cross  section  is  given  in  relative  units, 
we  normalized  it  to  our  absolute  theoretical  total 
cross  section  to  achieve  the  best  comparison.  Fig¬ 
ure  4  shows  that  the  MCDF  calculation  (solid  line) 
describes  the  experimental  spectrum  (round  dots) 
quite  well.  The  giant  resonance  in  both  cases  has  a 
single  Fano-type  profile  with  almost  the  same 
width  and  asymmetry  parameter  q.  The  results 
indicate  that  the  theoretical  approach  of  the  MCDF 
method  is  quite  appropriate  for  an  accurate  de¬ 
scription  of  the  Gd  absorption  spectrum. 

The  MCDF  results  discussed  above  yield  good 
agreement  with  the  experiment,  but  do  not  include 
correlations  due  to  spin  polarization.  Flowever, 
this  correlation  is  important  in  calculating  mag¬ 
netic  dichroism.  In  order  to  introduce  spin  polar¬ 
ization  we  included  in  the  model  the  alignment  of 
all  electron  spins  in  the  4/^  orbital  (4/5/2  T  T  T  T  T 
T 4/7/2  T)  of  the  ground  state  of  Gd^^.  The  spin 
alignment  of  orbitals  in  the  upper  configuration 
4d^4/^  is  introduced  as  4^5/2  T  4/5/2  f  T  T  T  T  T 
4/7/2  T  i  •  Spin  polarization  of  the  4/  orbital  and 


FIGURE  3.  MCDF  total  absorption  cross  section  for 
J  =  I  to  J'  =  f,  I  resonant  transitions  summed  over 
final  states  (dashed  lines)  and  the  total  absorption  cross 
section  (solid  line). 


FIGURE  4.  Theoretical  (solid  line)  and  experimental 
(open  circles)  total  absorption  cross  section  in  Gd^"^. 
The  theoretical  cross  section  is  shifted  by  6.5  eV  toward 
smaller  photon  energy. 
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the  spin-orbit  splittings  of  both  4d  and  4/  orbitals 
lead  to  the  strong  magnetic  dichroism  in  Gd.  In 
other  words,  magnetic  dichroism  appears  when 
one  can  observe  multiplet  structure  within  the 
resonant  profile.  Our  results  show  that  the  multi¬ 
plet  structure  in  Gd^"^  is  due  to  the  fine  splitting  of 
final  states. 

In  the  spin-aligned  model  we  consider  transi¬ 
tions  from  the  ground  state  with  /  =  f  and  =  f . 
First,  we  assume  that  the  external  field  is  either 
left-  or  right-circular-polarized.  The  selection  rule 
Am  =  +1  for  circular-polarized  light  allows  tran¬ 
sitions  from  the  ground  state  to  /'  =  |  levels  only 
for  left-circular-polarized  light  and  to  /'  =  f ,  f 
levels  with  right-circular-polarized  light.  The  pro¬ 
nounced  polarization  dependence  of  the  calculated 
spectrum  for  left-  and  right-circular-polarized  radi¬ 
ation  can  be  seen  in  Figure  5. 

The  dashed  lines  in  Figure  5  describe  the  total 
absorption  cross  section  for  left-  and  right- 
circular-polarization.  The  cross  section  for  right- 
circular-polarized  light  is  much  higher  than  for 
left-circular-polarized  light.  The  solid  line  in  Fig¬ 
ure  5  is  the  difference  between  the  two  cross  sec¬ 
tions  and  defines  circular  dichroism. 

Linear  dichroism,  defined  as  the  difference  of 
the  cross  section  for  x-linear  and  z-linear  polar¬ 
ized  light,  gives  complementary  information  about 
Gd^'^.  The  cross  section  for  x-linear  polarized  light 
is  half  of  the  sum  of  cross  sections  for  left-  and 
right-circular  polarized  transitions  while  the  cross 
section  for  z-linear  polarized  light,  with  selection 
rule  Am  =  0,  allows  transitions  to  }'  —  f ,  |,  but 
not  to  /'  =  f  levels.  The  resulting  spectra  are 
shown  in  Figure  6. 


FIGURE  5.  Total  absorption  cross  section  for  left  and 
right  circular  polarization  (dashed  lines),  magnetic 
circular  dichroism  (solid  line). 


FIGURE  6.  Total  absorption  cross  section  for  x-  and 
z-linear  polarization  (dashed  lines),  magnetic  linear 
dichroism  (solid  line). 


Conclusion 

We  have  found  from  MCDF  calculations  that 
the  giant  resonance  in  the  Gd^*^  spectrum  between 
135  and  160  eV  has  multiplet  structure.  This  struc¬ 
ture  is  described  by  the  spin-orbit  splittings  of  the 
4rf^4/^  final  configuration.  Moreover,  partial  ab¬ 
sorption  asymmetric  lines  of  different  total  angular 
momentum  differ  significantly.  This  allows  the 
calculation  of  the  dependence  of  the  cross  section 
on  atomic  and  light  polarization  and  thus  the  cal¬ 
culation  of  magnetic  dichroism  in  Gd^"^. 

Comparison  of  the  theoretical  and  experimental 
results  shows  that  we  describe  the  resonance  shape 
accurately,  whereas  the  position  of  the  resonance 
is  obtained  with  a  4%  accuracy.  We  believe  that 
the  distribution  of  the  oscillator  strengths  will  im¬ 
prove  when  additional  correlation  beyond  Cl  with 
only  the  4d^°4/^  and  4d^4/^  configurations  is 
introduced. 

The  SPMCDF  calculation  of  the  absorption  cross 
section  in  Gd^^  shows  a  strong  dependence  of  the 
cross  section  on  the  polarization  of  light.  The  mag¬ 
netic  dichroism  for  the  circular  polarization  is 
nearly  maximal. 
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ABSTRACT:  The  differential  scattering  cross  section  (DCS)  for  electrons  scattered 
elastically  by  neon  is  studied  using  a  model  potential  The  short-range  part  of  the 
effective  potential  is  represented  by  the  Chebyshev  approximation  and  the  long-range 
polarization  potential  is  represented  by  an  energy-dependent  function.  The  computed 
differential  cross  section  obtained  using  the  approximate  effective  interaction  potential 
for  electrons  scattered  by  neon  atoms  in  their  ground  state  in  the  50-100  eV  energy  range 
is  compared  with  available  published  results.  In  the  present  study,  a  nonrelativistic 
Hartree-Fock  wave  function  of  the  target  atom  is  used  to  compute  the  short-range  static 
part  of  the  interaction  potential  and  the  parameters  contained  in  the  energy-dependent 
function  are  determined  by  the  minimization  of  the  DCS  with  respect  to  the  angle  and 
the  incident  energy.  The  resulting  DCS  in  the  angular  range  2°  <  0  <  178°  is  found  to  be 
in  good  agreement  with  the  available  experimental  and  theoretical  results  in  the 
intermediate  energy  range.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  585-590, 1997 


Introduction 

The  differential  cross  section  of  electrons  scat¬ 
tered  by  atoms  and  selected  molecules  has 
been  a  topic  of  considerable  interest  as  evidenced 
by  a  large  number  of  experimental  and  theoretical 
work  published  over  the  last  decade.  The  theoreti¬ 
cal  work  has  progressed  mostly  along  the  applica¬ 
tion  of  the  R-matrix  theory  [1],  while  the  semiem- 
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pirical  calculations  based  on  the  optical  potential 
models  [2]  with  or  without  adjustable  parameters 
have  increased  over  the  last  few  years  owing  to 
their  simplicity  and  the  ease  of  computations  lead¬ 
ing  to  the  differential  scattering  cross  section  (DCS) 
over  a  wide  range  of  incident  energy.  In  the  pres¬ 
ent  study,  the  model  potential  is  based  on  the 
dipole  polarizability  of  the  target  atom  and  two 
parameters  contained  in  the  energy-dependent  po¬ 
larization  potential  determined  by  minimizing  the 
DCS  at  the  critical  angle  [3]  with  the  condition  that 
the  computed  integral  cross  section  agrees  with 
the  experimental  value  of  the  integral  cross  section 
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for  two  selected  incident  energies.  The  significance 
of  the  DCS  minimum  at  the  critical  angle  can  be 
xmderstood  on  the  basis  of  the  phenomenon  of 
low-energy  electron  diffraction  associated  with  the 
interaction  betw^een  the  incident  electron  and  the 
atomic  target.  While  the  reported  differential 
cross-section  data  for  a  given  target  atom  varied 
from  5  to  13%  depending  on  the  incident  energy, 
the  integral  cross  section  is  known  to  have  less 
uncertainty  for  most  closed-shell  atoms  over  the 
intermediate  energy  range.  In  the  present  study, 
the  critical  minima  and  the  integral  cross  section  of 
the  computed  DCS  are  shown  to  depend  on  the 
values  of  the  parameters  so  that  one  can  vary  them 
to  find  the  set  that  minimizes  the  DCS  at  the 
critical  angle  and  corresponds  to  the  correct  inte¬ 
gral  cross  section  within  the  uncertainty  of  the 
experimental  data.  Using  this  set  of  parameters, 
the  differential  cross  section  for  the  target  atom  is 
computed  over  the  energy  range  20  <  £  <  500  eV 
and  the  angular  range  2°  <  0  <  180°.  The  com¬ 
puted  DCS  are  in  good  agreement  with  recently 
reported  experimental  and  theoretical  cross-section 
data  for  neon  atoms  over  the  intermediate  energy 
range. 


Approximation  to  the  Effective 
Potential 

The  effective  potential  is  constructed  from  the 
Hartree-Fock  orbitals  [4]  of  the  target  atom  and  is 
of  the  form  given  by 

K^V,/r)  +  V/E,r) 

+  (2Z/r)ZT.-AkPn(r)exp(-B;,r),  (1) 

k  n 

where  V^fE,  r)  are  the  exchange  and 

polarization  potentials,  respectively,  and  the  val¬ 
ues  of  Aj^  and  (/:  =  1,2, . . . ,  M,  M  <  10)  are 
determined  from  the  Hartree-Fock  wave  function 
of  the  target  atom.  The  approximate  function  rep¬ 
resenting  the  short-range  part  of  the  potential  is 
then  determined  using  the  Chebyshev  polynomials 
T„(x)  given  by 

ro(x)  =  l,  T^(x)=^x,  T^Cx)  =  2x^  -  1. 

(2a) 


The  higher-order  functions  can  be  generated  with 
the  recursion  relation 

T„^,(x)  =  2xT„(x)  -  T„_i(x).  (2b) 

The  coefficients  Cj  for  the  representation  of  the 
effective  potential  is  computed  from  the  following 
equation: 

q  =  (2/N)i:Log,{(r,/2Z)<Ool  V(r,)lT>o>} 

k 

XTj_-y(Xk),  (2c) 

where  x^^  cosiirik  -  |)/N}.  The  Chebyshev  rep¬ 
resentation  truncated  to  order  M  (M  <  N)  is  then 
given  by  (/:  =  1  to  M  in  the  sum) 

Vo(r)  =(2Z/r)EP„(r)exp{EQT,„i(r)  -Q/zJ. 

(3) 

In  the  approximating  equation,  several  terms  con¬ 
taining  exponential  and  polynomial  terms  in  the 
original  function  are  reduced  to  fewer  exponential 
terms  of  a  sum  of  polynomials.  This  speeds  up  the 
DCS  minimization  code  used  in  the  present  calcu¬ 
lations.  The  operator  V(r^)  contained  in  the  equa¬ 
tion  for  Cj  represents  the  Coulomb  interaction 
between  the  incident  electron  and  the  target  atom. 
In  the  present  study,  the  Log(base  e)  of  the  origi¬ 
nal  function  is  used  due  to  its  exponential  varia¬ 
tion  for  the  Chebyshev  representation  using  a  value 
of  N  =  30  and  the  DCS  is  computed  using  the 
approximate  function  with  M  =  9.  The  exponen¬ 
tial  function  is  then  used  to  reverse  the  basis 
representation  of  the  Log  of  the  original  function 
for  the  computation  of  DCS.  These  coefficients  are 
determined  with  the  condition  that  the  DCS  com¬ 
puted  from  the  original  interaction  agrees  with  the 
DCS  obtained  from  the  approximated  potential 
within  1%.  This  has  been  found  to  be  sufficiently 
accurate  for  the  DCS  minimization  algorithm  used 
in  the  present  study. 

Thus,  the  effective  potential  used  in  the  present 
study  is  of  the  form 

U,(£,  r)  =  y,,(£,  r)  4-  V^(E,  r)  +  V,(r).  (4) 


Computation  of  Differential  Gross 
Section 

The  effective  interaction  potential  used  in  the 
present  study  is  the  sum  of  the  short-range  and  the 
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energy-dependent  long-range  polarization  terms 
given  by 

V(r)  =  <^>o|yo(r,r,)l<l>o> 

+  a,/[r+f(A,B,E)t  +  (5) 

where  is  the  ground-state  Hartree-Fock  [4] 
wave  function  of  the  target  atom  and  Vq  contains 
the  electron-electron  and  electron-nuclear  poten¬ 
tials,  is  the  dipole  polarizability  of  the  target 
atom,  and  the  parameters  A  and  B  contained  in 
/(A,  B,  E)  are  to  be  determined  for  the  selected 
incident  energy  £  (eV)  of  the  electron.  The  form  of 
the  exchange  potential  is  similar  to  the  one 
used  in  an  earlier  calculation  [5].  The  differential 
scattering  cross  section  /(£,  0)  is  then  computed 
from  the  scattering  amplitude  /(/c,  0)  as  a  function 
of  the  wave  number  k  and  scattering  angle  0 
given  by 

/(fc,0)  =  a/2ik)Y.(2l  +  1) 

/ 

X[exp(2/5;)  -  l]P;(cos0),  (6) 

where  the  phase  shifts  5;  are  computed  by  solving 
the  radial  part  of  the  Schrodinger  equation  for  the 
angular  momentum  in  the  range  0  <  /  <  12  using 
the  Numerov  algorithm  [6]  from  which  the  zeros 
of  the  wave  functions  are  used  in  the  argument  of 
Bessel  and  Neumann  functions  to  compute  the 
phase  shifts  accurate  to  four  decimal  places.  For 
higher  partial  waves,  the  Born  approximation  is 
used  for  I  in  the  range  11  <  /  <  30  with  a  32-point 
Gauss  quadrature  method  for  the  numerical  inte¬ 
gration,  The  phase  shifts  for  higher  partial  waves 
are  computed  using  an  analytical  expression  [7]  for 
the  phase  shift  in  terms  of  integrals  involving 
Bessel  functions  of  fractional  order  for  values  of  I 
up  to  200.  It  has  been  found  that  for  scattering 
angles  >  2°  not  more  than  200  partial  waves  are 
needed  for  the  DCS  to  converge  to  an  accuracy  of 
±0.1%.  However,  for  angles  less  than  0.1®,  it  was 
found  that  more  than  1000  partial  waves  were 
needed  for  convergence  to  the  listed  accuracy.  All 
the  computations  were  performed  in  double  preci¬ 
sion  using  an  IBM4381.  For  scattering  angles  in 
the  small-angle  range,  it  was  found  that  a  faster 
algorithm  was  required  to  efficiently  execute  the 
minimization  routine.  For  this,  the  electron-atom 
interaction  was  represented  by  a  truncated  least- 
square-fitted  function  containing  fewer  terms 
which  substantially  reduced  the  computing  time 
of  the  forward  scattering  amplitude  and  the  result¬ 


ing  integral  cross  section.  For  the  higher  partial 
waves,  the  approximation  to  the  short-range  inter¬ 
action  did  not  significantly  affect  the  final  results 
since  the  phase  shifts  depend  mostly  on  the  polar¬ 
ization  potential.  This  enabled  the  determination 
of  the  approximate  values  of  parameters  A  and  B 
for  helium  and  neon  atoms  that  satisfy  the  speci¬ 
fied  conditions  with  a  few  minutes  of  computer 
time  for  each  incident  energy  for  a  given  target 
atom. 


Discussion  of  Results 

The  variation  of  computed  DCS  with  the  pa¬ 
rameters  contained  in  the  polarization  potential 
/(A,  B,  E)  is  shown  in  Figure  1.  The  plots  num¬ 
bered  1, 2, ...  6  are  the  DCS  corresponding  to  the 
energy-dependent  polarization  potential  with  two 
parameters  for  an  incident  energy  of  60  eV.  For 
neon,  the  minimum  of  DCS  occurs  near  105°  as 
seen  in  plot  #1  and  for  smaller  angles  as  the  value 
of  /  is  increased.  The  corresponding  values  of  the 
computed  integral  cross  section  are  shown  in  Table 
1.  For  the  approximate  determination  of  parame¬ 
ters  A  and  B,  the  experimental  values  [8]  of  a  at 
60  and  100  eV  were  used.  Comparison  of  a  with 
other  experimental  values  is  shown  in  Figures  2 


FIGURE  1.  Plots  numbered  1-6  showing  the  variation 
of  computed  DCS  for  e-neon  scattering  at  60  eV  using 
the  model  potential  for  various  values  of  f{A,  B,  E)  with 
B  -  0.00476  and  A  =  0.7-1 .5  in  steps  of  0.1  (DCS  In 
units  of  ao/ster). 
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TABLE  I _ 

Variation  of  integral  cross  section  a  with  f(£)  for 
neon  at  the  incident  energy  E  ^  60  eV;  experimental 
value®  of  a  =  2.98  +  0.06  (10“^®  cm^/ster). 


Plot# 

m 

cr 

1 

0.986 

3.181 

2 

0.820 

3.172 

3 

0.920 

3.150 

4 

1.020 

3.131 

5 

1.120 

3.100 

6 

1.220 

3.071 

®  Experimental  data  from  [8]. 


and  3.  The  final  values  of  A  and  B  are  determined 
by  minimizing  the  DCS  surface  as  shown  in  Figure 
4  in  the  energy  range  50  <  E  <  90  eV  and  angular 
range  90  <  ^  <  110°.  The  contour  plot  shown  in 
Figure  5  indicates  that  the  critical  values  of  the 
energy  and  angle  are  £  =  66  eV  and  6  =  98°,  These 
are  in  good  agreement  with  the  experimental  val¬ 
ues  of  65  eV  and  101°,  respectively.  The  values  of 
the  parameters  are  determined  to  be  A  =  1.11  and 
B  =  0.0036.  The  computed  values  of  cr  for  various 
incident  energies  are  compared  with  the  experi¬ 
mental  values  of  Register  et  al.  [8]  and  the  R-ma- 
trix  calculations  of  Fon  et  al.  [1]  as  shown  in  Table 
II.  The  comparison  between  the  present  results  and 
other  theoretical  and  available  experimental  data 


FIGURE  2.  Comparison  of  the  computed  integral  cross 
section  for  e-neon  scattering  for  incident  energy  in  the 
range  30-200;  (□)  results  of  this  work;  (■)  experimental 
data;  {m)  P-matrix  theory. 


is  shown  in  Table  III  (50  eV)  and  Table  IV  (100  eV). 
The  computed  integral  cross  section  for  e-neon 
scattering  for  the  incident  energy  of  30-”200  eV  and 
its  comparison  with  experimental  data  are  shown 
in  Figure  2  and  the  computed  differential  cross 
sections  at  100  eV  compared  with  recent  experi¬ 
mental  and  theoretical  results  for  neon  are  shown 
in  Figure  3. 

The  computed  values  of  the  DCS  (open  squares) 
are  seen  to  be  in  good  agreement  with  the  experi¬ 
mental  data  and  R-matrix  calculations  for  scatter¬ 
ing  angles  up  to  140°.  For  larger  angles,  the  model 
potential  results  seem  to  differ  somewhat  from  the 
available  theoretical  data.  For  different  target 
atoms,  the  present  model  calculations  for  the  DCS 
are  in  good  agreement  with  available  published 
data  except  near  the  critical  angles.  It  is  possible 
that  there  is  more  uncertainty  in  the  DCS  near 
this  region  compared  to  those  near  the  forward 
scattering  angles.  A  more  general  function  for 
f{A,  B,  C,  £)  is  being  tested  to  determine  the  DCS 
near  the  critical  angle  for  other  atoms.  The  value  of 
the  computed  critical  angle  has  been  found  to  be 
in  good  agreement  with  published  data.  The  pres¬ 
ent  model  is  being  extended  to  argon,  krypton, 
and  xenon  to  determine  the  limitations  of  the  ef¬ 
fective  interaction  potential  for  the  computation  of 
the  DCS  in  the  intermediate  energy  range.  In  con¬ 
clusion,  the  usefulness  of  the  model  potential  used 
in  the  present  study  is  for  intermediate  energies 
where  a  large  number  of  partial  waves  are  needed 


SCATTERING  ANGLE  (deg) 

FIGURE  3.  Comparison  of  the  computed  DCS  (in  units 
of  cm^  /  s)  for  e-neon  scattering  at  an  incident  energy  of 
100  eV.  Markers  are  same  as  in  Figure  2. 
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TABLE  II _ 

Comparison  of  the  computed  cross  section  a 
with  experiment  and  R-matrix  theory  at  various 
incident  energies  for  neon  atom;  o-  in  units 
of  (10“^®  cm^/ster). 


E 

0-® 

(T^ 

50 

3.45 

3.13 

3.08 

60 

3.12 

2.98 

2.86 

70 

2.85 

2.82 

2.70 

80 

2.62 

2.60 

— 

100 

2.27 

2.23 

2.25 

200 

— 

1.45 

^Present  results. 

‘^Experimental  values  from  [8]. 
'^Theoretical  values  from  [1]. 


TABLE  III 


Comparison  of  the  differential  cross  section  with 
experimental  and  other  theoretical  results  for  neon 
at  50  eV  (units  in  ag/ster). 

Angle  0 

Theory 

Present 

[9] 

Experiment 

[8] 

0 

6.354 

6.8582 

10 

4.731 

4.6057 

15 

3.959 

3.8401 

4.182 

20 

3.329 

3.1967 

3.239 

30 

2.425 

2.2424 

2.100 

40 

1.838 

1 .6455 

1.468 

50 

1.407 

1.2717 

1.139 

60 

1.031 

0.9910 

0.889 

70 

0.673 

0.7209 

0.643 

80 

0.352 

0.4426 

0.400 

90 

0.116 

0.1919 

0.168 

100 

0.016 

0.0345 

0.035 

120 

0.329 

0.2424 

0.268 

140 

1.193 

1.1916 

1.103 

160 

2.105 

2.3315 

180 

2.502 

2.8406 

to  obtain  converged  values  of  the  DCS.  For  low 
energies,  where  a  fewer  number  of  partial  waves 
will  be  sufficient  for  the  convergence  of  the  DCS, 
the  close-coupling  calculations  have  the  advantage 
of  being  more  accurate  at  all  angles. 


TABLE  IV _ 

Comparison  of  the  differential  cross  section  with 
experimental  and  other  theoretical  results  for  neon 
at  100  eV  (units  in  ag/ster). 


Angle  0 

Theory 

(Present) 

Experiment 

[8] 

10 

6.515 

7.605 

20 

3.630 

3.699 

30 

1.969 

1.843 

40 

1.104 

0.968 

50 

0.661 

0.553 

60 

0.405 

0.324 

70 

0.228 

0.199 

80 

0.105 

0.100 

90 

0.042 

0.042 

100 

0.051 

0.039 

120 

0.299 

0.229 

140 

0.758 

0.700 

160 

1.195 

180 

1.385 
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ABSTRACT:  The  Schrodinger  equation  for  the  scattering  of  an  electron  by  a  hydrogen 
molecule  is  solved  by  the  finite  element  method,  in  spherical  coordinates,  using  fifth-order 
Hermite  interpolating  polynomials.  The  computational  method  is  quite  similar  to  the 
work  of  Shertzer  and  Botero  [Phys.  Rev.  A  49,  3673  (1994),  and  references  therein]. 
However,  to  study  large  systems,  an  effective  one-particle  dynamical  equation  is  defined, 
unlike  the  procedure  of  Shertzer  and  Botero.  To  illustrate  the  basic  computational 
procedure,  a  model  electron-H2  interaction  potential  (static  +  exchange  -t-  polarization) 
is  constructed  and  the  X-matrix  is  calculated.  A  novel  feature  of  the  present  method  is 
the  procedure  for  extracting  the  partial-wave  amplitudes  at  a  value  of  r,  the  size  of 
which  is  fixed  by  the  range  of  nonlocal  potentials  in  the  problem,  and  then  propagating 
the  scattering  amplitudes  out  to  an  effective  infinity  where  the  converged  X-matrix  is 
determined.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  591-600,  1997 


Introduction 

Low-energy  electron-molecule  scattering  is  a 
fundamental  problem  in  molecular  physics. 
An  excellent  review  of  the  status  of  this  field  in 
1980  can  be  found  in  the  article  by  Lane  [1].  An 
idea  of  the  current  status  of  the  field  can  be  ob- 
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tained  from  a  collection  of  articles  appearing  in  a 
recent  book  [2].  In  that  book,  a  number  of  different 
methods  are  described  for  the  solution  of  the  low- 
energy  electron-molecule  scattering  problem.  As 
described  in  the  article  by  Temkin  and  Weather¬ 
ford  [3],  when  the  interaction  is  highly  nonspheri- 
cal,  the  use  of  angular  momentum  partial-waves  is 
problematical  in  the  region  of  close  encounter  be¬ 
tween  the  electron  and  the  molecule.  At  a  large 
separation,  partial-waves  are  more  appropriate. 
Thus,  Temkin  and  Weatherford,  in  a  series  of  arti¬ 
cles  [4-7],  described  a  solution  method  whereby 
the  two-dimensional  Schrodinger  equation  is  di¬ 
rectly  discretized  using  finite  differences.  This  was 
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done  within  the  context  of  time-independent  scat¬ 
tering  theory  [1],  and,  thus,  the  partial  differential 
equation  was  elliptic.  They  were  considering  elec- 
tron“N2  collisions  where  the  cylindrical  symmetry 
allows  for  the  separation  of  the  (^variable,  reduc¬ 
ing  the  effective  one-electron  scattering  equation  to 
two  coupled  variables  (r  and  9).  The  scattering 
problem  was  solved  in  the  frame  of  fixed  nuclei 
(body  frame).  Thus,  r  describes  the  distance  to  the 
scattering  electron  and  0  describes  the  angle  of  the 
scattering  electron  position  vector  with  respect  to 
the  z  axis  (see  Fig.  1). 

In  an  independent  development,  Shertzer  and 
co-workers  [8-12]  used  the  method  of  finite  ele¬ 
ments  to  discretize  the  Schrodinger  equation  for  a 
number  of  important  problems.  In  each  case,  they 
solved  the  Schrbdinger  equation  explicitly  as  a 
few-body  problem.  The  present  work  differs  in 
that  we  are  solving  an  effective  one-body  problem 
using  finite  elements  in  two  dimensions  (r,  0). 
This  allows  for  the  possibility  of  solving  for  the 
solution  of  electron  scattering  from  much  more 
complicated  targets  [1]. 

The  X-matrix  is  a  concise  and  computationally 
simple  means  of  presenting  the  results  of  a  scatter¬ 
ing  calculation  using  the  method  of  partial  differ¬ 
ential  equations.  In  the  present  work,  the  K-matrix 
is  calculated  for  a  model  e  —  H2  interaction  poten¬ 
tial,  in  spherical  coordinates,  which  includes  the 
static,  exchange,  and  polarization  potentials.  The 


r  (bohr) 


FIGURE  1.  First  quadrant  of  the  r  -  0  plane  deformed 
into  a  rectangle  (PDE  solution  space). 


cylindrical  symmetry  is  used  to  factor  out  the  (f) 
dependence,  resulting  in  a  two-dimensional  ellip¬ 
tic  partial  differential  equation.  The  scattering  re¬ 
gion  is  bounded  by  a  quarter  circle  (see  Fig.  1, 
where  the  scattering  region  is  shown  as  a  coordi¬ 
nate  rectangle)  at  a  value  of  r  =  10  Uq  where  atomic 
units  are  used.  The  partial-wave  amplitudes  are 
extracted  at  that  value  of  r  =  p  and  then  the  cou¬ 
pled  ordinary  differential  scattering  equations  in  r 
are  propagated  to  an  effective  infinity  where  the 
X-matrix  is  determined  [4,5]. 

In  the  next  section,  we  describe  the  basic  ideas 
of  the  finite  element  method  that  we  are  using. 
The  third  section  describes  the  extraction  of  the 
partial-wave  amplitudes  and  the  propagation  pro¬ 
cedure.  The  fourth  section  describes  the  model  H2 
potential.  Finally,  the  fifth  section  gives  the  results 
and  conclusions. 


Finite  Eiement  Method 


A  short  outline  of  the  finite  element  method  is 
presented  below  for  the  simple  case  of  elastic  scat¬ 
tering  with  local  potentials — the  generalization  to 
inelastic  multichannel  scattering  is  straightforward 
but  complicated:  The  time-independent  Schro¬ 
dinger  equation  for  e  ^-molecule  scattering  using 
the  fixed  nuclei  approximation  (FNA)  in  3D,  using 
spherical  coordinates  (in  atomic  units),  is 

'  2d  1  cot9  d 

dr'^  ^  r  dr  ^  dO^  dO 


1 


r^sin^  0  d<p^ 


^(r,  0,</>) 


=  2F(r,  0,(/>)^(r,  0,(^),  (1) 


The  homonuclear  diatomic  molecule  is  the  sim¬ 
plest  system  for  which  nonspherical  terms  con¬ 
tribute  to  the  scattering  potential,  so  it  will  serve 
as  the  physical  anchor  for  an  otherwise  computa¬ 
tional  study.  When  the  energy  of  the  incident 
electron  is  between  1  and  10  electron  volts,  the 
nonspherical  structure  of  the  molecule  is  particu¬ 
larly  important.  In  such  a  situation,  we  use  the 
axial  symmetry  of  the  molecule  to  reduce  the 
Schrodinger  equation  to  a  two-dimensional  partial 
differential  equation.  We  assume  for  simplicity 
that  the  collision  does  not  excite  vibrational  states 
of  the  molecule.  Rotational  excitations  may  be  dealt 
with  at  the  cross-section  stage  of  the  calculation  in 
the  usual  adiabatic-nuclei  manner  [1]. 
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The  target  molecule  may  be  assumed  to  be 
oriented  in  a  particular  direction  in  space  so  we 
use  the  axis  of  symmetry  of  the  molecule  as  the 
z-axis  of  our  coordinate  system  (Fig.  2)  and  define 
a  modified  wave  function  ij/  which  is  related  to 
the  true  wave  function  ^  for  the  projectile  by 

(2) 


Note  that  we  are  using  the  nonreduced  equation. 
The  equation  for  in  atomic  units  is 


dr^  r  dr 


+  {k^  -  2V{r,e) 


COt(O) - - h 


r^sin(d) 


dO  36^ 

=  0,  (3) 


where  V(r,  6)  is  the  scattering  potential  and  k  is 
the  momentum  of  the  projectile.  A  finite  element 
approximation  to  this  equation  is  solved  on  the 
domain  0<r<p,0<0<  7r/2.  The  outer  limit  p 
is  chosen  to  be  sufficiently  large  that  the  scattered 
particle  sees  a  nearly  spherical  potential  at  that 
distance  and,  for  cases  with  a  nonlocal  potential,  is 
greater  than  the  range  of  the  nonlocality.  The  range 
of  9  was  cut  in  half  to  take  advantage  of  the 
symmetry  of  the  molecule  under  reflections  in  the 
X  —  y  plane. 

No  radial  boundary  condition  is  set  at  r  =  0.  If 
we  were  solving  the  reduced  equation,  we  would 
require  that  =  0.  We  do  set  the  derivative  of  if/ 
with  respect  to  9  equal  to  zero  at  the  origin 
(3il//39\r=.o  =  0).  Even  though  we  are  solving  an 
elliptic  equation,  which  requires  a  specification  of 
a  boundary  condition  on  all  boundaries,  we  may 


FIGURE  2.  Spherical  coordinate  system  for  scattering 
from  a  diatomic  molecule. 


avoid  setting  a  boundary  condition  at  individual 
points  on  the  boundary  as  long  as  they  are  iso¬ 
lated.  At  0  =  0,  there  are  two  possibilities:  If  m, 
the  z-component  of  the  orbital  angular  momen¬ 
tum,  is  nonzero,  then  =  0;  otherwise,  dip/ 39  =  0. 
At  0  =  77/2,  the  parity  of  the  system  dictates  ei¬ 
ther  that  w  =  0  or  3u/39  =  0.  At  r  =  p,  we  impose 
the  requirement  that  the  incoming  electron  be  in  a 
state  of  well-defined  orbital  angular  momentum, 
so  that 


>lfl"'Kp,e)  =  p,„,(cose).  (4) 

The  parity  of  the  associated  Legendre  function, 
determines  the  choice  of  boundary  condi¬ 
tion  at  0  =  77/2.  Although  this  parity  is  conserved 
by  the  scattered  wave  function,  the  value  of  /  for 
the  incident  wave  is  not.  Outgoing  waves  therefore 
contain  a  superposition  of  /,  /  ±  2,  Z  ±  4,  and  so 
forth.  The  finite  element  discretization  results  in  a 
matrix  algebra  problem: 


A  X  X;,,  =  (5) 

where  A  is  the  coefficient  matrix;^X/,„,  the  solution 
vector  at  the  grid  points;  and  the  vector  of 
boundary  values.  Note  that  the  solution  and 
right-hand-sides  are  indexed  by  Zm,  the  Legendre 
parameters  from  Eq.  (4).  Once  the  solution  has 
been  found,  information  about  the  scattering  is 
extracted  by  matching  it  to  the  known  asymptotic 
form  [4]  at  r,  <p: 


=  J^Pi'r,/cos9) 
r 


X 


1 

Tt 


(6) 


and 


3r 

=  T,Pr„,(cos0) 


Id  1 


3r 


1  3 


1 


+  -  —q/kr,)  ~  ^C,(kr,)  N, 


3r 


(7) 


where  Z  and  m  are  the  index  and  order  of  the 
Legendre  function  used  in  the  boundary  condition 
at  large  r  =  p,  and  S  and  C  are  related  to  spherical 
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Bessel  functions  by  S/  =  and  Q  =  —k^^^rrii. 

Thus,  ii  and  are  the  regular  and  irregular  spher¬ 
ical  Bessel  function  [13],  The  coefficients  J  and  N 
form  matrices  on  the  space  of  allowed  values  of 
orbital  angular  momentum.  Note  that  J  and  N  are 
diagonal  in  m  because  of  the  cylindrical  symme¬ 
try,  The  validity  of  Eq.  (7)  can  be  established 
because  of  the  Wronskian  relation  S/Cf  -  SJC^  = 
”  1  (where  the  prime  indicates  differentiation  with 
respect  to  r)  and  Leibnitz's  rule  with  respect  to  the 
parametric  differentiation  of  an  integral.  This  latter 
point  concerning  parametric  differentiation  is  a 
somewhat  subtle  point  that  is  explainable  only 
when  viewed  in  light  of  the  noniterative  integral 
equation  (NIEM)  solution  method  [14,15],  We  will 
not  elaborate  on  the  point  now,  but  only  say  that  it 
is  well  established  [16], 

The  orthogonality  of  the  Legendre  functions  may 
be  used  to  project  out  a  particular  value  of  Z.  When 
this  is  done,  the  linear  independence  of  the  Bessel 
functions  allows  extraction  of  the  unknown  coeffi¬ 
cients  J  and  N  from  Eqs.  (6)  and  (7)  by  a  matrix 
inversion.  This  process  must  be  done  for  each 
allowed  value  of  Z,  e.g.,  each  right-hand  side  of  Eq. 
(5).  However,  the  matrix  A  does  not  have  to  be 
redetermined  since  it  is  independent  of  the  p 
boundary  condition.  The  result  is  two  matrices 
describing  the  transformation  of  an  angular  mo¬ 
mentum  eigenstate  into  a  scattered  wave.  In  prin¬ 
ciple,  the  dimension  of  the  matrices  is  infinite,  but, 
in  practice,  the  finite  range  of  the  nonspherical 
potentials  provides  a  cutoff.  For  an  H2  molecule, 
with  a  projectile  energy  below  10  eV,  values  of  Z 
less  than  5  are  all  that  are  required.  Hence,  the 
matrices  are  at  most  3x3.  The  K-matrix  is  de¬ 
fined  by  the  equation 

=  ”E[5r(fcr)  +  Cy{kr)K,.,]P,,,{cose),  (8) 

so  K  =  NJ"\  and  the  influence  of  the  artificial 
imposition  of  an  initial  orbital  state  is  removed. 
The  diagonal  elements  of  K  are  the  tangents  of  the 
elastic-phase  shifts,  and  the  off-diagonal  elements 
describe  the  coupling  of  states  of  different  orbital 
angular  momentum  by  the  potential. 

The  method  of  finite  elements  is  well  described 
in  [9,17]  and  references  therein.  We  will  give  a 
short  outline  of  the  procedure.  The  procedure  is 

1.  Divide  solution  space  into  nonoverlapping 
domains  called  "finite  elements." 


2.  Define  local  basis  functions  within  each  ele¬ 
ment — in  the  present  case,  fifth-order  Her- 
mite  polynomials. 

3.  Discretize  in  each  element  by  the  Galerkin 
method  and  implicitly  form  a  global  element 
matrix. 

4.  Form  a  global  connected  matrix  by  enforcing 
continuity  of  the  solution  and  its  deriva¬ 
tives  across  element  boundaries. 

5.  Form  a  boundary  reduced  connected  matrix 
and  boundary  vector;  A;  B  ^ 

AX  =  B.  (9) 

6.  Solve  simultaneous  equations  by  lower-up¬ 
per  factorization  (LU)  decomposition  [17,18]: 

X  =  A-^B.  (10) 
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d 

—  pi(  x)\^.  =  for  i  =  2, 4, 6 

and  i  G  {xj,  X2,  Xj}.  (15) 

Let  us  write  Eqs.  (13)  and  (14)  explicitly: 

Pl(x)=  ^11  +gi2X  +  gj3  X^  +  ^14  +  gi5  X^ 

d 

=  g22  +  2g23X  +  3g2iX^  +  4:g25X^ 

+  5^26 

P3(x)  =  g3i  +g32X  +  g33X^  +  ^34  X^  +  g35X^ 

+  ^36^^ 

d 

^P4(^)  =  gil  +  2g43X  +  3^44X2  +  4^45  X^ 

+  5^46  X^ 

PS^X)  =  gs,  +gs2X  +  g33X^  +  g54X^  +  ^55  X^ 

+  ^56^® 

d 

^P6(^)  =  g62  +  2g63^  +  3^64^  +  4^65^^ 

+  5^66 


To  go  to  a  global  representation  of  (11),  it 

is  not  necessary  to  let  x  ->  r  in  pi(  x).  The  global 
operators  can  be  transformed  to  local  operators 
which  operate  only  on  the  piecewise  continuous 
polynomials  within  each  element: 


d  dx  d  2d 
dr  dr  dx  U'p  dx 


d^ 

dr^ 


2 

w 


^  d^ 
1^' 


(18) 

(19) 


So,  we  can  simply  say  that 

-  Y,  (20) 

1  =  1 


However,  the  problem  with  (17)  is  that  for 
/  =  2,4, 6,  represents  derivatives  of  with 

respect  to  the  local  variable  x  and  we  need  it  to 
represent  derivatives  with  respect  to  the  global 
variable  r: 


dlJ/^^Kx) 

dx 


dr  dij/^^Kr) 
dx  dr 


dif/^^Kr) 

dr 


.  (21) 


Thus, 


We  now  implement  conditions  (14)  and  (15)  and 
construct  a  matrix  equation  to  be  solved  for  g-p 


Sn  Si2  Si3  8u  8i5  Sie 

§21  §22  §23  §24  §25  §26 

§31  §32  §33  §34  §35  §36 

§41  §42  §43  §44  §45  §46 

§51  §52  §53  §54  §55  §56 

§61  §62  §63  §64  §65  §6t 


X 


1 

-1 


-1 


-1 


0 

1 


10  1 
0  1  1 


-2  0  0  1 
3  0  0  1 

-4  0  0  1 

5  0  0  1 


-1.  (16) 


The  expression  for  in  terms  of  global  variables 
can  now  be  written  using 


r  =  rk"^^  + 


(1  +  x) 


X  =  -1  + 


y  yi^') 

[  r  ^0  j . 


(17) 


’T^'’)(r)  =  Y,  ^r^Pi(x) 

(  =  1,3,5 


^  1  ,  (?!/»<''>(  r) 

+  E  — - - Pi(x).  (22) 

(=2,4,6  2  dr 


Let  us  now  write 


qr(e)(^)  =  ;^q[rp)pp)(x), 
1=1 


such  that 


fori  =  1,3,5 
qi/'’)  =  ^  Sif/^^Kr) 


dr 


rj=4‘'V  ftW 


(l  +  i,) 


and 


PpKx)  = 


hf^pfx)  for  i  =  2, 4, 6 


(23) 


(24) 


(25) 
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Thus,  the  solution  and  its  derivatives  [see  Eq.  (12)] 
are  extracted  at  each  grid  point  in  the  solution 
space. 


drop  the  /  and  V  label  and  switch  to  i  and  We 
thus  have  the  matrix  of  partial-wave  continuum 
orbitals  f  at  r  =  Thus,  from  [14-16],  we  may 
write 


Partial-wave  Amplitude  Extraction  and 
Propagation  Procedure 

The  matrices  J  and  N  can  be  obtained  from  Eqs, 
(6)  and  (7),  at  r  =  r^,  as  follows: 


N(’»)(r^) 


-S(r,) 


1  ^  1 
-S'(r,)  -  “S(r,) 


-i-i 


-C(q) 


1  ^  1  _ 
-C(r,)  -  — C(r,) 
r; 


(26) 


Note  that  the  prime  indicates  differentiation  with 
respect  to  r.  Also,  S  and  C  indicate  diagonal  matri¬ 
ces.  Note  that  the  F  and  F'  are  the  terms  within  the 
large  square  brackets  of  Eqs.  (6)  and  (7).  Rewriting 
Eqs.  (6)  and  (7)  in  terms  of  f  and  using  the  prime 
to  indicate  differentiation  with  respect  to  r  (except 
when  used  on  V), 

0)  T,Pn,(cos  ^)F«(r,)  (27) 

V 

=  ZP,.„,(cose)F'\'.','Kr,).  (28) 

V 

Then,  using  the  orthogonality  of  the  Legendre 
polynomials, 

^  ^  (/'-m)!(/'  +  l/2) 

=  - 77^ - 7^ - 2 


(/'  +  m)\ 


X  sin(9)  do  6)  (29) 

•'n 


^  ^  (Z'  -  m)!(Z'  +  1/2) 

f  1.7’(r,)  =  - 77^ - 7^ - 2 


(/'  +  m)\ 


X  sin(0)d0P,.Jcosd)ipf”'\r,,e).  (30) 
•'o 

Let  us  drop  the  m  label  and  switch  to  the 
reduced  function  /  defined  by  f  =  rF.  Let  us  also 


f(r)  =  S(r)J(r)  +  C(r)N(r), 


(31) 


where 


Jijir)  =  Sij  +  2fdx  C,(  x)Riji  x)  (32a) 


N: 


'ij(r)  =  2j  dxSi(x)Rjj(x), 


where 


R>i(r)  =  i:V,,(x)Ux). 

k 


(32b) 


(33) 


We  are  working  with  the  unnormalized  solution  in 
the  sense  defined  in  [14],  Then,  by  using  a  trape¬ 
zoidal  integration  rule  with  step  size  A  r,  it  can  be 
shown  that  [16] 


I(r,)  =  P(r,.)I(r,_,), 


where 


and 


and 


Hr) 


J(r) 

N(r) 


P(r)  =  1  +  Ar  M(r) 


(34) 


(35) 


(36) 


M(r)  =  2 


-hC(r)V(r)S(r)  -f  C(r)V(r)C(r) 
-S(r)V(r)S(r)  -S(r)V(r)C(r) 


Thus,  we  may  form 

KrJ  =  P  X  I(r,.), 
where,  if  r,  =  rg,  the  origin,  then 

Krg)  = 


(37) 


(38) 


(39) 


Otherwise,  the  value  of  I(r,)  is  picked  up  from  the 
partial  differential  equation  (PDE)  calculation  and 
constructed  using  Eq.  (35).  Thus,  we  may  propa¬ 
gate  from  the  surface  of  the  PDE  sphere,  out  to  an 
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approximate  infinity.  This  is  what  we  do  in  the 
present  work.  Thus, 

P  =  P(rp  X  P(r[^_,j)  X  -  X  P(r,.)  (40) 

and  P(ry:)  =  P(r^).  Then, 

K(rJ  =  N(rJ  X  (41) 

All  the  scattering  information  is  contained  in  KCr^,). 
It  is  a  real,  symmetric  matrix. 

It  should  be  pointed  out  that  his  propagation 
procedure  is  very  similar  to  the  procedure  of  the 
MEM  [14].  The  MEM  propagated  f  instead  of  I 
and  thus  required  unnecessary  matrix  multiplica¬ 
tions.  Therefore,  this  propagation  is  a  very  general 
solution  algorithm  and,  in  principle,  could  be  used 
all  the  way  into  the  origin.  For  the  present  problem 
under  consideration,  this  works  fine  and  is,  in  fact, 
used  to  check  the  accuracy  of  the  results.  For  more 
general  problems  with  highly  nonspherical  poten¬ 
tials,  accuracy  is  lost  because  of  the  large  number 
of  partial-waves  required  near  the  origin  and  be¬ 
cause  of  the  large  magnitude  of  the  irregular 
spherical  Bessel  function  n^(r)  near  the  origin  as 
compared  with  the  regular  spherical  spherical 
Bessel  function  //(r).  We  have  therefore  defined  a 
computational  procedure  similar  to  the  K-matrix 
method  [19].  The  difference  is  that  we  are  breaking 
the  i^-matrix  sphere  up  into  elements  and  span¬ 
ning  each  element  with  Hermite  interpolating 
polynomials  instead  of  basis  function  spanning  the 
entire  K-matrix  sphere. 


The  Potentials 

The  potential  energy  function  V  in  Eq.  (3)  repre¬ 
sents  the  interaction  of  the  projectile  with  the  tar¬ 
get  molecule  and  is  the  source  of  the  breaking  of 
spherical  symmetry.  In  principle,  this  interaction  is 
highly  complicated.  A  molecule  is  by  no  means  a 
static,  rigid  body  in  the  presence  of  an  incoming 
electron,  so  a  potential  model  is  not  strictly  correct, 
but  the  practical  usefulness  of  potentials  in  calcu¬ 
lations  is  great  enough  to  excuse  the  gross  simpli¬ 
fication  of  the  scattering  problem  that  they  imply. 
The  potentials  involved  in  scattering  from  hydro¬ 
gen  molecules  are  approximations  to  three  effects: 
the  static  electric  moments  of  the  molecule;  the 
polarization  of  the  molecule  by  the  electric  field  of 
the  projectile;  and  the  substitution  of  an  electron 
from  the  molecule  for  the  projectile  in  the  outgoing 


state,  known  as  '"exchange.'"  An  H2  model  was 
adopted  in  the  present  work,  which  mimics  to 
some  extent,  the  radial  structure  of  the  physical 
potentials  and  has  angular  properties  which  are 
expandable  in  the  low-order  Legendre  polynomi¬ 
als  of  even  parity.  Actually,  only  three  polynomi¬ 
als  are  needed  because  the  potential,  constructed 
below,  contains  only  I  =  0, 2, 4. 

Because  the  hydrogen  molecule  is  composed  of 
identical  atoms,  the  lowest-order  multipole  field 
seen  by  the  projectile  is  the  quadrupole  term 

VQir,e)  =  -^ prices  e),  (42) 

where  Q  =  0.49  in  atomic  units.  This  form  is  valid 
for  vales  of  r  which  are  outside  the  molecule  for 
all  6,  At  shorter  ranges,  the  quadrupole  potential 
is  cut  off  by  the  usual  exponential  term  (1  - 
exp[~(r/ay])  as  a  multiplier.  The  adjustable  pa¬ 
rameter  a  is  set  to  1,  since  no  effort  is  made  in  the 
present  work  to  fit  experimental  data.  The  static 
potential  inside  the  molecule  is  represented  by  a 
Yukawa  function  of  unit  range  and  arbitrary 
strength,  which  is  adjusted  as  necessary  to  give  the 
correct  order  of  magnitude  for  the  K-matrix. 

Polarization  of  the  target  is  well  known  to  in¬ 
clude  terms  of  both  spherical  and  character  [1]. 
The  form  used  here  is 

1 

Vp(r,  e)  =  -^(1  -  e)], 

(43) 

with  Uq  =  5.5,  and  ^2  =  1.4. 

The  exchange  potential  used  here  is  a  crude 
form  of  the  free-electron  gas  model.  The  basic  idea 
here  is  that  the  electron  in  the  target  may  be 
approximated  by  a  degenerate  Fermi  gas  whose 
density  varies  with  position  [1].  The  exchange  po¬ 
tential  is  proportional  to  the  Fermi  momentum  at 
any  point  in  space,  giving 

4;(r)  =  -(A)[3772)3(F)].  (44) 

The  electron  density  p(f0  is  approximated  by  a 
simple  sum, 

1 

IB(r,e)  -  —e-^[l  +  l.SPjicosO)],  (45) 
477 

which  has  contours  of  constant  density  which  re¬ 
semble  those  of  the  sum  of  two  hydrogen  atoms 
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separated  by  0.7  angstroms.  The  resulting  potential 
can  be  approximated  by  a  linear  combination  of 
Legendre  polynomials,  which  is 

V^(r,  0)  =  -0.8c"^/^[0.97  +  0.43P2(cos  0) 

■^OAP^icos  d)].  (46) 

The  "'realistic"  potential  is  a  sum  of  these  three 
terms. 

The  long-range  quadrupole  potential  is  the  most 
important  term  in  real  scattering  problems,  but  it 
does  not  vanish  until  a  distance  of  14  bohr  radii 
from  the  molecule's  center  is  reached.  This  was 
determined  a  posteriori,  by  searching  for  a  region 
in  which  the  K-matrix  does  depend  on  small 
changes  in  Such  a  long  range  requires  a  large 
number  of  radial  points  and,  consequently,  a  long 
running  time  for  the  computer  code.  Although  the 
convergence  properties  of  the  angular  approxima¬ 
tion  are  roughly  independent  of  the  quality  of  the 
radial  approximation,  this  independence  is  valid 
only  when  the  radial  mesh  is  on  the  order  of 
magnitude  of  the  mesh  which  gives  decent  radial 
convergence.  The  P2  terms  in  the  polarization  and 
exchange  potentials  are,  in  any  case,  expected  to 
yield  the  same  requirements  on  the  angular  mesh. 


Results  and  Conclusions 

Convergence  of  the  K-matrix  is  determined  in 
two  way:  The  first  takes  advantage  of  the  invari¬ 
ance  of  the  K-matrix  under  time  reversal  to  re¬ 
quire  that  the  K-matrix  be  symmetric.  The  second 
is  that  the  value  of  an  element  of  the  of  the 
fC-matrix  change  with  changes  in  the  number  of  6 
points  in  a  manner  which  points  clearly  to  an 
asymptotic  limit.  Now,  in  the  present  calculation. 


the  radius  of  the  PDE  solution  region  was  set  at  10 
bohr.  The  partial-wave  amplitudes  were  extracted 
at  the  first  grid  point  of  the  last  element  whose  last 
grid  point  is  p.  The  solution  was  then  propagated 
to  70  bohr  where  the  K-matrix  was  determined. 

The  matrices  are  not  strictly  symmetrical.  When 
the  scattering  potential  is  spherical,  the  wave  func¬ 
tion  should  have  the  angular  properties  of  the 
large-r  boundary  condition,  all  the  way  to  the 
origin.  This  is  true  only  when  the  boundary  condi¬ 
tion  is  for  I  =  0.  When  higher  waves  are  used, 
each  contains  a  contaminant  of  lower  I  values. 
This  contamination  decreases  with  each  step  in  r 
away  from  the  origin. 

The  second  criterion  makes  an  accurate  graphi¬ 
cal  depiction  of  the  convergence  difficult.  The  con¬ 
vergence  of  almost  all  elements  is  monotonic,  so 
the  choice  of  a  specific  limit  is  arbitrary,  yet  it  has 
a  great  impact  on  the  perceived  quality  of  the 
convergence.  A  systematic  study  of  the  conver¬ 
gence  has  not  been  done  as  a  part  of  the  present 
work.  We  set  grid  parameters  which  ensured  accu¬ 
racy  to  at  least  five  significant  figure  in  all  of  the 
X-matrix  elements  for  k  =  .1,  .5,  and  1.0  au.  The 
PDE  r-grid  used  Ar  =  0.1  bohr.  The  PDE  ^-grid 
used  A0  =  10°.  The  propagation  step  size  was  Ar 
=  0.01  bohr.  The  PDE  (with  propagation  from  10 
to  70  bohr)  results  are  displayed  in  Tables  I-III. 
The  answers  were  checked  by  the  propagation 
algorithm  starting  at  the  origin  and  going  out  to  70 
bohr  again  using  Ar  =  0.01  bohr.  The  propagation 
algorithm  actually  produced  X-matrix  accuracy  to 
12  significant  figures.  All  calculations  were  done 
using  64-bit  arithmetic.  Table  IV  shows  the  con¬ 
verged  ( p  =  70  bohr)  X-matrices  as  a  function  of 
the  incident  electron  moment  (k). 

Figure  3,  as  well  as  the  contribution,  com¬ 
pares  the  present  elastic  cross  sections,  calcu- 


TABLE  I _ 

K-matrIx  elements  for  k  =  0.1  au  in  Xg  symmetry;  p  is  the  value,  In  units  of  bohr,  of  r  where  the  K-matrix  was 
determined  symmetric  to  the  number  of  figures  indicated. 


k  =  0.1  au 


p 

K(1.2) 

K(1,3) 

K(2, 2) 

K(2,  3) 

K(3, 3) 

10 

-  .39023E  +  1 

-.18534E  +  0 

-.43518E  - 

2 

~.41619E  - 

2 

-.10151E  - 

3 

.4721 3E  -  4 

30 

-.10282E  +  1 

-.71393E  -  1 

-.22522E  - 

2 

.36462E  - 

1 

.14096E  - 

2 

.24640E  -  2 

50 

-.10267E  +  1 

-.72195E  -  1 

-.22252E  - 

2 

.36821 E  - 

1 

.98354E  - 

3 

.24326E  -  2 

70 

-.10267E  +  1 

-.72271 E  -  1 

-.22363E  - 

2 

.36769E  - 

1 

.96741 E  - 

3 

.22941 E  -  2 
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TABLE  II 


Same  as  Table  I  except  for  k  =  0.5  au. 

P 

k  =  0.5  au 

Kd.D 

K(1,2) 

K(1,3) 

K{2, 2) 

K(2,  3) 

K{3, 3) 

10 

.25061 E  +  1 

.61816E  -H  0 

.77306E  -  1 

.16084E  +  1 

.23790E  +  0 

.6381 7E  +  0 

30 

.34973E  +  1 

.10642E  +  1 

.12152E  +  0 

.21885E  -h  1 

.301 92E  +  0 

.76876E  -h  0 

50 

.34979E  +  1 

.10645E  +  1 

.12158E  +  0 

.21890E  +  1 

.30203E  +  0 

,76885E  +  0 

70 

.34978E  +  1 

.10645E  +  1 

.12158E  +  0 

.21889E  +  1 

.30203E  +  0 

.76883E  +  0 

lated  using  [1] 

477 

(47) 

^  IV 

where 

T  =  1  -  (1  +  zK)(l  -  zK)”\  (48) 

and  the  total  elastic  cross-section  calculation  of 
Jain  and  Baluja  [20]  as  well  as  the  total  experimen¬ 
tal  cross  section  of  Hoffman  et  aL  [21]  It  must  be 


emphasized  that  the  present  calculation  employs  a 
model  e~  -  H2  potential  and,  thus,  the  interesting 
agreement  with  the  experimental  results  of  Hoff¬ 
man  et  al.  is  fortuitous.  These  results  simply  indi¬ 
cate  that  the  model  for  the  interaction  potential  has 
a  realistic  aspect  to  it. 

The  finite  element  method  has  been  shown  to 
be  an  effective  numerical  method  for  the  solution 
of  the  time-independent  electron-molecule  scatter¬ 
ing  problem.  The  next  step  is  to  generalize  to 
nonlocal  potentials  and  coupled  equations  in  order 
to  treat  real  molecular  targets  realistically. 


TABLE  III _ 

Same  as  Table  I  except  for  k  =  au. 


/c  =  1 .0  au 


P 

K(1,2) 

K(1,3) 

K{2, 2) 

K(2,  3) 

K(3,3) 

10 

.67237E  +  0 

.11579E  +  0 

.85839E  -  2 

.92080E  +  0 

.11500E  4-  0 

.72282E  +  0 

30 

.72882E  +  0 

.12633E  -h  0 

.81894E  -  2 

.10158E  +  1 

.12908E  +  0 

.801 66E  -H  0 

50 

.72888E  -h  0 

.12632E  -H  0 

.81858E  -  2 

.10159E  +  1 

.12908E  +  0 

.801 72E  +  0 

70 

.72887E  +  0 

.12632E  +  0 

.81852E  -  2 

.10159E  +  1 

.12908E  -h  0 

.801 71 E  +  0 

TABLE  IV 


Converged  K-matrix  elements  (p  =  70  bohr)  in  Xg  symmetry. 


k 

K(1,1) 

K(1,2) 

K(1,3) 

K(2,  2) 

K(2,3) 

K(3, 3) 

0.10 

-.10267E  +  1 

-. 72271 E 

-  1 

-.22363E 

-  2 

.36769E 

_ 

1 

.96741 E  -  3 

.22941 E  -  2 

0.25 

-  .3530E  +  1 

-.88189E 

+  0 

-. 68481 E 

-  1 

.79764E 

+ 

0 

.80772E  -  1 

.19074E  +  0 

0.50 

.34978E  +  1 

.10645E 

+  1 

.12158E 

+  0 

.21889E 

1 

.30203E  +  0 

.76883E  +  0 

0.75 

.12721 E  +  1 

.27909E 

-h  0 

.26470E 

-  1 

.14367E 

+ 

1 

.20023E  +  0 

.88056E  +  0 

1.00 

.72887E  +  0 

.12632E 

0 

.81852E 

-  2 

.10159E 

1 

.12908E  -H  0 

.801 71 E  +  0 

1.25 

.4521  IE  +  0 

.6731 4E 

-  1 

.26203E 

-  2 

.75040E 

-H 

0 

.86545E  -  1 

.68998E  +  0 

1.50 

.27464E  +  0 

.36056E 

-  1 

.52522E 

-  3 

.56448E 

+ 

0 

.59900E  -  1 

.58408E  +  0 

1.75 

.15123E  +  0 

.15902E 

- 1 

-  .45278E 

-  3 

.4241 OE 

+ 

0 

.41872E  -  1 

.49046E  +  0 

2.00 

.65783E  -  1 

.13066E 

-  2 

-.10867E 

-  2 

.31309E 

+ 

0 

.28755E  -  1 

.40838E  +  0 

2.25 

.11581E  -  1 

-.96914E 

-  2 

-.16741 E 

-  2 

.22331 E 

+ 

0 

.18616E  -  1 

.33608E  +  0 

2.50 

-.15535E  -  1 

-.17612E 

-  1 

-.23247E 

-  2 

.15070E 

+ 

0 

.10418E  -  1 

.27204E  +  0 

2.75 

-  .20396E  -  1 

-  .22432E 

-  1 

-.30457E 

-  2 

.93306E 

- 

1 

.36233E  -  2 

.21529E  +  0 

3.00 

-  .95865E  -  2 

-.241 41 E 

-  1 

-  .37803E 

-  2 

.50201 E 

- 

1 

-  .20086E  -  2 

.16531 E  +  0 
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FIGURE  3.  Cross  section  (•••)  present  elastic 
cross  section  for  Hg  model  potential;  (a  a  a)  calculation 
of  elastic  cross  section  (Jain  and  Baluja  [20]);  (000) 
experimental  total  cross  section  (Hoffman  et  al.  [21]). 
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ABSTRACT:  Some  preliminary  numerical  studies  of  scattering  by  a  potential  separable 
in  a  general  ellipsoidal  coordinate  system  are  presented.  The  associated  scattering  or 
far-field  amplitudes  are  evaluated  using  expansions  in  radial  and  angle  Lame  wave 
functions.  Simple  representations  of  the  scattering  matrix  and  the  far-field  amplitude  are 
given  as  well  as  some  preliminary  figures  of  scattering  data  and  related  potential  level 
surfaces.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  601-608,  1997 


Introduction 

When  separating  the  variables  in  terms  of 
ellipsoidal  coordinates  (see  Fig,  1),  particu¬ 
lar  solutions  of  the  Helmholtz  equation  are  formed 
as  products  of  the  so-called  angle  and  radial  ellip¬ 
soidal  (or  Lame)  wave  functions.  For  comparison, 
in  case  of  a  sphere,  the  products  of  the  angle 
jfunctions  are  the  spherical  harmonics,  while  the 
radial  ones  are  the  Bessel  and  Hankel  functions. 

A  potentially  very  useful  method  to  study,  cal¬ 
culate,  and  analyze  these  ellipsoidal  wave  func- 
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tions  was  presented,  during  the  last  7-8  years,  by 
the  powerful  Abramov  School  at  the  Department 
of  Numerical  Methods  at  the  Computing  Center  of 
the  Russian  Academy  of  Sciences  [1-3],  This  work 
was,  to  a  large  extent,  inspired  by  M.  V.  Fedoryuk, 
who  investigated  the  fundamental  properties  of 
the  ellipsoidal  wave  functions  (see  [4,5]  and  the 
literature  cited  therein).  Furthermore,  his  articles 
[6, 7]  on  acoustic  scattering  by  a  triaxial  ellipsoid 
are  also  of  great  importance  here,  as  he  was  the 
first  to  obtain  the  explicit  form  of  the  series  expan¬ 
sions  of  scattering  data  in  terms  of  ellipsoidal 
wave  functions. 

It  should  be  noted  in  this  context  that  we  are 
not  simply  interested  in  the  ellipsoidal  wave  func¬ 
tions  as  such,  but  we  have,  rather,  a  more  general 
viewpoint  in  mind.  For  instance,  one  may  apply 
the  Fourier  technique  to  solve  the  Schrbdinger 
equation,  thus  arriving  at  a  system  of  problems 
very  similar  to  those  arising  in  the  study  of  the 
Helmholtz  equation.  In  the  particular  case  of  the 
potential  plotted  in  Figure  2,  the  angular  functions 
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FIGURE  1.  Examples  of  ellipsoidal  coordinate  surfaces.  The  grid  is  produced  by  the  intersection  of  the  coordinate 
surfaces.  For  more  details,  see  text. 


FIGURE  2.  Equipotential  surfaces  of  the  potential  ^3)  =  “  fi)!-  Two  adjacent  octants  are 

displayed  to  show  the  mirror  symmetry  of  the  potential. 
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will  coincide  with  the  angular  ellipsoidal  wave 
functions.  The  methods  developed  in  [1-3]  allow 
us  to  compute  these  perturbed  ellipsoidal  wave 
functions  for  an  arbitrary  potential  separable  in 
ellipsoidal  coordinates  and  vanishing  sufficiently 
fast  at  infinity  and,  more  importantly,  to  represent 
scattering  data  in  them.  The  present  technique 
should  also  lead  to  many  important  applications  to 
general  problems  in  physics  and  chemistry,  where 
the  standard  partial  wave  analysis  may  extend 
into  the  regime  of  thousands  and  more  waves — 
obviously,  by  trying  to  manage  a  description  of  a 
system  far  from  spherical  symmetry. 

In  a  recent  contribution  [8],  we  examined  the 
shapes  of  the  partial  waves  in  an  ellipsoidal  repre¬ 
sentation  using  so-called  pct^-diagrams,  where 

^  (^2  _  ^2  _  are 

defined  [see  Eqs.  (l)-(4)  below].  It  is  easy  to  see 
that  the  flexibility  offered  by  the  ellipsoidal  coordi¬ 
nate  representation  is  obviously  absent  in  the 
standard  choices  of  coordinate  systems. 

We  will  here  consider  the  scattering  of  particles 
by  a  potential  which  enables  a  separation  of  the 
variables  in  ellipsoidal  coordinates,  i.e.,  we  look 
for  the  solution  to  the  Schrodinger  equation 

=  0,  (1) 

defined  at  infinity  by 


expz(fc,  r) 

expiikr) 
-h  fin,  Hq) - 


(r  oo)  (2) 


where  fin,  nf)  is  the  far-field  or  scattering  ampli¬ 
tude,  n  =  r/r  and  n^  =  k/k.  Our  aim  was  to  ex¬ 
pand  it  in  terms  of  the  eigenfunctions  of  the 
Sturm-Liouville  problem  arising  as  a  result  of 
separation  of  variables  in  (1).  In  what  follows,  the 
variables  introduced,  in  e.g.,  [1-3]  are  used. 


Separation  of  Variables 

Given  a  >  b  >  c  >  0,  the  ellipsoidal  coordinate 
system  is  defined  by 

x^/ia^  +  17)  +  y^/ib'^  +  T/)  -f  z^/ic^  -\-  7])  =  1, 
or,  going  over  to  the  dimensionless  variables  ^  = 


ir]  +  a^)/ia^  -  b^),  being  used  further  below,  by 

+  7"2 - 1  2\/  - 2\  "" 

(r  -  bvi^-  pO 

Here,  we  have  denoted 

p^  =  ia^  —  c^)/ia^  “  b^)  >  1. 

This  equation  defines  a  family  of  confocal  sur¬ 
faces  specified  by  the  variable  To  obtain  a  deeper 
understanding  of  the  ellipsoidal  coordinate  sys¬ 
tem,  one  should  contrast  Figure  1  with  Eq.  (3).  For 
each  point  ix,  y,  z),  xyz  ^  0,  the  latter  exhibits 
exactly  three  different  roots,  ^  =  ^1,  ^2/  ^3/  where 
e  =  (0;  1),  I2  ^  4  =  (1;  P^X  and  ^3^/3  = 
( p^;  00).  The  equation  ^3  =  ^3^  e  f  defines  an  el¬ 
lipsoid  with  the  semiaxes  ia^  -  b^)^^  >  iof  - 
”  1)  >  ~  P^)-  As  ^3  tends  to 

p^  +  0,  ellipsoids  degenerate  into  the  part  of  the 
plane  z  =  0  bound  by  the  focal  ellipse  x^/ia^  - 
c^)  -I-  y^/ib^  “  c^)  =  1.  Setting  ^2  =  P^/  we  obtain 
the  part  of  the  plane  2  =  0  complement  to  ^3  =  p^. 
As  ^2  diminishes  from  p^  to  1,  the  equation 
^2  ^  ^2^  specifies  a  hyperboloid  of  one  sheet.  As  ^2 
tends  to  1  +  0,  hyperboloids  degenerate  to  the  part 
of  the  plane  y  =  0,  enclosed  between  the  branches 
of  the  focal  hyperbola  x^/ia^  ~  b"^)  -  z^/ib^  - 
c^)  =  1.  The  remaining  part  of  the  plane,  which 
consists  of  two  disjoined  regions,  containing  foci  of 
the  hyperbola,  is  given  by  =  1.  Starting  from 
=  1,  a  family  of  hyperboloids  of  two  sheets, 
each  described  by  tends  to  the  plane  x  = 

0,  which  coincides  with  the  surface  =  0. 

The  above  three  families  of  confocal  surfaces 
form  an  orthogonal  curvilinear  coordinate  system 
related  to  Cartesian  coordinates  within  each  octant 
(e.g.,  X  >  0,  y  >  0,  z  >  0)  by  the  one-to-one  corre¬ 
spondence: 

=  -(fl^  -  b^)(^-f  -  1)(^2  “  1) 
x(^3  -  l)/(p^  -  1), 

X(4-p2)/[(p^-l)p2]. 

In  Figure  1,  one  can  see  the  ellipsoidal  coordinate 
surfaces  of  all  possible  types.  Here,  we  cut  off  a 
one-eighth  part  of  the  ellipsoid  to  show  the  hyper¬ 
boloids  of  both  one  and  two  sheets.  It  is  readily 
seen  that  there  is  a  point  of  intersection  of  all  the 
plotted  surfaces  in  each  octant. 
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We  now  proceed  as  follows:  Denote 

;  =  1,3,  with  /(^)  -  ^(^- 1)(^- p^), 
op-  =  k^{P  —  P), 

Then,  in  the  newly  introduced  coordinates,  the 
Lame  coefficients  (see  below)  are  =  (^i  ~ 
^2X^1  “  “  ^^V(4/(^i))  (h2  and  result 

from  the  above  formula  by  a  cyclic  permutation  of 

the  indexes),  with  Eq.  (1)  taking  the  form 

{[(^3  “  ^2)^1  (^1  “  ^3)^2  ^^2  ” 

+  i(^l"f2)(^2"^3)(^3”^l) 

x[(o^-V(^,,^^,^,)]}u(r)  =  0,  (4) 

In  (4),  we  will  admit  a  separable  potential  of  the 
type 

=[(^3-^2)a(^i)  +(^i-^3)i3(|^2) 

X(^2-^3)(^3-^i)]''  (5) 

Here  a,  j8,  and  y  are  arbitrary  functions.  This  is, 
in  fact,  the  most  general  expression  for  such  a 
potential  [9]. 


By  substituting  a  particular  solution  ^(r)  == 
into  (1),  we  obtain  three 
equations  similar  to  the  Lame  wave  equations: 


0))  —  a( ^|)] A^^^  — 0, 

0  <  <  1,  (6) 


-\[q(^2^h,l,co)  -  =  0, 

l<^2<p^  (7) 


+  \[q(  ^3/  h,  l,  w)  —  yf  =  0, 

<  ^3  <  00,  (8) 

where  ft,  I  are  the  separation  constants, 

q(^,  h,  I,  (o)  =  hp^  -  Ip^  +  0)^'^. 

Taking  the  symmetry  of  the  problem  into  ac¬ 
count,  a  particular  solution  of  (1)  must  be  either 
odd  or  even  with  respect  to  each  coordinate  plane. 
Since  coordinate  planes  correspond  to  the  ends  of 
the  intervals  of  the  ellipsoidal  coordinate  varia¬ 
tions,  we  find  that 


either  ^lim^[//XiT  =  0/ 

either  ^  lim  =  0, 

either  lim  (/-/(^  A'2(^2))  =  0/ 
f2-»l  +  0'  ' 

either  lim  U ^2(^2))  =  0/ 
either  lim  ^3(^3)]  =  0, 


or  lim  Aj(^i)  =  0, 

+0 

or  lim  AA^.)  =  0, 

^1-1-0 

or  lim  A2(^2)  =  0/ 
^2^1  +  0 

or  lim  A2(^2)^0/ 
or  lim  A3(^3)  =  0. 


(9) 


Here,  we  note  that,  in  order  to  keep  the  same 
parity  with  respect  to  a  coordinate  plane,  it  is 
necessary  that  the  boundary  conditions  for  ^ 
1-0  and  for  ^2  1  +  0  as  well  as  for  ^2  “  0 

and  for  ^3  p^  4-  0  are  identical.  As  in  the  case  of 
ellipsoidal  wave  functions,  the  ends  of  the  inter¬ 
vals  are  singular  points  for  (6)-(8).  By  a  proper 


change  of  variables,  one  can,  however,  avoid  these 
singularities  [1,  3]. 

Thus,  we  obtain  eight  boundary  value  prob¬ 
lems,  each  specified  by  a  triplet  of  binary  indexes 
(z^,  fy,  f^)/  where  q  =  0  (y  =  x,  y,  z)  for  a  particu¬ 
lar  solution  ^P'Cr)  of  even  parity  with  respect  to  the 
plane  ;  =  0  [see  the  left  column  of  (9)],  while 
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i.  =  1  (j  =  X,  y,  z)  corresponds  to  ^(r)  of  odd 
parity  with  respect  to  ;  =  0  [the  right  column  of 

(9) ].  Each  particular  solution  can  be  extended 
smoothly  over  the  whole  space  according  to  its 
parity  properties  [i.e.,  in  accordance  with 

Given  the  set  of  boundary  conditions 
represented  by  the  triplet  and  consider¬ 
ing  Eqs.  (6)  and  (7)  on  the  intervals  and 

^2^4/  arrive  at  the  following  two- 

parameter  Sturm-Liouville  self-adjoint  problem: 
We  look  for  such  a  couple  of  values  for  [h,  1}  that 
nontrivial  solutions  of  (6)  and  (7)  exist  simultane¬ 
ously  on  both  intervals,  satisfying  the  above- 
mentioned  conditions  corresponding  to  the  triplet 

Let  a  and  p  be  continuous.  It  can  be  proved 
that  for  any  multi-index  {m,n)  there  exists  pre¬ 
cisely  one  eigenvalue  (h,  Z)"'  such  that  the  corre¬ 
sponding  solution  to  (6)  oscillates  m  times  within 
4,  while  the  solution  to  (7)  has  (n  —  m)  zeroes 
inside  4.  Besides,  this  two-parameter  problem  is 
left-definite  with  respect  to  the  vector  |ul  =  (0, 1); 
hence,  the  eigenfunctions 

form  a  complete  system  on  cr),  where  11  = 

4  X  4  and 

[10] .  In  passing,  we  note  that  the  functions 

^2)'  associated  with  distinct  points  of  the 
spectrum,  are  orthogonal,  with  respect  to  the 
weight  ^2)/  U.  Extended  according  to  the 
triplet  (z4,  fy,  4)  and  normalized  by 

4  / [a'^i ^2)]V( ^1,  ^2)  =  1/8,  (10) 

all  the  eigenfunctions  for  all  sets  thus 

constitute  an  orthonormal  basis  in  a),  where 

5  is  an  arbitrary  ellipsoid  defined  by  ^3  =  ^3*  e  4. 

The  functions  ^2)  specified  in  this  way  are 

determined  uniquely  up  to  a  sign.  Any  two  func¬ 
tions  with  m  ¥=  m'  and/or  n  n',  are 

obviously  orthogonal  (if  they  are  subject  to  the 
same  boundary  conditions,  they  are  orthogonal  on 
each  part  of  S  situated  within  a  coordinate  octant; 
otherwise,  their  inner  product  vanishes  on  S  be¬ 
cause  they  are  of  different  parity). 

The  completeness  of  the  function  system,  de¬ 
fined  above,  follows  from  the  fact  that  each  func¬ 
tion  from  may  be  represented  as  a  sum 

of  eight  terms  of  distinct  parity  with  respect  to 


coordinate  planes: 

u  =  Vq  xv-y  yv2  +  21^3  +  xyv^ 

+  xzv^  +  yzv^  -\~  xyzvj 

[here,  v^  =  v^{x^,y^,z^)].  One  can  therefore  ex¬ 
pand  each  term  in  the  above  equality  as  a  series  in 
the  functions  ^2^  Ihe  corresponding  par¬ 
ity  due  to  their  completeness  on  The 

expansion  holds  for  all  coordinate  octants  and  the 
sum  of  all  the  eight  series  yields  then  the  expan¬ 
sion  of  the  original  function  in  terms  of  ^2) 

(for  more  details,  see  also  [3]). 

In  [8],  a  natural  way  to  classify  these  surface 
functions  in  relation  to  the  conventional  classifica¬ 
tion  in  the  spherical  case  was  proposed.  The  so- 
called  pa>-diagrams  utilize  the  property  that  for 
small  0)  the  shapes  of  the  partial  angular  distribu¬ 
tion  are  almost  independent  of  the  geometry  of  the 
problem  and  thus  that  the  orbitals  are  close  to  the 
spherical  case  (see  also  [11]). 

On  4/  ^  nontrivial  solution  to  (8),  subject  to 
"the  boundary  condition,"  exists  for  any  h  and 
Z.  If  7(^3)  as  ^3  00^  with  q  <  \  (for  sim¬ 

plicity),  then  (cf.  the  case  of  radial  ellipsoidal  func¬ 
tions)  the  behavior  of  such  a  solution  at  infinity  is 
given  by 


A®(^3)  =  +  AU,h)) 


^3  , 


(11) 


where  D  is  of  an  arbitrary  constant  value  (see 
below)  and  the  phase  A(Z,  h)  expresses  the  influ¬ 
ence  of  the  potential  on  the  scattered  particle  mo¬ 
tion:  If,  e.g.,  a(^i)  =  /3(^2)  =  7(^3)  =  0/  then  the 
functions  and  A®  ,,(^3)  coin- 

cide  with  the  corresponding  ellipsoidal  wave  func¬ 
tions  with  the  phase  given  by 

AZ  =  AOZ.K)  =  -7r[i,  +  z*  +  Zj/2  (mod  77). 

(12) 


It  is  interesting  to  note  that  the  last  relation  has  not 
yet  been  proven  thus  far.  Since  we  have  only 
observed  it  in  practical  calculations  [3],  it  still 
remains  as  a  conjecture. 

As  for  the  asymptotical  relation  (11),  the  proof 
is  analogous  to  the  one  used  in  [2]  for  the  radial 
ellipsoidal  wave  functions.  Calculations  of  the 
functions  A®  „(^2X  and  A®  „(^3)  are 

carried  out  by  applying  previously  developed 
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techniques  to  calculate  ellipsoidal  wave  functions 
(see  [1,2]).  The  method  for  calculating  the  radial 
function  [2]  exploits  the  asymptotic  behavior  dis~ 
played  in  Eq.  (11).  Instead  of  computing  the  solu¬ 
tion  to  Eq.  (8),  we  integrated  an  auxiliary  equation 
for  a  modified  Priifer  angle,  a  modification  equat¬ 
ing  the  latter  to  A(/,  h)  at  infinity.  However,  to 
speed  up  the  calculations  and  to  diminish  the 
errors  due  to  ODE  numerical  integration,  we  use 
additional  terms  of  the  radial  function  asymp¬ 
totics,  allowing  us  to  reduce  the  integration  inter¬ 
val  (see  [12, 13]). 


Scattering  Data  Expansions 

Following  [14],  we  can  now  find  the  connection 
between  the  functions  H(  ^2)  the  eigenfunc¬ 
tions  of  the  scattering  operator  S  =  1  +  (ik/lir)/. 
We  will  further  use  the  asymptotic  representation 
of  the  particular  solution  in  the  form,  putting 

D  =  1/  [see  (11)], 

^;Tr)  -  H;;^(^i,^2)^{exp[f(fcr  +  A^')] 

“exp[  -i(kr  +  A'”)]},  (r  ^  00).  (13) 

Note  that  as  ^3  tends  to  infinity  the  ellipsoidal 
coordinates  as  mentioned  above  approach  the 
spherical  ones,  i.e., 

~  b^)  ~  (r  00),  (14) 

and  (under  the  tradition  of  acoustics,  6  is  meas¬ 
ured  in  relation  to  the  large  semiaxis  which  here 
happens  to  be  the  X-axis) 

^2  =  COS^  0, 

(  -  1)(  ^2  -  1)  =  -(  -  l)sin^  e  cos^  cp, 

(^1  -  p^)(^2  “  P^)  =  P^(P^  “  Dsin^Psin^  <p 

(15) 

(see  [6, 7]  for  more  details). 

The  behavior  of  an  arbitrary  solution  to  (1)  can 
now  be  expressed  with  the  aid  of  the  scattering 
operator  S  (defined  on  the  unit  sphere): 

1  , 

^(r,  fc)  — — {exp(zlr)S(w,  Mo)F(-no) 

2ikr  ^ 

-exp(-f/cr)F(no)},  (r  ^  00),  (16) 

where  F(no)  depends  on  the  angles  6^  of  the 
incidence  direction. 


Introducing  the  inversion  (with  respect  to  the 
origin)  operator  J, 

JF(n)  =  F(-n), 
we  will,  instead  of  (16),  obtain 
1  , 

'?^(r,  fc)  -  ^^{exp(zlr)S(n,  «o)I 

-exp(-z7:r)}F(no),  (r 00).  (17) 
Rewriting  (13)  as 
1 

— -{exp(fA:r)  exp(2/A"0  ~  exp(-/A:r)} 
2ikr 

x[exp(-zA'pH;;X^i,  ^2)]/ 

and  comparing  it  with  (17),  we  find  that 
Fitto)  =  exp(-zA';:)H,T(^«,  ^2“). 

where  ^2  connected  with  the  angles  (p^,  0^ 
by  (15)  and  with  ^2)  being  the  eigenfunc¬ 

tion  of  SI: 

s7H:r(^^  ^2")  =  exp(2zApH;r(^l^ 

In  view  of  Eq.  (14)  and  since  cr(^^,  ^2) 
lim^^^«  [h-^h2/ia^  -  b'^)^^],  we  have 

(  f  (p)]^  sin  6  do  dip 

•'0  *'0 

-  lim  dS 

00  y  j  ^  j 

=  li*"  72  /  ,2 

r  oc  r  •'  ^  J 

X  /z^/z2  d^-^  d^2 

=  8  f  dii 

•'n 

(18) 

(here,  is  a  sphere  of  radius  r).  Thus,  the  matrix 
elements  of  the  scattering  operator  in  terms  of  the 
functions  Hj”(^i,  ^2)  latter  constitute  an 

orthonormal  system) 

<m,  n\  S  |m',  n') 

=  exp(2zA”J)  (m,n\  l\m' ,n') 

=  exp(2zA'p  <m,  n  \  m' ,  n') 

^  ,  m '^n,n'  ,  / y ,  / ,)( 
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or,  written  differently, 

S\n,no)=  E  £  E 

('x-'j/'D  n  =  0  m  =  0  ■' 

Therefore,  the  far-field  or  scattering  amplitude  is  given  by 


n  =  0  m  =  0 


As  a  check,  we  note  that  for 

a(^i)  =  ^(^2)  =  r(^3)-0, 

taking  (12)  into  account,  we  get  /(«,  Hq)  =  0. 


Examples  and  Discussion 

In  Figure  2,  we  displayed  a  scattering  potential, 
separable  in  ellipsoidal  coordinates,  given  by  the 
formula  V{^^,  I2,  ^3)  =  [toA^s  '  ^2X^3  -  ^1)]. 
As  the  potential  goes  to  infinity,  the  corresponding 
equipotential  surfaces  approach  the  focal  ellips  (cf. 
the  description  of  the  ellipsoidal  coordinate  sys¬ 
tem).  Far  from  the  origin,  the  potential  vanishes  as 
and  the  corresponding  equipotential  surfaces 
approach  a  spherical  shape.  Generally,  the  shape 
of  the  equipotential  surfaces  does  not  depend  on 


the  value  y^.  We  have  here  only  plotted  a  quarter 
of  each  surface  for  three  typical  cases. 

The  corresponding  numerical  results  are  dis¬ 
played  in  Figure  3.  One  can  see  here  the  examples 
of  scattering  diagrams,  i.e.,  the  plots  k  |/(n,  Mq)|  for 
different  incident  directions:  parallel  to  axis  OX, 
parallel  to  axis  OY,  and  parallel  to  axis  OZ,  from 
left  to  right.  The  parameter  values  are  ^  2.0, 
0)  ==  10.0,  and  yo  “0.1,  the  accuracy  of  the 
auxiliary  ODE  calculations  is  £:  =  10“^,  and  the 
maximum  n  =  15.  We  have  not  displayed  the  scat¬ 
tering  amplitude  for  positive  yo,  since  it  will  not 
give  a  large  visual  difference.  In  this  particular 
case,  the  potential  will  only  perturb  the  phase  shift 
— a  perturbation  effect  which  is  approximately  lin¬ 
ear.  From  the  series  expansion  of  /(n,  Hq),  it  fol¬ 
lows  that  its  absolute  value  does  not  depend  on 
the  sign  of  the  potential.  This  means  that  within 
the  linear  domain  the  amplitudes  in  Figure  3  would 


0=0  cp=0  Q~k/2  9=0  6=71/2  9=7c/2 

FIGURE  3.  Examples  of  k \f(n,  n^)]  for  the  potential  displayed  in  Figure  2.  Various  incident  directions  are  shown  from 
left  to  right. 
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naturally  scale  with  the  potential  so  that  their 
shapes  are  preserved. 
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ABSTRACT:  Electronic  structure  calculations,  carried  out  at  the  HF/6-311G*, 
MP2/6-311G*,  and  BLYP/6-311G*  levels  of  theory,  reveal  an  unexpectedly  large 
influence  of  electron  correlation  on  bonding  in  the  octasulfur  dication  Sg"^.  Whereas  a 
structure  with  a  strong  transannular  bond  linking  two  sulfur  atoms  with  localized 
positive  charges  is  predicted  within  the  Hartree-Fock  (HF)  approximation,  second-order 
Moller-Plesset  (MP2)  and  Becke-Lee- Yang-Parr  density  functional  (BLYP)  calculations 
produce  delocalized  dications  in  which  the  transannular  interactions  are  completely 
missing.  Rigorous  analysis  of  the  computed  electronic  wave  functions  explains  this 
phenomenon,  which  is  also  observed  in  the  HF/TZVP  and  MP2/TZVP  optimized 
geometries  of  Seg"^.  In  agreement  with  the  experimental  data,  the  endo-exo  conformers 
are  found  to  possess  the  lowest  energies  for  the  octachalcogen  dications,  in  contrast  to  the 
corresponding  neutral  species  in  which  steric  interactions  determine  the  relative  energies, 
favoring  the  exo-exo  ring  conformations.  None  of  the  three  theoretical  approaches  is 
capable  of  reproducing  the  experimental  solid-state  geometries  of  Sg"^  and  Se|^,  which 
are  almost  certainly  distorted  to  a  large  degree  by  the  crystal  packing  and  counterion 
effects.  The  results  of  the  present  study  underscore  the  need  for  a  concerted  experimental 
and  theoretical  effort  to  provide  the  definitive  answer  to  the  question  of  transannular 
interactions  in  chalcogen  compounds.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65: 
609-616,  1997 


Introduction 

With  its  propensity  to  catenation  (formation 
of  homocyclic  chains),  sulfur  occupies  a 
unique  place  among  the  known  elements.  Com- 
Correspondence  to:  J.  Cioslowski. 

Contract  grant  sponsor:  National  Science  Foundation. 
Contract  grant  number:  CHE-9224806. 


bined  with  flexible  packing  in  solid  state,  the  large 
number  of  homocyclic  compounds  of  sulfur  gives 
rise  to  a  multitude  of  allotropic  modifications.  So 
far,  no  fewer  than  17  allotropes  of  sulfur,  namely 
Sg,  Sy  (4  modifications),  Sg  (3  modifications),  S9  (2 
modifications),  S^q,  S^,  S^g  (2  conformers), 

and  S20,  have  been  isolated  in  pure,  crystalline 
form  [1].  In  fact,  sulfur  remains  the  element  with 
the  largest  number  of  known  allotropes,  although 
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this  may  change  in  the  near  future  with  the 
prospective  discoveries  of  new  fullerenes  [2], 

The  most  common  form  of  sulfur  is  the  octasul- 
fur  Sg  [1],  which  exists  in  three  crystalline  modifi¬ 
cations,  each  composed  of  eight-membered  rings 
possessing  the  exo-exo  conformation  [Fig.  1(a)] 
[3-5].  The  monoclinic  a-selenium  and  y-selenium 
have  analogous  crystal  structures  [6,  7].  In  con¬ 
trast,  salts  containing  the  Sg^  and  Scg"^  dications 
form  crystals  in  which  the  chalcogen  homocycles 
assume  the  less  symmetrical  endo-exo  conforma¬ 
tion  [Fig.  1(b)]  [8-11].  These  salts  are  prepared  by 
the  oxidation  of  Sg  with  a  variety  of  reagents  such 


c 


FIGURE  1.  Three  conformations  of  the  eight-membered 
homocycles  of  sulfur  and  selenium:  (a)  exo-exo,  (b) 
endo-exo,  (c)  endo-endo. 


as  AsFg  or  SbFg  [12]: 

Sg  +  SXFg 

^  +  XFg,  X  =  As  or  Sb.  (1) 

The  salts  of  the  Seg^  dication  can  be  obtained  in  a 
similar  way  [13]. 

The  short  intraring  S3-S7  distance  [Fig  1(b)] 
constitutes  a  prominent  structural  feature  of  the 
experimentally  determined  solid-state  geometries 
of  the  Sg'^  dication.  The  proximity  of  these  two 
atoms,  which  are  found  to  be  spaced  by  only  ca. 
2.9  A  [8,  9],  has  given  rise  to  the  speculation  of 
transannular  interactions  operating  across  the  sul¬ 
fur  octagons.  Early  theoretical  studies  have  seemed 
to  confirm  the  presence  of  such  interactions.  A 
short  (R  =  2.22  A)  transannular  bond  has  been 
found  in  HF/STO-3G  calculations  on  Sg'^  [14]. 
Although  the  properties  of  this  bond  have  been 
analyzed,  the  discrepancy  between  the  experimen¬ 
tal  and  theoretical  bond  lengths  has  been  neither 
explained  nor  even  discussed.  An  even  shorter 
bond  (R  =  2.060  A)  has  been  predicted  by  the 
MNDO  method  [15].  INDO  calculations,  carried 
out  at  the  experimental  geometries  of  the  dications 
in  question,  have  produced  localized  molecular 
orbitals  spanning  transannular  bonds  in  Sg”^  and 
Seg"^  [16].  For  the  latter  species,  no  ab  initio  elec¬ 
tronic  structure  studies  have  ever  been  reported. 

The  aforedescribed  results  clearly  demonstrate 
the  tendency  of  the  Hartree-Fock  (HF)  method  to 
grossly  exaggerate  the  binding  between  the 
transannular  atoms  in  the  octachalcogen  dications. 
It  is  therefore  obvious  that  approximate  treatments 
of  electron  correlation  have  to  be  invoked  to  pro¬ 
vide  a  balanced  description  of  bonding  in  these 
systems.  Results  of  such  a  study  are  reported  in 
this  work.  In  particular,  relative  stabilities  and 
geometries  of  the  three  conformers  of  the  Sg'^  and 
Seg"^  dications  are  predicted.  The  failures  of  the 
earlier  calculations  and  the  difficulties  encoun¬ 
tered  in  the  present  study  are  traced  down  to  the 
unusual  electronic  structures  of  these  species.  The 
implications  of  the  new  predictions  to  the  future 
experimental  and  theoretical  work  are  discussed. 


Details  of  Calculations 

All  the  calculations  described  in  this  study  were 
carried  out  with  the  GAUSSIAN  G92 /DFT  system 
of  programs  [17]  running  on  an  SP2  cluster  located 
at  FSU /SCRI.  For  the  sulfur  homocycles,  the  basis 
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set  of  McLean  and  Chandler  [18]  augmented  with 
polarization  functions  (as  implemented  under  the 
name  6-311G*  in  GAUSSIAN  92/DFT)  was  used. 
Optimized  geometries  and  vibrational  frequencies 
were  computed  at  the  HF/6-311G*,  MP2/6-311G* 
(frozen  core),  and  BLYP/6-311G*  levels  of  theory. 
Less  extensive  calculations  were  performed  for  the 
three  conformers  of  Seg  and  Seg^,  geometries  of 
which  were  optimized  at  the  HF/TZVP  and 
MP2/TZVP  (frozen  core)  levels  of  theory  [19].  The 
properties  of  atoms  in  the  systems  under  study 
were  determined  with  the  algorithm  described 
elsewhere  [20].  The  covalent  bond  orders  [21]  and 
the  AOM-localized  natural  orbitals  [22]  were  com¬ 
puted  from  the  second-order  Moller-Plesset  (MP2) 
one-electron  density  matrices. 


Results 

Analysis  of  the  data  compiled  in  Table  I  leads  to 
the  conclusion  that  the  stabilities  of  the  individual 
conformers  of  Sg  and  Seg  are  determined  mostly 
by  steric  effects.  This  conclusion  is  borne  out  by 
two  observations.  First,  the  ordering  of  the  relative 
energies  follows  that  of  the  increasing  crowding 
between  the  nonbonded  chalcogen  atoms,  the 
exo-exo  conformers  being  found  the  most  stable 
and  the  endo~endo  ones  the  least.  Second,  the  com¬ 
puted  energies  barely  change  upon  the  inclusion  of 
electron  correlation  effects,  implying  the  predomi¬ 
nance  of  interactions  that  are  not  specific  to  the 
electronic  properties  of  any  particular  conformer. 
In  contrast,  the  relative  stabilities  of  the  Sg^  and 
Seg"^  dications  do  not  appear  to  be  governed  by 
steric  crowding  and  are  strongly  affected  by  elec¬ 
tron  correlation,  the  endo-exo  octacycles  having 
the  lowest  energies  at  all  levels  of  theory  but  the 
HF  energy  differences  being  too  small  to  account 
for  the  experimental  failure  to  observe  the  other 
two  conformers. 

The  presence  of  unusual  bonding  in  the  dicat¬ 
ions  under  study  is  also  hinted  at  by  the  relative 
magnitudes  of  the  second  adiabatic  ionization  po¬ 
tentials  of  the  parent  species  (Table  II).  The  com¬ 
puted  MP2  and  BLYP  ionization  potentials  are 
significantly  lower  than  their  HF  counterparts,  de¬ 
spite  the  fact  that  the  electron  correlation  effects 
tend  to  increase  in  importance  with  the  number  of 
electrons,  in  general  stabilizing  the  neutral  systems 
more  than  the  doubly  charged  ones.  As  expected, 
Seg  is  predicted  to  undergo  oxidation  somewhat 
more  readily  than  Sg. 


TRANSANNULAR  INTERACTIONS  IN  8^+  AND 


TABLE  I _ 

Relative  energies  (kcal/mol)  of  the  eight-membered 
homocycles  of  sulfur  and  selenium,® 


Conformation 

Method 

Species 

(symmetry) 

HF 

MP2 

BLYP 

Sa 

exo-exo 

0.0 

0.0 

0.0 

endo-exo  (0^) 

8.8 

8.4 

9.0 

endo-endo  (Dgcy) 

20.5 

19.1 

19.1 

sr 

exo-exo  (C2v) 

6.4 

11.8 

9.1 

endo-exo  (Cg) 

0.0 

0.0 

0.0 

endo-endo  (€2^) 

1.3 

9.3 

5.8 

Seg 

exo-exo  {D^^) 

0.0 

0.0 

n/a 

endo-exo  (C^) 

5.3 

5.4 

n/a 

endo-endo  {D2d) 

12.8 

11.9 

n/a 

Sep 

exo-exo  (Cg^) 

2.9 

9.7 

n/a 

endo-exo  (C^) 

0.0 

0.0 

n/a 

endo  -endo  (Cg^) 

1.5 

3.5 

n/a 

®Zero-point  energies  not  included. 


The  optimized  geometries  of  the  exo-exo  con¬ 
formers  of  the  neutral  species  are  in  good  agree¬ 
ment  with  the  experimental  solid-state  data, 
though  the  bond  lengths  are  consistently  overesti¬ 
mated  (Table  III).  This  overestimation  is  especially 
pronounced  at  the  BLYP/6-311G*  level  of  theory 
(which  also  appears  to  slightly  overestimate  the 
bond  angles)  and  is  worse  for  Seg  than  for  Sg.  With 
better  data  lacking  at  present,  these  slight  discrep¬ 
ancies  can  be  attributed  with  equal  probability  to 
the  basis-set  incompleteness,  crystal  packing  ef¬ 
fects,  and/or  experimental  errors. 

The  removal  of  two  electrons  from  the  exo-exo 
conformers  of  Sg  and  Seg  results  in  symmetry 
lowering  from  to  €2^  The  distances  between 
the  formally  nonbonded  atoms  3  and  7  [Fig.  1(a)] 
become  shorter,  whereas  those  involving  the  1-5 
pairs  increase  (Table  III).  The  ensuing  unequiva- 

TABLE  II _ 

Second  adiabatic  potentials  (eV)  of  the  Sg  and 
Seg  molecules. 


Method 


Species 

Conformation 

HF 

MP2 

BLYP 

Ss 

exo-exo 

22.35 

21.05 

21.44 

endo  -exo 

21.69 

20.18 

20.65 

endo  -endo 

21.24 

20.12 

20.47 

SSg 

exo  -exo 

20.80 

19.39 

n/a 

endo  -exo 

20.44 

18.73 

n/a 

endo  -endo 

20.19 

18.60 

n/a 
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TABLE  III _ 

Geometries  of  the  exo  -exo  conformers  of  the  eight-membered  homocycles  of  sulfur  and  selenium. 


Interatomic  distances®  (A)  Bond  angles®  (°) 


Species 

Method 

1  -2 

2-3 

3-7 

1  -5 

1  -2-3 

2-3  -4 

CO 

i 

LO 

i 

Sb 

HF 

2.069 

2.069 

4.719 

4.719 

107.4 

107.4 

107.4 

MP2 

2.075 

2.075 

4.763 

4.763 

108.5 

108.5 

108.5 

BLYP 

2.146 

2.146 

4.967 

4.967 

109.8 

109.8 

109.8 

exp.'=‘ 

2.051 

2.051 

n/a 

n/a 

108.2 

108.2 

108.2 

exp.^ 

2.050 

2.050 

n/a 

n/a 

107.9 

107.9 

107.9 

exp.^ 

2.045 

2.045 

n/a 

n/a 

107.7 

107.7 

107.7 

sr 

HF 

2.064 

2.146 

2.225 

5.685 

94.8 

103.1 

96.9 

MP2 

2.093 

2.081 

3.328 

5.423 

99.8 

105.6 

92.7 

BLYP 

2.154 

2.160 

3.134 

5.670 

98.1 

106.7 

95.3 

863 

HF 

2.391 

2.391 

5.406 

5.406 

106.1 

106.1 

106.1 

MP2 

2.413 

2.413 

5.498 

5.498 

107.3 

107.3 

107.3 

exp.® 

2.336 

2.336 

n/a 

n/a 

105.7 

105.7 

105.7 

exp.^ 

2.334 

2.334 

n/a 

n/a 

105.8 

105.8 

105.8 

Sei+ 

HF 

2.394 

2.497 

2.492 

6.634 

95.2 

101.4 

96.5 

MP2 

2.443 

2.447 

4.000 

6.292 

99.7 

103.9 

91.3 

®See  Fig.  1(a)  for  atom  numbering. 

^Symmetry-averaged  X-ray  structure  of  orthorhombic  a-sulfur  at  100  K  [3]. 
^Symmetry-averaged  X-ray  structure  of  monoclinic  /3-sulfur  at  113  K  [4]. 
^Symmetry-averaged  X-ray  structure  of  monoclinic  y-sulfur  [5]. 

® Symmetry-averaged  X-ray  structure  of  monoclinic  a-selenium  [6]. 
^Symmetry-averaged  X-ray  structure  of  monoclinic  y-selenium  [7]. 


lence  of  chalcogen  atoms  produces  three  distinct 
bond  angles  and  a  pattern  of  short  and  long  bonds 
that  interleave  in  pairs.  The  computed  geometries 
are  extremely  sensitive  to  the  electron  correlation 
effects,  which  equalize  interatomic  distances  (in¬ 
cluding  bond  lengths)  but  appear  to  have  some¬ 
what  less  influence  on  the  bond  angles.  Particu¬ 
larly  drastic  changes  are  observed  for  the  3-7 
distances.  Without  the  inclusion  of  electron  corre¬ 
lation,  these  distances  are  predicted  short  enough 
to  suggest  the  presence  of  strong  transannular 
bonds  possessing  lengths  comparable  to  those  of 
the  ordinary  S-S  and  Se-Se  links.  On  the  other 
hand,  both  the  MP2  and  BLYP  data  seem  to  rule 
out  such  a  possibility. 

The  shortening  of  distances  between  the  non- 
bonded  atoms  (compare  the  3-7  and  2-5  columns 
in  Tables  III  and  IV)  observed  upon  going  from  the 
exo-exo  to  the  endo~exo  conformations  of  Sg  and 
Seg  produces  an  increase  in  steric  repulsions,  which 
in  turn  results  in  a  slight  lengthening  of  bonds  on 
average.  Due  to  the  lack  of  experimental  data, 
neither  this  lengthening  nor  the  concomitant  bond 
alternation  can  be  verified  at  present.  On  the  other 
hand,  experimentally  determined  geometries  are 
available  for  the  corresponding  dications  [8-11]. 


Examination  of  these  geometries  reveals  the  very 
poor  performance  of  the  Hartree-Fock  method  in 
predicting  the  strengths  of  the  transannular  inter¬ 
actions  between  the  atoms  in  positions  3  and  7  in 
the  endo-~exo  conformers  of  Sg'^  and  Seg"^.  Unfortu¬ 
nately,  the  approximate  inclusion  of  electron  corre¬ 
lation  effects  at  the  MP2  and  BLYP  levels  of  theory 
does  not  appear  to  bring  the  predicted  bond  lengths 
and  angles  into  a  closer  agreement  with  their  ex¬ 
perimental  counterparts.  Although  the  computed 
MP2/6-311G*  and  BLYP/6-311G*  S3-S7  dis¬ 
tances  in  the  endo-exo  conformer  of  Sg"^  turn  out 
to  be  very  similar,  they  nevertheless  deviate  from 
the  corresponding  experimental  value  by  roughly 
the  same  amount  as  the  HF/6-311G*  prediction 
does,  the  former  deviation  being  positive  and  the 
latter  negative. 

The  optimized  geometries  of  the  experimentally 
unknown  endo~endo  conformers  [Fig.  1(c)]  are 
listed  in  Table  V.  As  mentioned  above,  the  non- 
bonded  pairs  of  atoms  are  positioned  much  closer 
in  those  conformers  than  in  the  exo-exo  ones.  In 
fact,  the  close  proximity  of  the  flag-pole  atoms 
turns  the  D2^  endo-endo  conformer  of  Sg  into  a 
transition  state  at  the  HF/6-311G*  and  MP2/6- 
31 IG*  levels  of  theory,  although  the  correspond- 
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TABLE  IV _ 

Geometries  of  the  endo  -exo  conformers  of  the  eight-membered  homocycles  of  sulfur  and  selenium. 


Interatomic  distances^  (A)  Bond  angles^  (°) 


and  nnethod 

1  -2 

2-3 

1 

CO 

4-5 

3-7 

2-5 

1  -2-3 

2-3  -4 

3-4-5 

4-5  -6 

00 

1 

1 

ro 

^8 

HF 

2.073 

2.075 

2.103 

2.056 

3.991 

3.609 

103.7 

105.8 

107.1 

111.4 

110.7 

MP2 

2.081 

2.072 

2.125 

2.056 

4.147 

3.583 

104.2 

105.5 

106.7 

113.2 

112.1 

BLYP 

2.166 

2.105 

2.240 

2.115 

4.623 

3.775 

106.5 

107.8 

108.1 

116.7 

113.0 

S§" 

HF 

2.060 

2.140 

2.106 

2.069 

2.235 

3.816 

95.1 

107.5 

98.3 

95.8 

96.3 

MP2 

2.086 

2.059 

2.097 

2.073 

3.660 

3.510 

103.1 

106.3 

106.2 

97.9 

95.9 

BLYP 

2.146 

2.136 

2.146 

2.141 

3.669 

3.709 

102.6 

107.7 

106.8 

98.5 

96.7 

exp.^ 

2.055 

2.033 

2,022 

2.043 

2.860 

n/a 

99.3 

104.2 

100.7 

90.6 

95.1 

0 

d. 

X 

<D 

2.037 

2.005 

2.046 

2.038 

2.906 

n/a 

98.3 

104.0 

101.6 

94.0 

93.1 

Se0 

HF 

2.394 

2.397 

2.418 

2.378 

4.515 

4.062 

102.5 

104.2 

105.6 

109.8 

109.5 

MP2 

2.416 

2.415 

2.455 

2.396 

4.767 

4.019 

103.1 

103.3 

104.8 

112.0 

111.2 

Ser 

HF 

2.393 

2.486 

2.475 

2.395 

2.490 

4.384 

95.5 

105.1 

97.4 

96.1 

96.0 

MP2 

2.433 

2.428 

2.477 

2.425 

4.763 

3.938 

104.4 

103.6 

105.8 

102.3 

96.0 

exp,^^ 

2.32 

2.31 

2.35 

2.30 

2.84 

n/a 

95.2 

101,3 

98.2 

93.3 

89.8 

exp.® 

2.300 

2.295 

2.319 

2.312 

2.905 

n/a 

94.6 

102.4 

99.1 

92.7 

90.0 

^See  Fig.  1(b)  for  atom  numbering. 

^Symmetry-averaged  X-ray  structure  of  (Sg"^)  (AsF6')2  [8]. 
‘^Symmetry-averaged  X-ray  structure  of  (Sg'^)  (SbgFi^)  (SbFe" 
‘^Symmetry-averaged  X-ray  structure  of  (Seg'^)  {AICI4  )2  [10]. 

® Symmetry-averaged  X-ray  structure  of  (Teg"^)  (Seg’^)  (AsFg"), 

ing  imaginary  frequencies  of  16/  and  33/  cm”\ 
respectively,  are  too  small  to  rule  out  the  possibil¬ 
ity  of  this  system  being  actually  a  true  minimum 
on  the  corresponding  potential  energy  hypersur¬ 
face.  An  analogous  phenomenon  is  not  observed 
for  the  endo- endo  conformer  of  Seg.  At  the  MP2/6- 
311G*  level  of  theory,  the  C2y  endo-endo  con- 
former  of  Sg’^  is  also  a  transition  state  with  a  small 
imaginary  frequency  of  41/  cm“^  Again,  large 


)  [9]. 

(SO2)  [111. 

differences  between  the  HF  and  MP2  geometry 
predictions  are  found  for  Sg^  and  Seg^. 

In  summary,  all  the  three  theoretical  methods 
employed  in  this  study  are  successful  in  predicting 
both  the  energetics  and  geometries  of  the  neutral 
octachalcogen  species.  For  the  corresponding  dicat¬ 
ions,  the  inclusion  of  electron  correlation  within 
the  MP2  and  BLYP  approximations  suffices  to  pro¬ 
duce  correct  relative  stabilities  but  fails  to  yield 


TABLE  V _ 

Geometries  of  the  endo  -endo  conformers  of  the  eight-membered  homocycles  of  sulfur  and  selenium. 


Species 

Method 

Interatomic  distances®  (A) 

Bond  angles®  (° 

) 

1  -2 

2-3 

3-7 

1  -5 

1  -2-3 

1 

CO 

1 

(N 

4-5-6 

^8 

HF 

2.083 

2.083 

3.464 

3.464 

107.7 

112.8 

112.8 

MP2 

2.091 

2.091 

3.459 

3.459 

106.7 

113.3 

113.3 

BLYP 

2.162 

2.162 

3.686 

3.686 

107.2 

115.6 

115.6 

Si" 

HF 

2.065 

2.118 

2.201 

3.665 

100.3 

117.0 

96.1 

MP2 

2.086 

2.065 

3.475 

3.466 

110.1 

120.7 

99.6 

BLYP 

2.143 

2.138 

3.223 

3.797 

108.8 

123.0 

97.7 

SBs 

HF 

2.402 

2.402 

3.869 

3.869 

106.4 

114.4 

114.4 

MP2 

2.428 

2.428 

3.866 

3.866 

104.9 

112.0 

112.0 

Sei+ 

HF 

2.394 

2.477 

2.470 

4.142 

99.2 

114.3 

96.4 

MP2 

2.411 

2.437 

6.418 

4.804 

116.5 

120.6 

115.4 

^See  Fig.  1(c)  for  atom  numbering. 
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accurate  geometries.  This  failure  is  particularly 
conspicuous  for  the  distances  between  the  atoms 
supposedly  engaged  in  transannular  interactions, 
for  which  the  discrepancies  between  the  "best" 
theoretical  predictions  and  the  experimental  val¬ 
ues  are  as  large  as  0.7  A  (Table  IV)!  These  trouble- 
somely  large  errors  stem  from  a  quite  unusual 
coincidence  of  several  unfavorable  factors.  First  of 
all,  the  available  experimental  geometries  [8-11] 
are  of  inferior  quality  by  the  contemporary  stan¬ 
dards.  Even  more  importantly,  the  potential  en¬ 
ergy  hypersurfaces  of  the  dications  in  question  are 
very  flat  in  the  directions  corresponding  to  a  lat¬ 
eral  squashing  of  the  rings.  For  example,  in  the 
endo-exo  conformer  of  Sg^,  the  frequency  of  the  A' 
vibration  that  involves  predominantly  the  83-57 
stretch  is  predicted  to  be  equal  363  and  75  cm^^  at 
the  HF/6-311G*  and  MP2/6-311G*  levels  of  the¬ 
ory,  respectively.  Consequently,  the  across-the-ring 
distances  are  expected  to  be  quite  sensitive  to 
small  perturbations.  With  respect  to  the  experi¬ 
mental  data,  this  sensitivity  implies  a  substantial 
influence  of  the  crystal  packing  and  counterion 
effects  on  the  measured  solid-state  geometries.  At 
the  same  time,  relatively  small  errors  in  the  com¬ 
puted  energies  and  energy  gradients  are  bound  to 
translate  into  large  deviations  in  the  optimized 
intraring  distances. 

Rigorous  analysis  of  the  computed  electronic 
wave  functions  elucidates  the  reasons  behind  the 
observed  dramatic  effects  of  electron  correlation 
on  bonding  in  the  octachalcogen  dications.  The 
inspection  of  the  AOM-localized  natural  orbitals 
(LNOs)  in  the  endo-endo  conformer  of  Seg  reveals 
the  presence  of  eight  equivalent  LNOs  describing 
the  Se-Se  bonds  (LNOs  129-136  in  Table  VI)  in 
addition  to  two  unequivalent  lone-pair  LNOs  per 
each  atom  (LNOs  113-128).  All  of  these  LNOs  are 
almost  doubly  occupied.  The  covalent  bond  orders 
equal  1.186  for  the  bonded  pairs  of  atoms  and 
0.083  for  the  Se3-Se7  and  Se^-Seg  nonbonded  pairs. 


At  the  MP2/TZVP  level  of  theory,  the  removal  of 
two  electrons  from  this  system  produces  a  delocal¬ 
ized  dication,  as  indicated  by  the  computed  atomic 
charges  of  0.1842  (Se^  and  Seg),  0.2486  (Se2,  Se4, 
Seg,  and  Seg),  and  0.3186  (Seg  and  Se7).  The  bond 
alternation  observed  in  the  optimized  geometries 
(Table  V)  is  reflected  in  the  covalent  bond  orders 
of  1.229  (Sei-Se2/  etc.)  and  1.164  (Se2-Se3,  etc.).  At 
the  same  time,  the  covalent  bond  orders  of  0.031 
and  0.124,  obtained  for  the  Se^-Seg  and  Se3-Se7 
pairs  of  atoms,  respectively,  rule  out  the  presence 
of  any  significant  transannular  interactions. 

In  principle,  two  qualitatively  distinct  types  of 
bonding  in  the  dications  under  study  are  possible. 
On  one  hand,  the  removal  of  two  electrons  from 
two  lone-pair  LNOs  located  across  the  ring  may 
yield  an  orbital  bonding  of  two  chalcogen  atoms 
with  localized  positive  charges  (as  in  structure  1). 
Such  bonding  can  easily  overcome  the  strong  elec¬ 
trostatic  repulsion  between  the  charges  (the  He^”^ 
dication  with  Rne-He  ^  0.741  A  being  a  good  ex¬ 
ample)  [23].  On  the  other  hand,  the  two  electrons 
may  come  from  several  lone-pair  LNOs,  producing 
a  dication  with  a  delocalized  positive  charge  (as  in 
structure  2). 


S  +  S 
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In  light  of  the  above  discussion,  the  exaggera¬ 
tion  of  transannular  interactions  observed  within 
the  Hartree-Fock  approximation  is  attributable  to 
an  excessive  localization  of  the  positive  charge. 
This  spurious  localization  is  readily  explainable  by 
the  fact  that  the  occupancies  of  the  Hartree-Fock 
localized  orbitals  are  restricted  to  either  zero  or 
two.  In  other  words,  the  removal  of  two  electrons 
from  the  lone-pair  orbitals  must  produce  either  a 


TABLE  VI _ 

MP2/TZVP  valence  localized  orbitals  in  the  endo-endo  conformer  of  Seg. 


Orbitals 

Occup. 

Log.  Num. 

-7(au.) 

Description^ 

113-116 

1.982 

1.027 

-3.713 

Lone  pairs  of  atoms  2,  4,  6,  and  8  (98.7%  loc.) 

117-120 

1 .983 

1.030 

-3.682 

Lone  pairs  of  atoms  1 , 3,  5,  and  7  (98.5%  loc.) 

121  -124 

1.981 

1.171 

-2.175 

Lone  pairs  of  atoms  1 , 3,  5,  and  7  (92.3%  loc.) 

125-128 

1.981 

1.163 

-2.151 

Lone  pairs  of  atoms  2,  4,  6,  and  8  (92.6%  loc.) 

129-136 

1.957 

2.072 

-2.112 

Covalent  Se  -Se  bonds  (98.2%  loc.) 

®See  Fig.  1  (c)  for  atom  numbering. 
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new  bond  or  multicentric  bonding  orbitals,  the 
latter  possibility  (which  would  be  compatible  with 
the  structure  2)  being  highly  unlikely  because  of 
the  angular  misalignment  of  the  lone  pairs.  On  the 
other  hand,  fractional  occupancies  of  LNOs  are 
possible  once  electron  correlation  is  turned  on, 
favoring  the  delocalized  picture  of  bonding.  Such 
fractional  occupancies  are  indeed  observed  in  the 
MP2/TZVP  LNOs  of  the  endo-endo  conformer  of 
Se^+  (Table  VII). 

Almost  certainly,  the  actual  bonding  in  the 
and  Seg"^  dications  is  closer  to  that  represented  by 
structure  2  than  to  that  portrayed  by  structure  1. 
Consequently,  because  of  the  relatively  low  occu¬ 
pancies  of  the  LNOs,  the  actual  ground-state  wave 
functions  of  these  species  are  expected  to  possess 
considerable  multiconfigurational  character,  neces¬ 
sitating  the  use  of  high-level  treatments  such  as 
CAS  SCF  or  CCSD(T)  for  quantitative  predictions 
of  electronic  properties  and  geometries.  Because  of 
the  computer  hardware  limitations,  such  calcula¬ 
tions  are  not  feasible  at  present.  One  should  also 
note  in  passing  that  the  electron  correlation  effects 
described  in  this  work  are  quite  unique,  differing 
in  their  nature  from  the  dispersive  effects  encoun¬ 
tered  in  the  description  of  the  attractive  closed-shell 
interactions  between  chalcogen  atoms  [24]. 


Conclusions 

Transannular  interactions  in  the  octachalcogen 
dications  pose  a  severe  challenge  to  the  modern 
techniques  of  computational  quantum  chemistry. 
The  combination  of  flat  potential  energy  hypersur¬ 
faces  and  unexpectedly  large  electron  correlation 
effects  renders  the  approximate  approaches  such 


as  MP2  and  BLYP  inadequate  for  the  prediction  of 
the  equilibrium  geometries  of  these  species.  On  the 
other  hand,  the  relatively  large  size  and  low  sym¬ 
metry  of  the  dications  in  question  preclude  the  use 
of  more  sophisticated  methods  such  as  CAS  SCF  or 
CCSIXT)  at  present. 

It  is  clear  from  the  results  reported  in  this  study 
that  the  definitive  answer  to  the  question  whether 
the  transannular  interactions  in  Sg*^  and  Seg"^  are 
indeed  reality  or  just  an  artifact  will  ultimately 
require  a  coalescence  of  experimental  and  theoreti¬ 
cal  research.  On  the  experimental  front,  precise 
gas-phase  geometry  determinations  will  be  neces¬ 
sary  in  order  to  eliminate  the  ring  distortions 
caused  by  the  crystal  packing  effects  and  counteri¬ 
ons.  At  the  same  time,  large-scale  ab  initio  elec¬ 
tronic  structure  calculations  involving  compar¬ 
isons  among  several  high-level  electron  correlation 
treatments  will  have  to  be  carried  out. 

The  present  work  uncovers  the  reasons  behind 
the  failures  of  the  previously  published  calcula¬ 
tions  to  correctly  predict  the  transannular  chalco- 
gen-chalcogen  distances  in  the  Sg"^  and  Seg"^  dica¬ 
tions.  The  descriptions  of  bonding  in  these  systems 
are  completely  different  at  the  HF  and  correlated 
levels  of  theory,  the  latter  appearing  to  be  closer  to 
reality.  If  present  at  all,  the  transannular  interac¬ 
tion  are  weak  and,  stemming  from  a  delicate  bal¬ 
ance  between  charge  delocalization  and  localized 
bonding,  certainly  distinct  from  conventional 
bonds. 
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TABLE  VII _ 

MP2/TZVP  valence  localized  orbitals  in  the  endo-endo  conformer  of  Se|L 


Orbitals 

Occup. 

Log.  Num. 

-  T  (a.u.) 

Description® 

113-114 

1.979 

1.036 

-3.714 

Lone  pairs  of  atoms  3  and  7  (98.3%  loc.) 

115-118 

1.979 

1.038 

-3.681 

Lone  pairs  of  atoms  2,  4,  6,  and  8  (98.2%  loc.) 

119-120 

1.975 

1.034 

-3.671 

Lone  pairs  of  atoms  1  and  5  (98.4%  loc.) 

121  -122 

1.988 

1.278 

-2.181 

Lone  pairs  of  atoms  1  and  5  (88.3%  loc.) 

123-126 

1.929 

2.081 

-2.158 

2-3  (etc.)  bonds  (10.0%  ion.,  97.5%  loc.) 

127-130 

1.926 

2.094 

-2.142 

1  -2  (etc.)  bonds  (6.1%  ion.,  97.5%  loc.) 

131  -134 

1.818 

1.292 

-2.178 

Lone  pairs  of  atoms  2,  4,  6,  and  8  (87.8%  loc.) 

135-136 

1.389 

1.313 

-2.243 

Lone  pairs  of  atoms  3  and  7  (87.0%  loc.) 

®See  Fig.  1(c)  for  atom  numbering. 
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ABSTRACT:  Angular-resolved  kinetic  energy  distributions  of  fragments  resulting 
from  dissociation  induced  by  intense,  short,  linearly  polarized  laser  pulses  are  calculated 
using  an  accurate  three-dimensional  Fourier  transform  method  in  spherical  coordinates. 
The  rotational  excitation  of  the  molecule  leads,  in  general,  to  an  alignment  of  the 
photofragments  with  respect  to  the  field  polarization  vector.  But,  unexpectedly,  increasing 
the  field  strength  may  also  produce  fewer  aligned  fragments  at  the  higher  kinetic 
energies  of  the  multiphoton  above  the  threshold  dissociation  spectrum.  H  J 
photodissociating  by  interaction  with  an  Nd:YAG  laser  at  A  =  532  nm  and  for  intensities 
of  10-50  TW /crrP  is  taken  as  an  illustrative  example,  for  which  some  angular-resolved 
experimental  spectra  are  available.  A  comprehensive  interpretation  is  provided  within 
the  field-dressed  Floquet  picture  by  referring  to  two  strong-field  mechanisms,  namely, 
the  potential  barrier  lowering  (also  responsible  for  bond  softening)  and  the  nonadiabatic 
transitions  (also  responsible  for  the  vibrational  trapping).  ©  1997  John  Wiley  &  Sons,  Inc. 
Int  J  Quant  Chem  65:  617-624,  1997 


Introduction 

Progress  in  laser  technology  producing  very 
intense  and  short  pulses  has  motivated  a 
completely  new  formulation  for  the  description  of 
multiphoton  dissociation  dynamics  of  molecules 
[1].  Very  complicated  molecular  behavior  is  ob¬ 
served  as  a  response  to  the  electromagnetic  field. 

Contract  grant  sponsor:  Institut  du  Developpement  et  des 
Ressources  en  Informatique  Scientifique  I.D.R.l.S. 

Contract  grant  number:  940425. 

International  Journal  of  Quantum  Chemistry,  Vol.  65,  617-624  (1997) 
©  1997  John  Wiley  &  Sons,  Inc. 


Repeated  stimulated  photon  absorption  and  emis¬ 
sion  mechanisms  lead  to  above-threshold  dissocia¬ 
tion  (ATD),  evidenced  by  the  observation,  in  the 
fragment  kinetic  energy  distributions,  of  peaks 
spaced  by  one  quantum  of  the  photon  energy  [2, 
3].  In  addition  to  scalar  observables  such  as  total 
or  partial  dissociation  rates  or  the  energy  spectra 
of  fragments,  vectorial  observables  such  as  frag¬ 
ment  angular  distributions  provide  complemen¬ 
tary  views  for  a  detailed  understanding  of  the 
strong  laser-induced  dissociation  dynamics.  Two 
possibilities  have  so  far  been  invoked  for  the  inter¬ 
pretation  of  aligned  fragments  with  respect  to  the 
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field  polarization  vector,  namely,  an  enhancement 
of  the  dissociation  cross  section  for  initially  aligned 
molecules  [2,  4,  5]  or  a  dynamical  alignment  occur¬ 
ring  during  the  fragmentation  process  as  a  conse¬ 
quence  of  the  torque  that  the  laser  field  exerts  on 
the  molecule  [6”8].  The  extensive  use  of  one-di¬ 
mensional  models,  where  the  fragmentation  of 
diatomic  molecules  is  merely  described  by  a  single 
degree  of  freedom,  relies  upon  the  first  assump¬ 
tion  [2,  9,  10].  The  possibility  of  a  dynamical  align¬ 
ment  motivates  the  development  of  models  where 
rotational  degrees  of  freedom  are  fully  included 
[11-14]. 

The  most  popular  conviction  so  far  experimen¬ 
tally  supported  by  different  observations  is  that, 
whatever  the  mechanism  is,  molecules  subjected  to 
an  intense,  linearly  polarized  laser  radiation  disso¬ 
ciate  into  fragments  aligned  with  the  polarization 
axis.  There  is,  however,  experimental  evidence,  on 
multiphoton  dissociation,  that  protons  are 
emitted  along  the  polarization  axis  in  narrow  dis¬ 
tributions,  becoming  broader  as  intensity  increases 
[6].  The  bond-softening  (BS)  mechanism  resulting 
from  laser-induced  lowering  of  a  potential  energy 
barrier  supports  the  interpretative  scheme  of  the 
alignment  process.  More  recently,  we,  conversely, 
pointed  out  that  the  vibrational  trapping  (VT) 
mechanism  may  be  responsible  for  fragmentations 
occurring  in  a  plane  nearly  orthogonal  to  the  polar¬ 
ization  axis  [14].  This  surprising  prediction  has 
been  confirmed  by  quantum  calculations  on  H2 
with  an  excitation  wavelength  of  A  =  80  nm,  pro¬ 
ducing  a  curve-crossing  situation  in  a  field-dressed 
molecule  picture  occurring  at  the  left  of  the  equi¬ 
librium  position.  The  detailed  understanding  of 
such  a  mechanism  involves  very  efficient  trapping 
in  all  spatial  directions  except  a  direction  quasi¬ 
perpendicular  to  the  applied  field  where  fragmen¬ 
tation  may  take  place. 

The  aim  of  the  present  work  was  to  understand 
another  somewhat  unexpected  observation  in 
of  less  aligned  photofragments  for  stronger  fields 
at  a  wavelength  of  A  =  532  nm  [6],  by  calculating 
the  angular-resolved  kinetic  energy  spectra  and 
interpreting  them  in  terms  of  the  interplay  of 
field-induced  chemical  bond  softening  and  harden¬ 
ing  (trapping)  mechanisms.  The  article  is  orga¬ 
nized  as  follows:  The  next  section  provides  a  brief 
theoretical  survey  of  a  time-dependent  quantum 
model  where  an  initial  (vibrational-rotational) 
state  of  is  propagated  on  its  two  lowest  elec¬ 
tronic  states,  namely,  ^2^  and  The  linearly 
polarized  laser  imposes  a  privileged  direction  in 


the  laboratory  frame  with  respect  to  which  the 
molecular  axis  is  positioned.  For  short  internuclear 
distances  defining  an  inner  region,  a  Fourier  trans¬ 
form  spectral  method  is  used  in  spherical  coordi¬ 
nates  by  combining  a  contact  transformation  with 
Dateo  and  Metiu's  [13]  grid  approach  for  angular 
degrees  of  freedom.  For  the  large  distances  of  the 
outer  region,  an  analytic  approach,  which  is  a 
three-dimensional  generalization  of  Volkov-type 
solutions  of  [15],  is  adopted.  The  third  section  is 
devoted  to  the  interpretation  of  the  angular-re- 
solved  fragment  kinetic  energy  spectra  of  FJ  J  irra¬ 
diated  by  an  intense  pulse  (I  =  10-50  TW/cm^)  of 
A  =  532  nm  delivered  by  a  Nd:YAG  laser  source. 
Special  attention  is  directed  toward  the  under¬ 
standing  of  the  alignment  properties  of  the  experi¬ 
mental  spectra  of  [6],  in  terms  of  multiphoton 
processes  involving  adiabatic  potential  energy  bar¬ 
riers  (BS  situation)  and  nonadiabatic  transitions 
(VT  situations)  affecting  molecular  dynamics  dur¬ 
ing  dissociation. 


Theory 


Our  purpose  was  to  solve  the  nuclear  time-de¬ 
pendent  Schrodinger  equation  on  a  Born-Op- 
penheimer  level  of  approximation  involving  two 
electronic  states  labeled  g  (ground,  ^2^)  and  u 
(excited,  ^2jt)  following,  for  convenience,  the  stan¬ 
dard  symmetry  notations  of  Hj: 


where  the  total  molecule-plus-field  Hamiltonian  is 


H  =  i(fR  +  %  +  fj  +  W(R,  0;  t),  (2) 


1  being  the  identity  matrix,  and  R,  9,  and  (p,  the 
spherical  coordinates  of  the  internuclear  vector  R 
in  the  laboratory  frame  [14].  Kinetic  terms  are 
given  by 


d  d 

= - R^ — 

2mR^  dR  dR 

1  d  (  d  \ 

- r - —  sin  9— 

2mR^  sin  9  d9\  d9  j 

1  d^ 

2mR^  sin^  9  dcp^' 


(3) 

(4) 

(5) 
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where  m  represents  the  reduced  mass.  The  radia¬ 
tive  coupling  written  in  the  length  gauge  involves 
the  scalar  product  of  the  dipole  moment  JL  (paral¬ 
lel  to  R  for  a  X  X  transition,  as  is  the  case  of 
and  the  electric-field  vector  s  which  is  taken 
along  the  laboratory  Z-axis: 


the  cosine  transform,  which,  in  principle,  assumes 
even  parity,  may  lead  to  some  numerical  unstabili¬ 
ties  and  inaccuracies  in  angular-resolved  spectra 
(due  to  the  nonzero  derivatives  of  the  =  ±1 
wave  function  at  0  =  0,  77)  [17].  The  contact  trans¬ 
formation,  defined  by 


Jl' s  =  cos  de(t)  cos  cot.  (6) 

The  laser  pulse  carrier-wave  frequency  is  co  and  its 
envelope  s{t)  has  a  Gaussian  shape: 

eit)  =  (7) 

of  half-width  r.  The  potential  part  contains  the 
field-free  molecular  potentials  V  as  diagonal  ele¬ 
ments  and  the  matter-field  couplings  as  nondiago¬ 
nal  terms: 

V^(R)  -iJi(R)cos  dsiOcos  (ot\ 

\  — /x(K)cos  6s(t)cos  (ot  j 

(8) 

As  clear  from  Eq.  (8),  the  motion  associated  with 
the  azimuthal  angle  cp  remains  separated  under 
the  action  of  the  (^-independent  potential  term  and 
the  Z-projection  of  the  total  rotational  angu¬ 
lar  momentum  N,  is  a  good  quantum  number 
describing  the  invariance  through  rotation  about 
the  field-polarization  vector.  The  potential-energy 
curves  as  well  as  the  electronic  transition  moment 
are  taken  from  Bunkin  and  Tugov's  fit  [16].  The 
initial  state  of  (^S^)  with  an  isotropic  ensem¬ 
ble  of  para-  and  ortho-hydrogen  in  equilibrium 
involves  two  components  with  N  =  1,  =  0 

(statistical  weight  of  1  /3),  and  N  =  1,  =  +  X 

(statistical  weight  of  2 /3),  each  of  them  having  to 
be  propagated  separately  and  further  combined 
using  their  respective  weights  [14]. 

Fourier  transform  methodology  is  one  of  the 
most  popular  approaches  to  solve  Eq.  (1),  the  an¬ 
gular  variables  being  treated  either  by  using  basis- 
set  expansions  or  grid  techniques.  This  last  ap¬ 
proach  has  recently  been  significantly  improved 
by  Dateo  and  Metiu  [13]  with  the  implementation 
of  a  unitary  Cayley  scheme  for  Tq  combined  with  a 
Feit-Fleck  split  operator  technique.  The  action  of 
potential  and  kinetic  operators  on  the  wave  func¬ 
tion  is  evaluated  through  a  combination  of  coordi¬ 
nate  and  momentum  representations  using  an  ex¬ 
ponential  Fourier  transform  for  R  and  a  cosine 
transform  for  d.  Although  very  accurate  results  are 
obtained  for  =  0,  it  turns  out  that  for  =  + 1 


J?\/sin  6 


(9) 


is  a  way  out  of  this  difficulty;  the  numerical  unsta¬ 
bilities  related  to  the  cosine  transform  performed 
on  can  easily  be  removed  by  applying  the 
procedure  to  propagation  technique, 

previously  described  [13,  14],  is  based  on  the  re¬ 
peated  use  of  short-time  propagators: 


=  exp|-^[T«  +  r,  + 

(10) 


where  kinetic  operators  are  merely 
~  2m 

^  1 


(11) 

(12) 


and  the  effective  potential  is 


2mR^ 


+ 


{Ml  -  1/4) 
ImR^  sin^  6 


.  (13) 


Since  the  three  operators  T^,  T^,  and  do  not 
commute,  the  exponential  in  [Eq.  (10)]  is  split  into 
five  terms,  using  MoyaTs  formula  [18]  at  the 
third-order  approximation  0(6f^). 

The  calculation  of  kinetic  energy  spectra  in¬ 
volves,  in  principle,  the  propagation  of  the  full 
wavepacket  on  very  large  grids,  during  the  total 
pulse  duration,  such  that  the  postpulse  analysis 
leads  to  free-fragment  properties.  The  overall  pro¬ 
cedure  is  very  time-consuming,  especially  for  a 
molecule  as  H2  (odd  charged  homonuclear  ion) 
due  to  its  asymptotically  divergent  coupling  with 
the  field.  To  overcome  this  difficulty,  the  wave 
function  is  split  into  two  regions:  an  internal  and 
an  asymptotic  one.  The  latter  is  analyzed  analyti- 
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cally  by  a  three-dimensional  generalization  of  the 
Volkov-type  solutions  of  [15],  while  the  above- 
mentioned  numerical  propagation  is  carried  out  on 
the  former.  Angular-resolved  fragments  probabil¬ 
ity  distributions  are  then  given  by 


6)dk 


=  lim 

t-^QO 


0;  t)\  -h  I  6;  t\ 


dk, 

(14) 


where  (t  ^  oo)  means  that  not  only  the  laser  pulse 
is  switched  off,  but  also  that  there  is  no  outgoing 
continuum  components  of  the  inner  part  wave- 
packet.  represent  the  Fourier  transform  of  the 
asymptotic  region  wavepacket  components  on  the 
g  and  u  channels,  respectively. 


Results 


FIGURE  1 .  Field-dressed  potential-energy  curves  of 
(A  =  532  nm),  for  an  intensity  of  /  =  10'*^  W  /  cm^,  in  the 
diabatic  (solid  lines)  and  adiabatic  (broken  lines)  frames. 
Curve-crossing  regions  are  outlined  by  rectangular  boxes 
X1 ,  X2,  and  X3.  The  energy  of  the  v  =  4  vibrational  level 
is  indicated  by  the  thin  horizontal  line. 


The  angular-resolved  fragments  spectra  are  first 
analyzed  with  respect  to  their  kinetic  energy  distri¬ 
butions.  In  the  second  step,  the  basic  mechanisms 
leading  to  the  alignment  dynamics  are  presented 
and  the  part  that  they  are  playing  is  illustrated  in 
some  examples.  Two  strong-field  intensities  are 
considered,  namely,  10^^  and  5  X  10^^  W/cm^,  at 
a  wavelength  of  A  =  532  nm,  with  pulse  rise  and 
fall  off  times  of  r  =  30  fs  and  the  maximum  at 
Iq  =  75  fs,  in  order  to  follow  as  closely  as  possible 
the  experimental  situation  [6].  The  field-dressed 
potential  energy  curves  involved  in  the  two  main 
Floquet  blocks  are  displayed  in  their  diabatic  and 
adiabatic  representations  in  Figure  1.  The  adiabatic 
representation  showing  avoided  curve-crossings 
results  from  the  diagonalization  of  the  radiative 
interaction  for  the  intensity  10^^  W/cm^  and  a 
fixed  molecule-field  orientation  (^  =  0  or  tt,  which 
maximizes  the  coupling).  The  curve-crossing  re¬ 
gions  upon  which  the  interpretation  rests  are  indi¬ 
cated  by  rectangular  boxes:  XI  between  the  ground 
and  the  one-photon  channels,  X2  between  the 
ground  and  the  two-photon  channels,  and  X3  be¬ 
tween  the  two-  and  three-photon  channels.  Three 
initial  vibrational  states  of  are  taken  into 
account,  their  energies  corresponding  to  some  spe¬ 
cific  situations  with  respect  to  the  height  of  the 
adiabatic  barrier.  At  the  XI  avoided  crossing  v  ^  2 
lies  below  and  v  =  5  above  this  barrier  for  both 
field  strengths;  as  for  i;  =  4,  it  lies  above  the  bar¬ 
rier  for  the  highest  intensity  and  below  for  the 
lowest. 


KIMETIC  ErVERGY  DISTRIBUTIONS 

Three-dimensional  viewgraphs  of  the  spectra 
are  given  for  I  ^  10^^  W/cm^  in  Figure  2.  A 
behavior  which  is  typical  for  ATD  can  be  detected 
in  terms  of  peaks,  in  the  proton  kinetic  energy 
distributions,  separated  by  the  photon  energy  [2, 
3].  The  branching  ratios  (relative  heights  of  the 
so-called  one-,  two-  or  three-photon  peaks)  reflect 
the  energy  positioning  of  the  initial  states.  For 

=  2,  the  potential  barrier  is  too  high  at  XI  for  the 
dissociation  to  proceed  by  the  absorption  of  a 
single  photon.  A  third-order  matter-field  interac¬ 
tion  at  X2  gives  rise  to  dissociation  by  absorbing 
two  or  three  photons.  For  the  intensity  under  con¬ 
sideration,  an  important  nonadiabatic  transition  at 
X3  results  in  a  competition  between  the  two-  and 
three-photon  peaks.  The  situation  is  completely 
different  for  v  =  5;  the  field-induced  barrier  low¬ 
ering  at  XI  favors  the  single-photon  process,  with 
the  one-photon  peak  in  the  spectra.  The  case  of 
z;  ^  4  is  intermediate:  There  is  still  some  competi¬ 
tion  between  the  barrier  lowering  (XI)  and  the 
nonadiabatic  transition  (X2)  mechanisms,  leading 
to  a  spectrum  displaying  three  peaks  with  a  major 
one-photon  peak  contribution, 

ANGULAR  DISTRIBUTIONS 

Alignment  processes  are  very  sensitive  to  the 
dynamics  which  takes  place  at  the  crossing  regions 
XI,  X2,  and  X3.  They  are  basically  controlled 
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FIGURE  2.  Angular-resolved  kinetic  energy  spectra  for  the  initial  vibrational  level  v  =  2  [panel  (a)],  v  =  4  [panel  (b)], 
and  V  =  5  [panel  (c)]  of  irradiated  by  a  A  =  532  nm  laser  delivering  an  Intensity  of  /  =  10^^  W/cm^.  Panel  (d) 
gathers  angular  distributions  in  polar  representation,  for  each  v,  of  fragments  contributing  to  the  first  photon  peak  (thick 
solid  line)  to  the  second  photon  peak  (thin  solid  line)  or  the  third  photon  peak  (broken  line). 


by  two  mechanisms:  the  barrier  lowering  and 
the  nonadiabatic  transition.  Let  us  examine  them 
separately: 

Barrier  Lowering 

In  strong-field  situations,  the  adiabatic  potential 
barrier  is  lowered  in  an  efficient  way  proportional 
to  the  radiative  coupling,  i.e.,  ^/I/Ji(R)cos  0.  For  a 
given  intensity,  this  lowering  is  much  more  impor¬ 
tant  for  cos  0^1,  i.e.,  for  ^  0  or  tt.  For  cos  0  0 

{0  7r/2),  the  high  potential  barrier  is  hardly  pen¬ 

etrable.  The  torque,  which  is  exerted  by  the  laser 
on  the  molecule,  allows  the  wavepacket  to  skirt 
around  the  potential  barrier  and  find  the  mini¬ 
mum-energy  pathway  toward  the  dissociative  val¬ 
ley  (i.e.,  for  directions  close  to  0  =  0  or  tt).  In  later 


steps,  the  large  distance  dynamics  is  no  longer 
affected  by  an  additional  torque  (due  to  the 
factor  in  [Eq.  (12)]),  such  that  the  angular  distri¬ 
bution  is  close  to  its  value  at  the  position  XI  of  the 
barrier,  where  alignment  with  the  field  (0  =  0 
or  tt)  is  favored. 

Nonadiabatic  Transition 

In  avoided  crossing  regions  X2  and  X3,  the 
dynamics  proceeds  via  a  nonadiabatic  jump  from 
the  upper  attractive  to  the  lower  repulsive  adia¬ 
batic  surfaces.  This  is  monitored  by  a  nonadiabatic 
kinetic  interstate  coupling  located  at  the  avoided 
crossing  point  and  presenting  an  amplitude  maxi¬ 
mum  for  0  ^  77/2,  We  recall  that  strong  radiative 
interactions  in  the  diabatic  frame,  either  for  0  =  0, 
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TT  or  for  high  fields,  correspond,  accordingly, 
to  small  kinetic,  nonadiabatic  couplings  respon¬ 
sible  for  the  dissociation  in  the  adiabatic  frame. 
Finally,  less  aligned  fragments  are  expected  when 
nonadiabatic  transitions  are  responsible  for  the 
dissociation. 

In  realistic  situations,  it  is  in  the  interplay  of  the 
two  mechanisms  that  the  interpretation  is  to  be 
looked  for.  A  direct  consequence  of  the  previous 
analysis  remains,  however,  that  the  one-photon 
peak,  resulting  from  the  barrier-lowering  mecha¬ 
nism  (XI),  leads  to  aligned  fragments,  whereas  the 
two-photon  peak  resulting  from  a  single  nonadia¬ 
batic  transition  (X2)  is  characterized  by  less  aligned 
fragments.  The  protons  with  a  velocity  distribution 
within  the  three-photon  peak  are  even  less  aligned, 
as  two  nonadiabatic  transitions  (X2  and  X3)  are 
involved  in  their  dynamics.  This  is  clearly  illus¬ 
trated  in  Figure  2  for  all  three  initial  states  i?  =  2, 
4,  or  5.  Another  direct  consequence  is  the  intensity 
dependence  of  the  alignment.  An  increase  in  inten¬ 
sity  produces  more  efficient  barrier  lowering,  re¬ 
sulting  in  more  aligned  fragments  when  the  lead¬ 
ing  mechanism  is  precisely  the  tunneling  at  XI, 
This  can  be  seen  in  the  angular  distributions  of  the 
one-photon  peaks  of  v  =  4:  and  5  displayed  in 
Figure  3.  Alignments  are  efficiently  increased  when 
going  from  10^^  to  5  X  10^^  W/cm^  •  z;  =  5,  with 
an  energy  well  above  the  adiabatic  potential  barri¬ 
ers  for  this  range  of  intensities,  is  less  sensitive  to 


the  tunneling  mechanism.  The  increase  in  its  align¬ 
ment  is  much  less  than  i;  =  4.  Conversely,  when 
nonadiabatic  transitions  monitor  the  dynamics, 
distributions  are  less  aligned  at  high  fields.  This  is 
the  case  for  u  =  4  and  5  in  Figure  3  and  corrobo¬ 
rates  the  experimental  observation  of  [6]. 

More  detailed  analysis  is  required  for  a  thor¬ 
ough  interpretation  of  some  observations  on  the 
alignment  patterns  in  terms  of  the  role  played  by 
the  initial  vibrational  state  v  or  the  intensity-de¬ 
pendent  relative  contribution  of  the  initial  angular 
distribution  (i.e.,  =  0  or  ±  1).  Two  arguments 

can  be  invoked  to  support  this  analysis: 

(i)  Different  initial  vibrational  states  give  rise 
to  different  kinetic  energies  during  the  dis¬ 
sociation  process.  A  wavepacket  issued  from 
a  high  V  presents  a  high  kinetic  energy 
distribution  and  feels  the  alignment  effects 
of  the  X2  and  X3  avoided  crossing  regions 
in  a  reduced  way.  It  is  also  less  sensitive  to 
the  long-range  dynamics  of  the  two-photon 
channel  surface  which  tends  to  favor  the 
6  ^  7r/l  direction  for  the  fragmentation. 
This  is  why  the  fragments  of  a  given  kinetic 
energy  (i.e.,  within  a  given  peak)  are  more 
aligned  for  higher  initial  vibrational  states. 
Figure  2  shows,  precisely,  that  fragments 
contributing  to  the  two-photon  peak  are 
more  efficiently  aligned  for  v  =  5  than  for 


13  2  13  2 

10  W/cm  5.10  W/cm 


FIGURE  3.  Angular  distributions,  in  polar  representation,  of  photons  contributing  to  the  first  (thick  solid  line),  second 
(thin  solid  line),  or  third  photon  peaks  (broken  line)  and  resuiting  from  the  photodissociation  of  H  J  (1/  =  4  and  5)  for  two 
field  strengths  /  =  10^®  and  5  x  10’®  W/cm®  (A  =  532  nm).  Note  that  the  relative  contributions  of  the  three  peaks  to  the 
angular  distributions  are  different,  the  scales  being  indicated  by  the  numbers  (in  arbitrary  units)  on  the  coordinate  axis. 
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i;  =  4.  The  same  observation  is  valid  for  the 
three-photon  peak  whose  alignment  in¬ 
creases  with  V, 

(ii)  To  obtain  results  corresponding  to  an  initial 
isotropic  ensemble  of  molecules,  an  incoher¬ 
ent  sum  of  two  probabilities  based  on  two 
initial  angular  distributions 

=  0)  and  (be.,  =  ±  1)  is  to 

be  performed  designates  the 

spherical  harmonics  and  ^  leads  to  the 
same  result  as  because  the  total  Hamil¬ 
tonian  does  not  depend  upon  the  sign  of 
M^,  as  is  clear  from  Eq.  (13)].  Having  in 
mind  that  Y^g  behaves  as  cos  6  and  Y^  as 
sin  6,  the  potential  barrier,  with  a  maxi¬ 
mum  at  0  =  7r/2,  is  hardly  penetrable  for 
Yii-  It  is  only  for  very  strong  fields  that  the 
barrier  is  lowered  enough  for  the  =  ±  1 
component  to  be  in  competition  with  the 
=  0  component.  The  consequence  is  that 
for  high  intensities  (or  for  vibrational  levels 
above  the  barrier)  fragments  resulting  from 
an  initial  ^11  distribution  may  significantly 
contribute  to  the  spectrum,  producing  less 
aligned  distribution  patterns.  This  argu¬ 
ment  may  also  contribute  to  support  the 
fact  observed  in  Figure  3  that  at  high  inten¬ 
sity,  J  =  5  X  10^^  W/cm^,  the  one-photon 
peak  is  less  aligned  for  i;  =  5  than  for  u  =  4 
(the  potential  barrier  is  more  transparent  for 
the  Yi2  component  of  the  higher-energy  state 
i;  =  5  as  compared  to  i;  =  4). 

We  finally  note  that  the  polar  plots  of  some 
angular  distributions  show,  at  the  high  intensity  of 
Figure  3,  off-axis  structures  for  both  v  =  A  and  5. 
Such  structures  have  already  been  experimentally 
evidenced  in  similar  contexts,  i.e.,  in  the  angular 
distribution  of  the  photoelectron  ATI  spectrum  of 
xenon  [19]  and  the  angular  distributions  of  and 
from  dissociation  of  CO2  or  CS2  [20].  In  the 
latter  case,  an  interpretation  based  on  molecular 
pendular  states  arising  from  the  large  field-in¬ 
duced  polarizabilities  of  these  systems  is  given.  In 
the  former  case,  the  mechanism  which  is  invoked 
is  rather  close  to  the  one  monitoring  the  dissocia¬ 
tion  dynamics  of  the  ATD  regime.  More  precisely, 
the  rings  of  the  ionization  dynamics  are  inter¬ 
preted  in  terms  of  rescattering  from  the  ion  core  of 
the  tunneling  component  of  the  continuum  wave 
function  [19].  Here,  it  is  the  interferences  (en¬ 


hanced  by  the  high-field  intensity)  between  the 
short-distance  nonadiabatic  transition  (responsible 
of  the  multiphoton  excitation  at  X2)  and  the  large 
distance  tunneling  (responsible  of  the  dynamics  at 
XI)  which  are  at  the  origin  of  the  mixing  of  Y^g 
and  Y^i  distributions  and  of  bond  softening  and 
vibrational  trapping  mechanisms  manifesting 
themselves  as  structures  in  the  strongly  anisotropic 
angular  distributions  of  dissociation  products. 

As  a  word  of  conclusion,  the  widely  discussed 
alignment  of  the  fragments  resulting  from  a  multi¬ 
photon  dissociation  process  is  basically  related 
to  an  adiabatic  potential  energy  barrier  lowering 
(which  is  also  responsible  for  the  bond  softening). 
When  using  intense  fields,  there  are,  however, 
other  mechanisms  which  also  lead  to  dissociation, 
namely,  the  nonadiabatic  transitions  (as  the  ones 
involved  in  vibrational  trapping).  These  transitions 
favor  the  6  =  tt/ 2  direction  in  the  dissociation 
dynamics.  The  most  original  finding  is  the  possi¬ 
bility,  at  least  for  some  wavelengths  [14]  or  for 
some  kinetic  energies,  to  observe  photofragments 
in  a  plane  orthogonal  to  the  polarization  axis,  a 
possibility  enhanced  by  the  high  intensity.  This  is 
the  first  step  aimed  at  the  detailed  interpretation 
of  the  intensity  dependence  of  angular-resolved 
spectra  and,  in  particular,  to  the  understanding  of 
some  experimental  observations  of  less  aligned 
fragments  with  increasing  field  strength  [6].  Work 
is  in  progress  in  our  group  for  a  more  detailed 
study  of  the  experimental  data  of  [6]. 
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ABSTRACT:  We  theoretically  investigated  how  the  formation  of  oxygen  vacancies 
and  the  addition  of  niobium  and  chromium  atoms  as  dopants  modify  the  varistor 
properties  of  Ti02.  The  calculations  were  carried  out  at  the  HF  level  using  a  contracted 
basis  set,  developed  by  Huzinaga  et  al.,  to  represent  the  atomic  centers  on  the  (110) 
surface  for  the  large  (Ti02)i5  cluster  model.  The  change  of  the  values  for  the  net  atomic 
charges  and  band  gap  after  oxygen  vacancy  formation  and  the  presence  of  dopants  in  the 
lattice  are  analyzed  and  discussed.  It  is  shown  that  the  formation  of  oxygen  vacancies 
decreases  the  band  gap  while  an  opposite  effect  is  found  when  dopants  are  located  in  the 
reduced  surface.  The  theoretical  results  are  compared  with  available  experimental  data. 
A  plausible  explanation  of  the  varistor  behavior  of  this  system  is  proposed.  ©  1997  John 
Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  625-631,  1997 
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Introduction 

In  recent  years,  interest  in  the  surface  chemistry 
of  metal  oxides  has  rapidly  grown.  The  en¬ 
hanced  attention  is  related  to  different  types  of 
catalytic  reactions  on  oxide  surfaces  and  the  prepa¬ 
ration  of  thin  films  of  these  materials  with  impor¬ 
tant  technological  applications  [1-11].  Metal  ox¬ 
ides,  such  as  ZnO,  Sn02,  and  Ti02,  are  materials 
with  varistor  properties  that  are  used  as  semicon¬ 
ductor  devices  which  present  high  nonlinearity 
between  the  current  density  and  electric  field,  thus 
being  suitable  for  transient  voltage  suppression 
[1-3].  Most  of  the  chemistry  of  metal  oxides  is  due 
to  the  presence  of  highly  reactive  defective  sites 
such  as  cation  or  anion  vacancies  and/or  a  doping 
process  by  substitution  of  metal  cations.  As  most 
of  the  surfaces  studied  experimentally  contain  de¬ 
fects  resulting  from  preparation  conditions  or  from 
ionic  or  electronic  bombardment,  one  is  led,  after 
the  theoretical  study  of  ideal  surfaces,  to  consider 
the  case  of  surfaces  containing  defects.  Once  the 
vacancies  are  formed,  their  electronic  nature  can 
be  changed  by  adding  or  removing  electrons. 
Therefore,  the  electronic  behavior  is  the  key  con¬ 
cept  for  understanding  the  operation. 

In  particular,  different  research  groups  [2-5] 
have  studied  doped  Ti02,  ceramics  having  useful 
varistor  properties,  with  nonlinearity  coefficients 
(a)  values  in  the  range  of  a  =  3-12,  a  being 
defined  by  the  relation  I  =  where  I  is  the 
current;  V,  the  voltage;  and  K,  a  proportionality 
constant  [8].  The  presence  of  this  potential  barrier 
is  due  to  the  creation  of  defects  formed  during 
sintering  of  Ti02  systems.  The  lack  of  lattice  peri¬ 
odicity  due  to  intrinsic  defects  leads  to  a  high 
density  of  structural  defects  that  can  originate  a 
potential  barrier  associated  to  a  double  space 
charge  distribution  and  this  phenomenon  estab¬ 
lishes  variable  resistance  as  a  function  of  the  ap¬ 
plied  electric  field  to  the  solid. 

Understanding  the  chemical  and  physical  prop¬ 
erties  of  local  active  sites  of  metal  oxides  has  a 
twofold  interest:  First,  from  a  fundamental  point 
of  view,  it  is  interesting  to  see  which  new  features 
are  induced  by  the  presence  of  defects  in  the 
surface  and  how  these  surface  defects  are  related 
to  the  bulk  electronic  and  atomic  structure.  Sec¬ 
ond,  the  practical  interest  results  from  the  numer¬ 


ous  technological  applications  where  they  are  used. 
Among  these,  varistor  properties  are  probably  the 
most  important  for  us. 

This  is  a  collaborative  project  between  experi¬ 
mental  and  theoretical  research  groups.  Our  in¬ 
creasing  experimental  efforts  [7,8,12-14]  can  be 
accompanied  by  quantum  chemical  studies  of  local 
defects;  among  them,  the  cluster  approach  [5-17] 
is  probably  the  most  widely  adopted  and  many 
successful  applications  of  this  model  have  been 
made  on  Ti02  [18,19]  and  related  systems  [20-23]. 
The  aim  of  this  work  was  to  use  ab  initio  tech¬ 
niques  to  investigate  the  electronic  structures  of 
doped  Ti02  systems  in  order  to  understand,  at  the 
molecular  level,  the  varistor  properties  of  Ti02 
and  to  rationalize  the  role  played  by  the  oxygen 
vacancies  and  the  substitution  of  Mb  and  Cr  atoms, 
as  dopants,  replacing  Ti  atoms  in  the  lattice. 

The  article  is  organized  as  follows:  The  model 
and  method  of  calculation  are  summarized  in  the 
next  section.  The  results  and  discussion  are  pre¬ 
sented  in  the  third  section,  A  short  section  of 
conclusions  closes  this  article. 


Models  and  Method  of  Calculation 

Ti02  crystallizes  in  the  rutile  structure,  which 
has  tetragonal  symmetry.  There  are  two  for¬ 
mula  units  per  primitive  unit  cell,  with  the  three- 
fold-coordinated  oxygen  atoms  forming  a  dis¬ 
torted  octahedron  around  the  titanium  atoms  [24]. 
This  material  has  a  relatively  strong  ionic  character 
and  its  bulk  valence  band  has  a  dominant  O-p 
orbital  character  and  the  orbital  composition  of  the 
lower  conduction  bands  result  from  atomic  Ti-3d 
states.  Experimental  analysis  [25-28]  shows  that 
the  (110)  surface  is  the  most  stable  face  of  Ti02 
and  oxygen  vacancies  are  by  far  the  most  fre¬ 
quently  occurring  defect.  Therefore,  we  restrict 
ourselves  here  to  the  study  of  this  structure  the 
different  types  of  defects  and  illustrate  some  basic 
aspects  of  the  varistor  phenomena.  This  face  is  a 
nonpolar  surface  that  can  be  studied  even  in  its 
bulk-derived  experimental  geometry.  The  rutile 
structure  can  be  regarded  as  consisting  of  (110) 
planes  of  atoms  containing  both  metal,  Ti,  and 
oxygen,  O,  atoms,  separated  by  planes  containing 
oxygen  alone,  so  that  the  sequence  of  planes  is  O 
— Ti202 — O — O — Ti202 — O,  etc.  The  entire  crys¬ 
tal  can  then  be  built  up  of  the  symmetrical  three- 
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plane  O — Ti202 — O  units.  We  selected  the  large 
(Ti02)i5  cluster  model,  shown  in  Figure  1,  derived 
from  the  crystallographic  data. 

Models  for  the  stoichiometric  (110)  surface  of 
the  titanium  oxide  surface  in  the  oxidized  and 
reduced  forms,  as  well  as  doped  with  niobium  and 
chromium  atoms,  were  constructed  in  this  work 
by  using  the  usual  repeating  slab  geometry.  The 
following  models  were  selected:  (Ti02)i5  (I)  to 
represent  the  fully  oxidized  surface;  (Ti02)i4Ti0 
(II)  and  (Ti02)i3(Ti0)2  (HI)  were  made  by  remov¬ 
ing  one  oxygen  in  position  16  and  two  oxygens  in 
positions  16  and  17  from  model  I,  respectively. 
(Ti02)i3(Cr0)2  (IV)  to  represent  the  surface  doped 
with  the  Cr  atom  in  positions  14  and  14'  and 
(Ti02)i3(Nb0)2  to  represent  the  surface  doped  with 
the  Nb  atom  in  sites  3  and  3'  (V)  and  sites  1  and  1' 

(VI) ;  (Ti02)ii(Nb0)2(Cr0)2  corresponds  to  the  sur¬ 
face  simultaneously  doped  with  the  Nb  atom  in 
the  1  and  V  sites  and  Cr  in  the  14  and  14'  sites 

(VII) .  In  model  VIII,  the  Nb  atom  is  placed  in  the 
3  and  3'  sites,  while  the  Cr  atom  is  located  in  the 
14  and  14'  sites,  and  in  model  IX,  Nb  is  placed  in 
the  14  and  14'  sites,  while  the  Cr  atom  is  located  in 
the  1  and  1'  sites. 

The  calculations  were  carried  out  with  the 
Gaussian94  package  [29].  The  ab  initio  Hartree- 


Fock  (HF)  level  of  the  theory  was  employed  with 
the  contracted  basis  set  for  titanium  (4322 /42 /3), 
oxygen  (33/3),  niobium  (43333/433/43), 
and  chromium  (4322/42/3)  atoms  developed  by 
Fluzinaga  et  al.  [30,31].  The  orbital  exponent  and 
contraction  coefficients  of  the  atomic  wave  func¬ 
tions  were  determined  by  optimizing  the  total  en¬ 
ergies  as  explained  in  detail  in  a  previous  article  of 
Sakai  et  al.  [32],  All  calculations  were  carried  out 
in  the  ground  singlet  electronic  state.  HOMO- 
LUMO  corresponds  to  the  band  gap  value. 

In  addition,  we  try  to  determine  how  the 
boundary  conditions  can  affect  the  different  values 
of  the  band  gap  and  net  atomic  charges  of  the 
different  cluster  models.  The  computational  ap¬ 
proach  for  the  theoretical  treatment  is  based  on 
continuum  models  [33]:  The  cluster  is  embedded 
in  a  cavity  while  the  environment,  treated  as  a 
continuous  medium  having  the  same  dielectric 
constant  as  the  bulk  system,  is  incorporated  into 
the  solute  Hamiltonian  as  a  perturbation.  In  this 
reaction-field  approach,  the  bulk  system  is  polar¬ 
ized  by  the  environment.  The  electronic  distribu¬ 
tion  of  the  cluster  system  polarizes  the  continuum 
which  generates  an  electric  field  inside  the  cavity, 
which,  in  turn,  affects  the  solute's  geometry  and 
electronic  structure.  The  implementation  of  this 


FIGURE  1.  (Ti02)i5  cluster  model  for  the  etoichiometric  (110)  surface.  Black  and  gray  circles  represent  oxygen  and 
metal  ions,  respectively. 
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interaction  scheme  is  achieved  through  the  self- 
consistent  reaction  field  method  developed  ini¬ 
tially  by  Tapia  [34,35].  In  this  work,  we  employed 
the  continuum  model  of  Rivail  et  al.  [36-38]  based 
on  the  use  of  cavities  and  a  multipolar  expansion 
of  the  solute  electrostatic  potential.  Calculations 
were  made  using  ellipsoidal  cavities  surrounded 
by  a  continuum  of  the  dielectric  constant  (s  =  86.0, 
bulk  Ti02  rutile)  and  multipolar  expansion  up  to 
order  6.  An  analysis  and  comparison  between  the 
results  of  the  different  cluster  models  obtained  in 
vacuo  and  by  taking  into  account  a  continuum 
representation  of  the  environment  show  that  the 
values  of  the  band  gap  and  net  atomic  charges  are 
similar.  For  this  reason,  only  the  results  given  in 
vacuo  are  presented  in  this  work. 


Results  and  Discussion 

Charges  on  atoms  are  calculated  using  the  Mul- 
liken  population  analysis.  In  addition,  to  improve 
the  population  representation,  we  selected  the  nat¬ 
ural  population  analysis  based  on  Weinhold's  nat¬ 
ural  bond  orbital  (NBO)  [39-41].  The  Mulliken 
charges  are  always  lower  than  the  NBO,  but  simi¬ 
lar  trends  are  obtained  with  respect  to  the  varia¬ 
tion  between  pure  (I)  and  doped  (IV,  V,  VI,  VII, 
VIII,  and  IX)  clusters  and  clusters  with  vacancies 
(II,  III).  It  seems  difficult  to  discuss  the  absolute 
values  of  the  charge  on  the  individual  Ti  and  O 
atoms;  however,  some  useful  information  could  be 
drawn  from  the  results  by  comparing  the  charge 
on  the  atoms  in  the  corresponding  model. 

The  values  of  the  net  atomic  charge  and  band 
gap  for  the  nine  models  are  presented  in  Table  I. 


The  results  point  out  that  the  surface  is  reduced 
when  oxygen  vacancies  are  formed  in  the  II  and 
III  models,  with  a  corresponding  decrease  in  the 
charge  density  of  titanium,  i.e.,  the  and 
atoms  decrease  their  positive  charge  whereas 
changes  in  the  charge  for  more  distant  atoms  are 
very  small.  This  result  can  be  represented  by  the 
following  equation: 

Ti02 

TiO^  ^  Ti'^i  +  X,-  +  OJ  +  (1) 

in  which  the  species  Tij^  are  formed. 

Therefore,  the  creation  of  intrinsic  defects  in  the 
structure  of  titanium  oxide  promotes  the  reduction 
of  the  titanium  atoms.  The  ab  initio  results  repro¬ 
duce,  in  a  reasonable  way,  this  redox  reaction 
which  occurs  in  n-type  semiconductors  such  as 
Ti02.  As  experimentally  verified  in  our  laboratory 
[7,8,12-14],  there  is  a  decreasing  of  the  band  gap 
value  in  the  formation  of  n-type  semiconductors, 
which  is  due  to  the  V^-  and  V^-formed  species.  The 
results  show  that  this  effect  is  proportional  to  the 
increase  of  the  concentration  of  the  vacancies.  The 
two-step  reduction  of  the  titanium  atom  can  be 
rationalized  on  the  basis  of  the  simulation  of  one 
and  two  oxygen  vacancies'  formation.  The  per- 
centual  variation  of  the  charge  density  of  oxidized 
titanium  to  reduced  titanium,  Ti|4  and  is 

around  40  and  6%,  respectively,  going  from  model 
I  to  model  II.  When  the  second  vacancy  is  formed, 
Tii4  experiences  a  percentual  reduction  of  45%  in 
the  charge  density.  These  results  suggest  the  for¬ 
mation  of  the  species  Tij^.  Marucco  et  al.  [43] 
proposed  that  the  defects  induced  by  the  reduction 
process  in  titanium  dioxide  are  oxygen  vacancies 
and  interstitial  titanium,  their  relative  importance 


TABLE  I 


Charge  on  titanium,  oxygen,  chromium,  and  niobium  atoms  and  band  gap  (eV). 

Atoms 

I 

il 

HI 

iV 

V 

VI 

VII 

VIII 

IX 

1.70 

1.60 

1.46 

1.49 

1.37 

1.31 

1.07 

1.42 

1.51 

^(14) 

1.56 

1.60 

0.87 

0.97 

1.19 

1.22 

1,15 

1.20 

1.07 

^(14') 

1.56 

0.94 

0.87 

0.97 

1.19 

1.22 

1.15 

1.20 

1.07 

X(i) 

1.71 

1.62 

1.55 

1.70 

1.63 

1.60 

1.61 

1.62 

1.32 

^(1’) 

1.71 

1.62 

1.55 

1.70 

1.63 

1.60 

1.61 

1.62 

1.32 

X(3) 

1.73 

1.64 

1.71 

1.77 

1.68 

1.73 

1.67 

1.57 

1.75 

X(3') 

Band 

1.73 

1.74 

1.71 

1.77 

1.68 

1.73 

1.67 

1.57 

1.75 

gap 

3.53 

2.28 

1.98 

3.64 

2.57 

3.64 

3.20 

2.62 

3.12 

X  is  Ti,  Nb,  or  Cr  in  the  x  and  x'  positions  on  the  crystal  cluster  of  1102- 
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being  governed  mainly  by  temperature.  Oxygen 
vacancies  dominate  below  1000°C.  To  interpret  the 
electrical  behavior,  one  can  assume  that  an  inter¬ 
mediate  ion  between  and  Ti^"^,  the  Ti^^  ion, 
can  be  formed  during  vacancy  formation,  accord¬ 
ing  to  the  equation 

2Ti^i  +  2TiVi  +  V,  +  (2) 

This  assumption  is  also  in  agreement  with  the 
experimental  work  of  Watanabe  et  al.  [9]  which 
studied  the  reduction  of  the  Sn02  surface  associ¬ 
ated  with  the  change  of  the  valence  state  from 
Sn^^  to  Sn"*^^,  which  is  similar  to  that  observed  in 
the  Ti02.  These  authors  pointed  out  a  decreasing 
of  the  band  gap  value  associated  with  an  incre¬ 
ment  of  oxygen  vacancies.  On  the  basis  of  their 
results,  they  proposed  the  coexistence  of  an  inter¬ 
mediate  Sn304  structure  in  the  reduced  crystal 
(Sn02_  Y)/  which  characterizes  the  Sn^^  ion.  In  the 
present  work,  the  substitution  of  the  Ti  atom  by  a 
metal,  either  Nb  or  Cr,  should  involve  a  change  in 
the  stoichiometric  plane  [O] — [Ti02] — [O],  with 
formal  charges  [2  —  ] — [4  +  ] — [2  -  ],  by  the 
planes  [O] — [Cr202] — [O]  or  [O] — [Nb202] — [O], 
with  formal  charges  [2  —  ] — [2  +  ] — [2  —  ]  and 
[2  -  ] — [6  +  ] — [2  -  ],  respectively.  The  results  of 
Table  I  support  this  suggestion,  showing  the  coex¬ 
istence  of  two  species  with  different  charge  densi¬ 
ties  in  the  site  after  reduction:  the  Ti’^^  and  the 
Ti^^  species. 

After  substitution  of  two  Cr  atoms  for  two  Ti 
atoms,  the  IV  model,  the  Ti^  atoms  increase  their 
positive  charge,  around  10%,  while  the  0^2  atoms 
increase  their  negative  charge.  After  substitution 
of  two  Nb  atoms  for  two  Ti  atoms,  the  V  model, 
Ti5,  Till,  Tii5  atoms  decrease  their  positive 
charge  around  6%,  while  Tii  and  Tii4  atoms  in¬ 
crease  their  positive  charge  around  5  and  37%, 
respectively,  and  O4  atoms  decrease  their  negative 
charge  while  O12  atoms  increase  their  negative 
charge.  The  double  substitution  of  niobium  and 
chromium  atoms  in  Ti  sites,  the  VI  model,  yields  a 
decreasing  of  the  positive  charge  for  Ti5,  Tin, 

Tii5  atoms,  4,  6,  and  10%,  respectively,  while  Ti^ 
atoms  increase  their  positive  charge  about  40% 
and  O4  and  Oiq  atoms  increase  their  negative 
charge.  For  the  VII  model,  Ti3,  Tin,  Tii5 
atoms  decrease  their  positive  charge  around  2,  4, 
and  27%,  respectively,  and  O9,  Oiq,  and  atoms 
increase  their  negative  charge  while  O7  atoms  de¬ 
crease  their  negative  charge.  For  the  VIII  model. 


Til  atoms  increase  their  positive  charge  about  5%, 
while  O12  atoms  increase  their  negative  charge. 
For  the  IX  model.  Tin  atoms  decrease  their  posi¬ 
tive  charge  about  9%,  while  Ti^  atoms  increase 
their  positive  charge  about  23%. 

Table  I  presents  the  calculated  values  of  the 
band  gap,  which  shows  the  influence  of  vacancy 
formation  and  the  presence  of  dopants  on  the 
surface  of  Ti02.  There  is  a  partial  oxidation  when 
chromium  substitutes  titanium  in  the  lattice  and 
the  species  Cr^  and  V^  are  formed  to  preserve  the 
electroneutrality,  thus  decreasing  the  conductivity 
of  Ti02.  When  the  reduced  Ti02  is  doped  with 
chromium  or  niobium,  the  band  gap  value  in¬ 
creases  and  Ti02  becomes  an  n-type  semiconduc¬ 
tor  when  a  vacancy  is  formed.  Our  results  for  the 
doped  Ti02  agree  with  the  formation  of  an  n-type 
semiconductor;  however,  this  effect  is  highly  de¬ 
pendent  on  the  location  of  the  dopant  in  the  lattice. 
The  band  gap  presents  the  lowest  value  when  the 
doping  process  takes  place  in  the  Ti3  or  Ti3.  (V 
and  VIII  models).  Similar  values  of  the  band  gap 
are  found  when  two  Cr  atoms  or  two  Nb  atoms 
are  located  in  position  14,  14',  or  1,1',  respectively 
(IV  and  VI  models).  There  is  a  partial  oxidation 
when  Cr  or  Nb  substitutes  titanium  in  the  lattice 
and  the  species  (X  =  Cr,  Nb)  and  Vq  are  formed 
to  preserve  the  electroneutrality,  thus  decreasing 
the  conductivity  of  Ti02.  This  result  agrees  with 
the  experimental  results  of  Gupta  and  Carlson  [4] 
and  Leite  et  al.  [7]  which  characterize  the  degrada¬ 
tion  of  the  varistor.  For  example,  the  addition  of 
Nb205  in  small  amounts  to  the  Ti02  system  leads 
to  an  increase  of  the  electronic  conductivity  in  the 
Ti02  lattice  [13,14].  It  is  promoted  by  the  forma¬ 
tion  of  defects  in  the  surface  and  in  the  interior  of 
the  grain,  tending  to  electroneutralize  mutually. 
The  varistor  behavior  of  Ti02  can  be  explained  by 
the  introduction  of  defects  in  the  crystal  lattice  that 
are  responsible  for  the  formation  of  Schottky-type 
potential  barriers  at  the  grain  boundaries.  By  anal¬ 
ogy  with  the  atomic  defect  model  proposed  by 
Gupta  and  Carlson  [4]  for  the  ZnO  varistor,  the 
potential  barrier  is  formed  by  intrinsic  defects  of 
Ti02  (oxygen  vacancy  and  interstitial  ionized  tita¬ 
nium  atoms)  and  by  extrinsic  defects  created  by 
solid  substitution  of  dopants  and  negative  charges 
at  the  interface.  To  explain  the  adsorption  of  oxy¬ 
gen  on  the  Ti02  surface,  Bueno  et  al.  [8]  proposed 
reactions  1  and  2  to  be  responsible  for  the  forma¬ 
tion  of  a  potential  barrier  and,  consequently,  for 
the  nonlinearity  of  Ti02-based  varistors. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


629 


SAMBRANO  ET  AL 


Conclusions 

We  carried  out  an  ab  initio  study  on  the  Ti02 
system  in  order  to  rationalize  our  previous  experh 
mental  results.  The  varistor  properties  of  this  ma¬ 
terial  were  analyzed  taking  into  account  the  for¬ 
mation  of  oxygen  vacancies  and  the  presence  of 
Nb  and  Cr  as  dopants  in  the  lattice.  Large  cluster 
models  were  selected  and  a  contracted  basis  set 
developed  by  Huzinaga  et  al.  was  used  to  repre¬ 
sent  the  atomic  centers.  The  specific  details  of  the 
process  may  change  at  higher  levels  of  theory  (e.g., 
inclusion  of  correlation  energy),  but  despite  the 
approximate  procedure  of  the  calculations  (e.g., 
band  gap  and  Mulliken  charges  values  obtained  at 
HF  level  and  embedding  representation  of  cluster 
models)  employed  here,  some  important  features 
were  clarified.  Taken  together  with  the  results  of 
our  previous  experimental  studies,  the  following 
conclusions  can  be  drawn  from  the  results  re¬ 
ported  in  this  study;  (i)  The  ab  initio  calculations 
show  that  the  formation  of  oxygen  vacancies  in 
Ti02  leads  to  a  decrease  in  the  band  gap  value,  (ii) 
Two  different  valence  states  of  titanium  may  coex¬ 
ist,  probably  Ti*^^  and  Ti"^^.  (hi)  The  band  gap 
values  are  strongly  affected  by  the  doping  process, 
(iv)  The  global  analysis  of  the  results  agrees  with 
the  experimental  data  and  the  physical  conception 
of  the  performance  of  the  Ti02  varistor,  (v)  Volt¬ 
age  barrier  formation  and  degradation,  associated 
with  the  band  gap,  can  be  explained  by  a  redox 
reaction  between  atomic  defects.  This  fact  can  be 
of  relevance  for  the  understanding  of  varistor 
properties  of  Ti02  and  related  materials. 
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ABSTRACT:  The  geometry  dependence  of  the  exchange  integrals  and  the  total 
energies  of  sulfur-bridged  complexes  of  the  types  L5NiSNiL5,  L4NiS2NiL4,  and 
L3NiS3NiL3  is  studied  by  quantum  chemical  ab  initio  methods.  The  linear  monobridged 
complex  is  antiferromagnetic,  the  bi-  and  triply-bridged  complexes  are  ferromagnetic  for 
NiSNi  angles  between  85°  and  100°  and  antiferromagnetic  for  smaller  and  larger  angles. 
The  superexchange  mechanism  is  analyzed,  and  a  comparison  with  oxygen-bridged 
complexes  and  experimental  data  is  performed.  ©  1997  John  Wiley  Sons,  Inc,  Int  J  Quant 
Chem  65:  633-641,  1997 


Introduction 

Though  sulfur-bridged  binuclear  Ni(II)  units 
are  the  active  sites  in  some  enzymes,  e.g., 
urease  [1-3],  not  much  is  known  about  the  mag¬ 
netic  coupling  of  the  spins  at  the  two  Ni^’^  cations 
in  S-bridged  binuclear  Ni(II)  complexes  of  the  gen¬ 
eral  form  LNiS„NiL,  n  =  1,2,3,  with  one,  two,  or 
three  sulfur  bridges  (L  stands  for  any  terminal 
ligands).  In  urease,  it  seems  that  the  Ni(ll)  ions 
are  all  high  spin  and  exhibit  a  weak  antiferromag¬ 
netic  coupling  (/  =  —  6  cm"^)  [1-3].  Trinuclear 
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complexes  of  the  form  LNiS3NiS3CoL  and 
LNiS3NiS3NiL  with  octahedrally  coordinated 
Ni(II)  ions  have  been  recently  synthesized  by  Beis- 
sel  et  al.  [4].  The  analysis  of  magnetic  susceptibil¬ 
ity  measurements  yielded  values  of  about  -30 
cm“^  for  the  exchange  coupling  between  two  adja¬ 
cent  Ni(II)  ions  bridged  by  three  S  atoms  (which, 
however,  are  not  isolated  S  atoms  or  S^“  anions, 
but  belong  to  the  ligand  system).  Such  values 
indicate  that  the  antiferromagnetic  coupling  is 
stronger  than  in  similar  oxygen-bridged  com¬ 
plexes,  but  it  is  not  certain  whether  differences  in 
the  geometries,  partial  d-occupations  of  the  sulfur 
atoms  in  the  bridging  unit  or  other  reasons  are  the 
source  for  the  stronger  antiferromagnetism  in  S- 
bridged  systems.  One  trinuclear  sulfur-bridged 
nickel  thiolate  cluster  with  an  almost  planar  Ni3S3 
ring  has  been  reported  recently  in  the  literature  as 
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well  [5].  This  complex,  however,  contains  Ni^^ 
cations  in  square-planar  coordination  and  is  dia¬ 
magnetic.  To  the  best  of  our  knowledge,  no  further 
bi-  or  trinuclear  S-bridged  Ni(II)  complexes  are 
known  for  which  both  X-ray  structures  and  mag¬ 
netic  susceptibility  data  have  been  determined. 

In  a  previous  work  [6]  we  have  presented  ab 
initio  calculations  for  the  exchange  coupling  in 
oxygen-bridged  NiONi  and  Ni02Ni  systems,  to¬ 
gether  with  an  analysis  of  the  geometry  depen¬ 
dence  of  the  exchange  integral  /.  The  present  work 
extends  this  study  to  S-bridged  Ni(II)  complexes  of 
the  form  LNiS^NiL,  n  =  1,2,3.  Since  the  Ni-S 
distances  vary  considerably  in  different  com¬ 
pounds,  we  analyzed  the  geometry  dependence  of 
the  total  energies  and  of  the  exchange  integrals  in 
the  mono-  and  bibridged  complexes  and  compared 
the  results  to  those  of  oxygen-bridged  complexes. 

The  study  is  organized  as  follows:  The  second 
section  gives  a  brief  description  of  the  method  of 
calculation.  The  third  section  contains  the  results 
for  the  linear  monobridged  complexes,  the  fourth 
and  fifth  sections  those  for  the  bi-bridged  and 
triply-bridged  complexes,  respectively.  The  last 
section  gives  a  short  summary. 


Method  of  Calculation 

The  present  calculations  have  been  performed 
in  essentially  the  same  way  as  in  our  previous 
study  [6]  on  oxygen-bridged  Ni(II)  complexes. 
They  consist  of  mainly  three  steps: 

1.  A  self-consistent  field  (SCF)  or,  more  pre¬ 
cisely,  a  restricted  open-shell  Hartree-Fock 
(ROHF)  calculation  for  the  high-multiplicity 
state:  For  all  complexes  in  this  study  this  is  a 
quintet  state  since  the  two  Ni(II)  ions,  each  of 
them  with  a  configuration  and  a  spatially 
nondegenerate  ^^2^  (S  =  1)  ground  state  in 
octahedral  coordination,  can  be  coupled  to  a 
quintet  (S  =  2),  a  triplet  (S  =  1),  and  a  sin¬ 
glet  (S  =  0)  state. 

2.  A  full  valence  configuration  interaction  (VCI) 
calculation  [7]  using  the  ROHF  orbitals  of  the 
first  step,  or  alternatively  a  complete  active 
space  SCF  (CASSCF)  calculation  [8]  for  all 
three  states  (quintet,  triplet,  singlet)  simulta¬ 
neously  or  for  each  of  the  states  separately  in 
order  to  reoptimize  the  orbitals  for  the  triplet 


and  singlet  states.  There  is  virtually  no  differ¬ 
ence  between  these  three  treatments  since  the 
three  electronic  states  are  very  close  in  en¬ 
ergy.  The  active  space  for  the  VCI  and 
CASSCF  calculations  consisted  of  the  four 
c^-type  3d  atomic  orbitals  (AOs)  at  the  two 
Ni  atoms  (for  more  details  see  Ref.  [6]. 

3.  Dynamic  correlation  effects  are  included  on 
top  of  the  VCI  or  CASSCF  calculation  by  the 
multiconfiguration  coupled  electron  pair  ap¬ 
proach  (MC-CEPA)  method  [9].  We  have  al¬ 
ways  correlated  the  ten  3d  AOs  at  the  metal 
atoms  and  the  3s, 3  p  valence  orbitals  at  the 
bridging  S  atoms.  In  general,  the  averaged 
coupled  pair  functional  (ACPF)  variant  of 
our  MC-CEPA  program  [9]  has  been  used, 
since  this  variant  yields  the  most  reliable 
values  for  the  exchange  splittings  [6]. 

It  has  been  found  before  [6,  10-12]  that  the 
inclusion  of  dynamic  correlation  effects  is  neces¬ 
sary  if  one  wants  to  calculate  reliable  values  for 
exchange  integrals.  The  main  reason  is  that  the 
antiferromagnetic  contribution,  /^p,  to  the  total 
exchange  integral  /  is  determined  by  the  extent  to 
which  "ionic"  (charge  transfer)  configurations  are 
mixed  to  the  dominant  "neutral"  (covalent)  ones 
in  the  VCI.  As  long  as  the  ionic  configurations  are 
built  up  from  orbitals  which  have  been  optimized 
for  covalent  states  and  no  relaxation  effects  are 
accounted  for,  their  energies  are  too  high  and 
consequently  their  contribution  to  /^p  is  too  small. 
In  a  separate  study  [10]  we  present  "effective"  Cl 
calculations  in  which  the  VCI  matrix  is  modified  in 
such  a  way  as  to  simulate  these  relaxation  effects 
in  an  effective  and  cheap  manner.  In  the  present 
work  we  have  simply  scaled  the  antiferromagnetic 
contribution  with  a  weighting  factor,  which  has 
determined  by  a  comparison  between  VCI  and 
ACPF  results.  More  details  are  given  in  the  section 
on  the  bridged  L4NiS2NiL4  complex. 

In  order  to  save  computer  time,  we  have  re¬ 
placed  the  bulky  terminal  ligands,  which  are  nec¬ 
essary  in  the  experimental  work  to  stabilize  the 
rather  unstable  bi-  and  trinuclear  complexes,  by 
the  He-like  model  ligands  introduced  in  the  earlier 
studies  of  this  series  [6,  11,  12].  In  the  present 
calculations  the  same  parameters  have  been  used 
as  optimized  before  for  the  NiONi  and  Ni02Ni 
complexes  [6],  i.e.,  an  effective  charge  of  1.80  for 
the  He-like  model  ligands  L  and  a  Ni-L  distance 
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o 

of  1.725  A.  In  all  complexes  the  number  of  termi¬ 
nal  ligands  was  chosen  in  such  a  way  that  each 
metal  ion  had  a  full  though  distorted  octahedral 
coordination. 

The  two  Gaussian  basis  sets  that  have  been 
used  in  the  present  calculations  are  summarized  in 
Table  1.  The  basis  set  for  the  model  ligand  is 
identical  to  the  one  optimized  before  for  NiONi 
[6].  For  Ni  and  S,  the  basis  I  has  double  zeta  (DZ) 
quality,  basis  11  has  triple  zeta  plus  polarization 
(TZP)  quality. 

Since  the  difference  in  the  antiferromagnetic 
coupling  through  O  and  S  bridges  may  be  at¬ 
tributed  to  a  partial  d-occupation  in  the  bridg¬ 
ing  ions,  we  have  systematically  varied  the  d  ex¬ 
ponent  at  the  S-atom  between  0.0  (no  d  functions) 
and  1.5  and  monitored  its  influence  on  both  the 
total  energies  and  the  value  of  /,  i.e.,  the  splitting 
between  the  lowest  singlet,  triplet,  and  quintet 
states.  These  tests  have  been  performed  for  a  linear 
NiSNi  complex  at  a  fixed  Ni-S  distance  of  2.20  A 
(i.e.,  slightly  shorter  than  the  equilibrium  distance, 
compare  next  section)  and  for  a  bent  NiS2Ni  com¬ 
plex  in  the  vicinity  of  its  SCF  equilibrium  structure 
with  R(Ni-S)  =  2.35  A  and  ZNiSNi  =  90°  (see 
fourth  section).  Basis  set  I  was  used  for  these  tests. 
The  results  of  these  calculations  can  be  summa¬ 
rized  shortly  as  follows:  The  optimum  value  for 
the  d  exponent — as  judged  from  the  gain  in  the 
total  energies — is  about  0.2  at  the  CASSCF  and  0.3 
at  the  ACPF  level.  The  gain  in  total  energy  is 
rather  small  at  the  CASSCF  level  ( ~  0.020  a.u.  for 
the  linear  NiSNi  complex  and  -  0.050  a.u.  for 
NiS2Ni),  but  large  at  the  ACPF  level  («  0.120  a.u. 
for  NiSNi;  NiS2Ni  was  not  treated  at  this  level). 
The  /  values,  on  the  other  hand,  are  hardly  af¬ 
fected  by  the  change  of  the  d  exponent:  At  the 


CASSCF  level  there  is  no  noticeable  influence  of 
the  d  functions  on  /.  This  holds  both  for  the  NiSNi 
and  the  NiS2Ni  complexes,  but  for  the  bibridged 
complex  with  an  NiSNi  angle  of  90°  the  antiferro¬ 
magnetic  part  of  the  magnetic  coupling  is  so  small 
that  statements  concerning  the  d  contribution  are 
not  conclusive.  At  the  ACPF  level,  the  contribution 
of  the  d  AOs  to  /  is  in  the  order  of  10%  for  the 
linear  NiSNi  unit. 

The  results  of  these  basis  tests  indicate  that  the 
d  orbitals  at  the  bridging  sulfur  atoms  do  not 
directly  influence  the  active  orbitals,  but  are  only 
needed  for  describing  dynamic  correlation  and  re¬ 
laxation  effects.  All  calculations  reported  in  the 
next  sections  are  performed  with  basis  II  and  a  d 
exponent  of  0.3  (compare  Table  I). 


Linear  L5]\iSNiL5  Complex 

Table  II  contains  our  results  for  the  total  energy 
of  the  ground  state  and  for  the  exchange 

integral  /  in  the  linear  monobridged  L5NiSNiL5 
complex,  as  functions  of  the  Ni-S  distance.  Only 
the  results  of  the  ACPF  variant  of  our  MC-CEPA 
program  [9]  are  included.  As  mentioned  above,  all 
metal  3d  and  the  sulfur  3s3p  orbitals  have  been 
correlated. 

From  the  data  of  Table  II  one  can  conclude  that 

o 

the  equilibrium  distance  amounts  to  2.32  A  at  the 
yCI  (or  SCF)  level  and  is  slightly  shortened  to  2.28 
A  at  the  ACPF  level.  A  direct  comparison  with 
experiment  is  not  possible  since  no  X-ray  structure 
of  a  complex  with  a  linear  NiSNi  bridge  is  known 
and  since  our  calculations  have  been  performed 
with  the  model  ligands,  which  make  the  complex 
charge  neutral.  Bulky  terminal  ligands,  effective 


TABLE  I _ 

Gaussian  basis  sets  for  S-bridged  binuclear  Ni(ll)  complexes. 


Atom 

Type 

Contraction 

Ref. 

Basis  1 

Ligand 

TZ 

7s  ^  3s(5,2*1)  (30.2,9.8,3,5, 1.4,0.55,0.17,0.05) 

6 

Ni 

DZ 

12s6p4d  -»  7,5*1;  4,2*1;  2, 2*1 

13 

S 

DZ 

9s5p  ^  4,  5*1;  2,3*1  +  1s(0.1),  1p(0.1) 

14 

Basis  II 

Ligand 

TZ 

As  basis  I 

Ni 

TZP 

14s9p5d  10,4*1;  6,3*1;  3,2*1  +  s{0.35),  f(2.0) 

15 

S 

TZP 

11s7p  ^5,6*1;4,3*1  +2s(1.0, 0.075),  2p(1.0,  0.075),  lGf(0.3) 

14 
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TABLE  II _ 

VCI  and  ACPF  results  for  the  total  energy  and  the  exchange  integral  J  in  the  linear  LgNiSNiLg  complex.^^ 


ft(Ni-S) 

A 

a.u. 

VCI 

^0  ^  1 
cm"^ 

VCI 

cm”'' 

VCI 

a.u. 

ACPF 

1 

cm”'' 

ACPF 

^1-^2 

cm”"' 

ACPF 

2.20 

-3433.083  115 

-44.1 

-45.2 

-3433.657  409 

-112.7 

-116.2 

2.25 

.087  559 

-39.1 

-39.9 

.659  846 

-101.7 

-104.5 

2.30 

.089  419 

-34.5 

-35.1 

.659  905 

-91.9 

-94.0 

2.35 

.089  163 

-30.5 

-31.1 

.658  081 

-83.3 

-84.9 

2.40 

0.87  171 

-27.2 

-27.5 

.654  691 

-76.2 

-77.2 

^L  = 
^  J 

He-like  model  ligand  [6]. 

,  =1/2[£(S  =  1)  -  e(S  =  0)]; 

=  1/4[£{S  =  2) 

-£(S  =  1)]. 

charges  on  the  complexes,  counter  ions,  etc.  might 
have  some  influence  on  the  exact  value  of  the 
equilibrium  distance,  in  particular  since  the  poten¬ 
tial  curve  is  rather  flat  and  its  minimum  is  much 
less  pronounced  than,  e.g.,  in  L5NiONiL5  [6].  On 
the  other  hand,  Ni-S  distances  of  about  2.30  A 
seem  reasonable  for  octahedrally  coordinated  Ni 
atoms  [1,  16,  17]. 

The  exchange  integral  /  is  negative  for  all  dis¬ 
tances  and  decays  nearly  exponentially  with  in¬ 
creasing  distance  jR,  both  at  the  VCI  and  the  ACPF 
levels.  As  for  the  linear  NiONi  unit  the  /  values  at 
the  ACPF  level,  i.e.,  after  inclusion  of  dynamical 
correlation  effects,  are  nearly  a  factor  of  3  larger 
than  at  the  VCI  level.  Again,  we  attribute  this 
failure  of  the  "'active  orbital"  approach  [18]  to  the 
fact  that  in  the  VCI  or  CASSCF  treatments  the 
energies  of  the  charge  transfer  states  are  calculated 
with  the  unrelaxed  orbitals  optimized  for  the  cova¬ 
lent  ground  state  [10].  Finally,  the  splitting  be¬ 
tween  the  singlet  ground  state  and  the  first  excited 
triplet  and  quintet  states  shows  a  regular  Lande 
pattern  with  a  ratio  2:4,  both  in  the  VCI  and  ACPF 
treatments. 

If  we  compare  the  absolute  value  of  the  ex¬ 
change  integral  in  the  linear  NiSNi  unit  (Table  II) 
with  our  previous  results  for  the  NiONi  bridge  [6] 
at  the  respective  equilibrium  distances,  we  find 
that  the  antiferromagnetic  coupling  through  the  S 
bridge  is  slightly,  but  not  substantially,  larger  than 
through  the  O  bridge:  For  L5NiONiL5  at  R(Ni-O) 
=  2.08  A  we  found  /  =  -24  cm"^  (VCI)  and  /  - 
—  80  cm“^  (best  ACPF  value)  [6]  while  the  corre¬ 
sponding  values  for  the  L5NiSNiL5  complex  at 
R(Ni-S)  =  2.30  A  are  —35  and  -94  cm“h  This  is 
again  an  indication  that  a  partial  occupation  of  the 
3d  AOs  in  the  bridging  sulfur  atom  is  no  impor¬ 


tant  effect  for  the  antiferromagnetic  coupling 
through  S  bridges.  It  is  more  important  that  J 
depends  quite  sensitively  on  the  Ni-S  distance. 


Bi-bridged  L4MS2lNiL4  Complex 

Figure  1  contains  contour  plots  for  (a)  the  total 
energy  of  the  lowest  quintet  state  and  (b)  the 
exchange  integral  of  the  bi-bridged  L4NiS2NiL4 
complex  as  functions  of  the  Ni-S  distance  and  the 
NiSNi  angle,  calculated  at  the  VCI  level  with  basis 
II  without  the  /  functions  at  the  Ni  atoms. 

Figure  1(a)  shows  that  the  potential  energy  sur¬ 
face  possesses  a  rather  shallow  minimum.  At  the 
VCI  level  the  equilibrium  geometry  corresponds  to 
a  Ni-S  distance  of  2.41  A  and  a  NiSNi  angle  of 
86.3°.  The  elongation  of  the  Ni-S  bond  length  as 
compared  to  the  value  for  the  NiSNi  unit  with  a 
single  linear  sulfur  bridge  (2.32  A,  previous  sec¬ 
tion)  and  the  fact  that  the  NiSNi  bond  angle  is 
markedly  below  90°  indicate  that  the  bulky  Sc¬ 
anlons  try  to  avoid  each  other.  The  direct  compari¬ 
son  with  experimental  data  is  again  difficult,  since 
most  Ni(II)  complexes  with  S-donor  ligands  have  a 
square  planar  coordination  [5,  17]  and,  again,  our 
model  ligands  are  not  directly  related  to  real  lig¬ 
ands.  However,  bond  angles  between  85°  and  94° 
have  been  reported  for  bi-bridged  NiS2Ni  units 
[17].  On  the  other  hand,  the  binuclear  Ni(II)  com¬ 
plex  with  two  SPh  bridges  and  octahedrally  coor¬ 
dinated  Ni(II)  ions,  as  synthesized  by  Baidya  et  al. 
[16],  exhibits  much  longer  Ni-S  distances  (2.41 
and  2.52  A)  and  a  larger  NiSNi  angle  of  105.3°. 

Figure  1(b)  shows  that  the  exchange  integral 
changes  sign  just  in  the  region  of  the  minimum  of 
the  potential  surface.  The  traditional  explanation 
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(a) 


(b) 


FIGURE  1 .  (a)  Contour  plot  for  the  total  energy  of  the  lowest  quintet  state  of  the  L4NiS2NIL4  complex.  ROHF,  basis  II 
without  f  functions  at  Ni.  The  lowest  contour  (dotted  line)  has  an  energy  of  -3826.225  a.u.,  the  spacing  of  the  contours 
is  0.01 0  a.u.  =  26.24  kJ  /  mol.  (b)  Contour  plot  of  the  exchange  integral  in  the  L4NiS2NiL4  complex.  The  spacing  of  the 
contours  is  5  cm full  contours  represent  negative,  dashed  ones  positive  values  of  J,  the  nodeline  is  dotted. 


[6, 18, 19]  is  that  the  magnetic  orbitals  are  orthogo¬ 
nal  at  an  NiSNi  angle  of  about  90°  or  slightly 
larger.  The  antiferromagnetic  superexchange  con¬ 
tribution  in  this  region  of  the  potential  energy 
surface  is  therefore  smaller  than  the  direct  cou¬ 


pling,  hence  the  coupling  is  ferromagnetic  in  the 
region  of  about  S5°  to  100°  [6]  and  antiferromag¬ 
netic  for  smaller  and  larger  angles.  This  explana¬ 
tion  is  corroborated  by  the  present  calculations. 
Figure  1(b)  shows  that  /  is  very  sensitively  depen- 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


637 


FINK,  WANG,  AND  STAEMMLER 


dent  on  the  geometry  of  the  bridging  unit,  there¬ 
fore  an  interpretation  of  experimental  data  for  /  is 
meaningless  unless  an  X-ray  structure  is  available. 

As  is  well  known  [18],  the  exchange  integral  / 
consists  of  two  contributions,  the  direct  ferromag¬ 
netic  exchange  coupling,  /^ ,  between  the  magnetic 
orbitals  and  the  antiferromagnetic  superexchange 
coupling,  /^p,  which  is  caused  by  an  admixture  of 
ionic  configurations  to  the  predominantly  covalent 
wave  functions  of  the  low-lying  spin  states  [11]: 

/  =  /f  +  /af  • 

The  ferromagnetic  contribution  /p  can  be  deter¬ 
mined  separately  in  a  first  step  from  a  modified 
VCI  calculation  in  which  the  active  orbitals  are 
localized,  e.g.,  by  a  Boys  localization  procedure 
[20],  and  only  the  "'neutral"  (covalent)  determi¬ 
nants,  i.e.,  those  determinants  in  which  exactly  two 
electrons  occupy  the  localized  c^-type  orbitals  at 
each  Ni  center,  are  included.  The  "ionic"  (charge 
transfer)  determinants  which  are  responsible  for 
the  superexchange  coupling  are  thus  excluded 
from  the  VCI  and  only  the  direct  exchange  cou¬ 
pling  }p  between  the  localized  magnetic  orbitals  is 
obtained.  Of  course,  the  value  of  Jp  depends  to  a 
certain  extent  on  the  localization  scheme. 

In  a  second  step,  the  full  VCI  calculation  con¬ 
taining  both  neutral  and  ionic  determinants  is  per¬ 
formed.  It  yields  the  total  exchange  splitting  /,  i.e., 
the  sum  of  Jp  and  /^p.  However,  as  long  as  ROHF 
or  CASSCF  orbitals  are  used  which  have  been 
optimized  for  the  predominantly  covalent  low-ly¬ 


ing  spin  states,  Jp  is  described  correctly  but  /^p  is 
largely  underestimated.  The  reason  is  that  the  en¬ 
ergies  of  the  ionic  states  calculated  with  nonre- 
laxed  covalent  orbitals  are  much  too  high.  Cl  cal¬ 
culations  with  single  and  double  excitations  into 
the  full  virtual  space  are  necessary  to  properly 
account  for  the  relaxation  effects  in  the  charge 
transfer  states  [10].  Our  previous  experience  with 
ACPF  and  MC-CEPA  calculations  for  various  bi- 
nuclear  systems  [6,  10-12]  has  shown  that  the 
antiferromagnetic  contribution  /^p  is  enhanced  by 
a  factor  of  2-3  after  inclusion  of  dynamic  correla¬ 
tion  and  relaxation  effects.  Stated  differently,  the 
relaxation  effects  lower  the  excitation  energies  of 
the  charge  transfer  states  by  as  much  as  a  factor  of 
2  or  even  more. 

In  order  to  avoid  the  very  time  consuming  ACPF 
calculations  at  every  point  of  the  two-dimensional 
potential  energy  surface  for  the  L4NiS2NiL4  com¬ 
plex,  we  have  simply  scaled  the  VCI  result  for  the 
antiferromagnetic  part  /^p  by  a  scaling  factor 
fixing  Jp  at  its  VCI  value.  ^  was  determined 
semiempirically  by  comparing  the  VCI  results  and 
ACPF  results  at  some  preselected  points.  Table  III 
shows  that  the  ACPF  results  can  be  reasonably 
well  simulated  by  a  scaled  VCI  exchange  interac¬ 
tion 

/sc  =  /f  +  ^/af 

with  a  scaling  factor  ^  =  2.0. 

Figure  2  shows  a  full  two-dimensional  contour 
plot  of  the  scaled  exchange  integral  J^^  iJ^sing 


TABLE  III _ 

Decomposition  of  the  exchange  integral  J  in  the  L4NiS2NiL4  compiex  in  a  direct  ( Jp)  and  a  superexchange 
(J^p)  contribution® 


Distance^ 

NISNi 

angle 

J(VC\y 

Jaf 

-  Jp  2'^‘JaF 

J(ACPF)' 

2.0 

CD 

O 

o 

-47.52 

24.80 

-72.32 

-119.83 

2.1 

90° 

-17.34 

16.19 

-33.52 

-50.86 

2.2 

90° 

-4.17 

11.08 

-15.25 

-19.42 

-19.7 

2.3 

90° 

0.99 

7.74 

-6.75 

-5.76 

2.2 

100° 

9.44 

14.43 

-4.99 

4.44 

8.9 

2.2 

80° 

-23.04 

9.66 

-32.70 

-55.75 

-50.3 

®  Basis  II,  without  f  functions,  J  in  cm 

*  Ni-S  distance  in  A. 

^  J  calculated  at  VCI  level. 

^  The  antiferromagnetic  contribution  to  J  was  scaled  by  a  factor  of  2.0  to  obtain  good  agreement  with  the  ACPF  results. 
®  ACPF  results. 
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^‘70  75  80  85  90  95  100 

NiSNi  angle 


FIGURE  2.  Same  as  Figure  1  (b),  the  antiferromagnetic  part  of  J  was  scaled  by  a  factor  of  2.0. 


consistently  ^  =  2.0.  The  comparison  between  the 
unmodified  VCI  results  in  Figure  1(b),  and  the 
scaled  VCI  results  in  Figure  2  shows  that  the 
ferromagnetic  part  of  /  remains  unchanged  while 
the  antiferromagnetic  part  of  /  is  strongly  en¬ 
hanced  after  the  inclusion  of  dynamic  correlation 
effects,  either  by  real  ACPF  calculations  or  by  the 
scaling  of  Therefore,  the  region  of  positive 
values  and  the  position  of  the  node  line  are  very 
similar  in  Figures  1(b)  and  2,  while  the  increase 
toward  larger  antiferromagnetic  values  of  /  for 
smaller  Ni-S  distances  and  bond  angles  is  much 
steeper  at  the  scaled  VCI  level. 

The  dependence  of  the  exchange  integral  /  on 
the  Ni-S  distance  and  on  the  NiSNi  angle,  found 
in  the  present  study  for  L4NiS2NiL4,  is  very  simi¬ 
lar  to  the  behavior  in  the  corresponding  O-bridged 
L4Ni02NiL4  complex  [6]. 


L3NiS3ML3 

We  did  not  calculate  full  two-dimensional  sur¬ 
faces  of  the  total  energy  and  the  exchange  integral 
for  the  triply  S-bridged  complex  L3NiS3NiL3.  The 
reason  is  that  due  to  the  use  of  the  nearly  charge 
neutral  model  ligands  our  L3NiS3NiL3  complex 


has  a  total  charge  of  -3.2  and  no  stable  equilib¬ 
rium  geometry  could  be  found.  If  the  Ni-S  distance 
is  fixed  to  the  experimental  value  of  2.36  A  ob¬ 
served  by  Beissel  et  al.  [4]  in  the  trinuclear 
NiS3NiS3Ni  unit,  the  optimum  NiSNi  angle  turns 
out  to  be  smaller  (-  71°)  than  the  experimental 
equilibrium  value  (79°  [4]).  The  electrostatic  repul¬ 
sion  between  the  three  negative  S^“  anions  is  too 
large  in  our  model  and  has  to  be  reduced  by  using 
either  singly  charged  SR“  anions  or  by  adding 
reasonable  counterions. 

The  behavior  of  /  as  a  function  of  the  Ni-S 
bond  length  and  the  NiSNi  angle  in  the  triply 
S-bridged  L3NiS3NiL3  complex  is  very  similar  to 
what  was  found  before  in  the  bibridged  complex. 
This  is  nicely  documented  in  Figure  3.  From  the 
similarity  of  the  curves  for  the  L4NiS2NiL4  com¬ 
plex  with  an  NiSNi  angle  ojf  80°  and  for 
L3NiS3NiL3  at  79°,  we  conclude  that  the  scaled 
VCI  results  for  /  as  presented  in  Figure  2  for  the 
bibridged  complex  can  as  well  be  used  to  predict 
the  magneto-structural  relations  in  triply  S-bridged 
Ni(II)  complexes. 

At  the  experimental  geometry  (R(Ni“S)  =  2.36 
A  with  an  NiSNi  angle  of  79°  [4])  we  obtained 
/  =  —9.5  cm“^  with  /p  =  7.0  cm”^  and  /^p  = 
- 16.5  cm  “L  If  we  again  scale  /^p  by  a  factor  of  2, 
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FIGURE  3.  Exchange  integral  J  as  a  function  of  the  Ni-S  distance  in  mono-,  bi-,  and  triply  S-bridged  binuclear  Ni(ll) 
complexes.  VCI  level,  basis  II. 


we  obtained  /  =  /f  +  2/^p  =  -26  cm“\  which  is 
in  agreement  with  the  experimental  value  /  = 
-27.0  cm“^,  reported  by  Beissel  et  al,  for  the 
trinuclear  NiS3NiS3Ni  complex  [4]. 


Summary  and  Conclusions 

In  the  present  study  we  calculated  the  exchange 
integral  /  in  mono-,  bi-,  and  triply  S-bridged  binu¬ 
clear  Ni(II)  complexes,  we  examined  the  geometry 
dependence  of  /  and  the  total  energy  of  the  com¬ 
plexes,  and  compared  the  results  to  oxygen- 
bridged  complexes.  Inclusion  of  the  3d  orbitals  at 
the  bridging  sulfur  ions  lowers  the  total  energy  of 
the  complexes,  but  has  nearly  no  influence  on  /. 
The  superexchange  mechanism  is  the  same  as  in 
oxygen-bridged  complexes.  Even  the  angular  be¬ 
havior  of  /  is  quite  similar.  The  complexes  are 
ferromagnetic  for  NiSNi  angles  between  85°  and 
100°  and  antiferromagnetic  for  smaller  and  larger 
angles. 

In  contrast  to  the  oxygen-bridged  Ni(II)  com¬ 
plexes  in  which  the  Ni-O  distances  are  quite  simi¬ 
lar  in  different  compounds,  the  experimentally 
known  values  for  Ni-S  distances  in  different  com¬ 
plexes  with  octahedrally  coordinated  Ni  vary  be¬ 
tween  2.3  and  2.5  A.  This  is  caused  by  very  shal¬ 
low  potential  surfaces.  Because  of  the  strong 


geometry  dependence  of  /  and  the  change  of  its 
sign  in  the  region  of  the  minimum  of  the  poten¬ 
tial  surfaces,  the  values  for  /  will  be  different  in 
different  complexes. 

In  order  to  avoid  the  time-consuming  ACPF 
calculations,  we  used  a  scaled  VCI  in  which  /^p 
was  weighted  with  a  factor  of  2.  With  this  method 
we  found  very  good  agreement  between  our  calcu¬ 
lated  value  for  /  in  a  triply  S-bridged  complex  and 
the  experimental  data  of  Beissel  et  al.  [4]. 
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ABSTRACT:  The  reaction  pathway  for  the  insertion  reaction  of  SnCl2  into  the  Pt — Cl 
bond  on  the  cfs-Pt(Cl)2(PH3)2  compound  was  investigated  at  the  ab  initio  MO  level  of 
theory.  The  optimized  structure  obtained  for  the  transition  state  indicates  that  this 
reaction  proceeds  through  a  three-center  transition  state,  and  the  formed  intermediate 
c/s-Pt(Cl)(PH3)2(SnCl3)  easily  isomerizes  to  the  trans-Ft(C\)(PH^)2(SnCl^)  compound. 
The  nature  of  the  bonds  was  investigated  with  the  charge  decomposition  analysis  (CDA) 
method  and  this  method  indicates  that  the  SnCl3  group  is  a  stronger  trans  director  than 
is  the  PH 3  group.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  643-650,  1997 


Introduction 

The  hydroformilation  of  olefins  catalyzed  by 
platinum  complexes  in  combination  with 
group  IV  A  metal  halides  is  of  great  interest.  These 
systems  have  been  shown  to  be  highly  active  and 
selective  in  hydroformilation  of  olefins  and  their 
catalytic  activity  is  comparable  to  that  industrially 
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used  and  well-known  rhodium  catalyst  [1],  As 
an  example,  catalyst  precursors  of  the  type  cis- 
Pt(Cl)2(PR3)2/  when  combined  with  SnC^,  have 
been  shown  to  be  useful  catalysts  for  the  hydro¬ 
genation  and  hydroformilation  of  primary  olefins 
[Ib-le].  Several  questions  related  to  this  catalyst 
are  still  unclear  or  unanswered.  For  example,  the 
fact  that  catalytic  activity  is  observed  only  in  the 
presence  of  SnCl2  [2]  suggest  that  the  Pt — Sn  bond 
plays  a  fundamental  role  in  the  catalytic  cycle. 
Therefore,  an  understanding  of  the  nature  of  the 
Pt — Sn  bond  is  very  important. 

The  ^^P  and  ^^^Pt  nuclear  magnetic  resonance 
(NMR)  characterization  of  the  complexes  which 
arise  in  solution  when  SnC^  reacts  with  cis-Ft 
(C1)2(PR3)2  was  studied  by  Pregosin  and  Sze  [3]. 
The  reaction  scheme  proposed  by  these  authors  (1) 
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for  this  reaction  indicates  that  the  insertion  of 
SnCl2  into  the  platinum/chlorine  bond  in  cis- 
Pt(Cl)2(PR3)2  is  followed  by  rapid  isomerization: 


PR, 

I 

p( 

I 

Cl 


R3P  Pt  Cl  +  SnClz 


PR,  PR, 

I  3  I  3 

R3P— Pt— SnCl3  Cl— Pt— SnClj  (1) 


Cl 

cis 


PR, 


trans 


SCHEME  1 


Despite  the  fact  that  ^^PH{^H}-  and  ^^^Pt-NMR 
spectroscopy  of  complexes  containing  platinum, 
tin,  and  phosphorus  provide  a  sensitive  probe  for 
complex  structures,  questions  related  to  the  elec¬ 
tronic  nature  of  the  Pt — Sn  and  Pt — P  bonds  are 
not  answered  directly  by  this  method. 

In  this  work,  we  investigated  the  reaction  path 
for  the  insertion  reaction  of  SnCl2  into  the  Pt — Cl 
bond  (Scheme  1  in  gas  phase  and  using  R  =  H)  at 
the  ab  initio  level  of  theory.  All  the  stationary 
points  present  on  the  potential  energy  surface  (PES) 
for  this  reaction  were  located  and  the  nature  of  the 
bonds  between  the  ligands  and  the  platinum  atom 
were  investigated.  The  trans  effects  of  the  SnCl3 
and  PH  3  ligands  are  discussed  and  this  may  give 
some  insight  into  how  the  elementary  reactions 
(olefin  insertion,  carbon  monoxide  insertion,  etc.), 
which  take  place  in  the  catalytic  cycle  for  hydro- 
formilation  of  olefins,  may  proceed. 


Method  of  Calculation 

Full-symmetry  unconstrained  geometry  opti¬ 
mizations  were  performed  at  the  RHF  level  of 
theory  using  the  ECP  and  valence  basis  sets  of 
Stevens,  Basch,  Krauss,  and  Jasien  (SBKJ)  [4]  for 
the  Pt  and  Sn  atoms,  6-31G*  [5]  basis  sets  for  the 
P  and  Cl  atoms,  and  a  3”21G  [5]  basis  set  for  the 
hydrogen  atoms.  The  tin  atom  can  expand  its  va¬ 
lence  shell  and  use  d  empty  orbitals  to  form  a  back 
coordination  of  the  type  d-d  with  the  platinum 
atom  [6].  Therefore,  to  analyze  the  role  of  the  d 
empty  orbitals  of  the  tin  atom  on  the  nature  of  the 
Pt — Sn  bond,  a  set  of  five  d  polarization  functions 
(a  =  0,180)  [7]  were  added  to  the  Sn  atom.  The 
stationary  points  located  on  the  PES  were  identi¬ 


fied  as  minima  (none  imaginary  frequencies)  or 
transition  states  (characterized  by  having  at  least 
one  imaginary  frequency)  through  harmonic  fre¬ 
quency  calculation.  To  obtain  better  energetic  re¬ 
sults,  we  carried  out  second-order  Moller-Plesset 
(MP2)  perturbation  theory  single-point  calcula¬ 
tions  on  the  RHF-optimized  geometries  with  the 
same  basis  set.  All  calculations  were  carried  out 
using  the  GAMESS96  [8]  and  GAUSSIAN94  [9] 
packages,  at  the  Laboratorio  de  Quimica  Computa- 
cional  e  Modelagem  Molecular  (LQC-MM),  Depar- 
tamento  de  Quimica,  Universidade  Federal  de  Mi¬ 
nas  Gerais  (UFMG). 


Results  and  Discussions 

The  optimized  structural  parameters  obtained 
for  the  reactants  [SnCl2  1  and  cis-Fi(Cl)2(FU.^)2  2], 
transition  state  (TS),  and  the  products  cis- 
Pt(C  1)(PH  3)2(SnCl3)  4  and  trans- 

Pt(Cl)(PH3)2(SnCl3)  5]  are  shown  in  Figure  1.  The 
optimized  angles  of  2,  4,  and  5  around  the  plat¬ 
inum  atom  are  a  little  distorted  from  the  expected 
optimal  value  of  90°  for  a  d^  square  planar  com¬ 
plex,  The  Pt — P  bond  distances  predicted  in  our 
calculations  are  in  good  agreement  with  the  exper¬ 
imentally  observed  Pt — P  bond  distances  of  2.284 
A  in  Pt(CH3)2(PCH3Ph2)2  [10]  and  2.303  A  in 
trans-Ft  (H)  (SnClg)  (PPh3)2  [11].  The  Pt— Sn  bond 
length  of  2.604  A  in  complex  4  and  2.576  A  in 
complex  5  are  in  good  agreement  with  the  experi¬ 
mental  Pt — Sn  values  of  2.634  A  in  trans-Ft(SnCl^) 
(COPh)  (PEt3)2  [12],  2.601  A  in  trans- 

Pt(H)(SnCl3)(PPh3)2  [11],  and  2.600  A  in  trans- 
Pt(H)(SnCl3)(PCy3)2  [13].  The  Sn— Cl  bond  dis¬ 
tances  of  2,331  and  2.350  A  in  complexes  4  and  5, 
respectively  (see  Fig.  1),  agrees  with  the  experi¬ 
mental  finding  of  2.283-2.367  A  in  the  complex 
trans-Pt(H)(SnCl3)(PPH3)2  [13].  The  structural  pa¬ 
rameters  calculated  for  the  transition  state  (see  Fig. 
1)  indicate  that  the  formation  of  compound  4  oc¬ 
curs  in  a  concerted  way,  where  the  Pt — Sn  bond  is 
forming  and  the  Pt — Cl  bond  is  breaking,  leading 
to  a  three-center  transition  state. 

It  is  well  known  that  bond  distance  is  a  very 
sensitive  parameter  in  describing  the  trans  influ¬ 
ence,  i.e.,  the  ability  of  a  ligand  to  labilyze  the 
bond  trans  to  it.  If  we  compare  the  bond  length  of 
compounds  4  and  5  (see  Fig.  1),  it  can  be  seen  that 
the  SnCl3  group  has  a  stronger  trans  influence 
than  has  the  PH  3  group,  because  the  Pt — Cl  bond 
in  compound  5  (2.411  A),  where  the  chlorine  is 
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r(Pt-C16)  =  2.364 

i 

r(Pt-P)  =  2.281(trans  to  SnCla) 
r(Pt-P)  -  2.340  (cis  to  SnCla) 
r(Pt-C13>  =  2.871 
r(Pt-Sn)  =  3.000 
r(Sn-C14)  «  2.421 
0(Pt-Sn-C13)  =  61.099 
e(Sn-C13-Pt)  =  66.186 
e(C13-Pt-Sn)  =  52.715 

FIGURE  1.  RFH-optimized  structural  parameters  for  the  SnClg  (1),  trans-Pt(CI)2(PH3)2  (2),  transition  state  (TS), 
c/s-Pt(CI)(PH3)2(SnCl3)  (4)  and  frans-Pt(CI)(PH3)2(SnCI)3)  (5)  molecules.  Bond  lengths  in  angstroms  and  bond  angles 
in  degrees. 


trans  to  SnCl3  group,  is  greater  than  is  the  Pt — Cl 
bond  in  compound  4  (2.373  A)  where  the  chlorine 
is  trans  to  the  PH 3  group.  It  can  also  be  seen  from 
Figure  1  that  the  Pt — P  bond  length  trans  to  the 
SnCl3  group  in  compound  4  (2.412  A)  is  greater 
than  is^  the  Pt — P  bond  length  in  compound  2 
(2.228  A).  This  fact  corroborates  our  assertion  that 
SnClg  is  a  stronger  trans  director  than  is  the  PH  3 
group. 

The  trans  influence  can  qualitatively  explain 
why  the  compound  cis-Ft(Cl)(FH^)2(SnClo),  4,  iso- 
merizes  to  the  trans-Ft(Cl)(FH2,)2(SnCl^),  5,  com¬ 
pound.  If  the  PH  3  ligand  stays  in  a  trans  position 


in  relation  to  the  SnCl3,  these  two  groups  will 
compete  unevenly  for  the  electron  density  from 
the  Pt  atom,  so  the  Phg  group  chooses  to  stay  in  a 
more  favorable  position  trans  to  another  PH3 
group  in  compound  5.  These  qualitative  argu¬ 
ments  will  be  discussed  in  a  quantitative  way 
later. 

The  total  energies  obtained  for  compounds  1-5 
are  shown  in  Table  I,  and  the  energy  diagram  for 
the  reaction  is  shown  in  Figure  2.  As  can  be  seen 
from  Figure  2,  the  energetic  trend  is  maintained 
for  the  reactants  (1  +  2)  and  products  (4  and  5) 
when  an  MP2  single-point  calculation  is  per- 
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(4) 


r(Pt-Cl) 

SI 

2.373 

r(Pt-Sn) 

- 

2.604 

r(Pt-P4) 

= 

2.412 

r(Pt-P5) 

2.307 

r(Sn-Cl) 

- 

2.331 

a  -  85.467 

P  =  83.645 

y  =  99.211 

8  =  91.678 

rfPt-Cl) 

= 

2.411 

r (Pt-P) 

=  ; 

2.350 

r(Pt~Sn) 

= 

2.576 

r(Sn-Cl) 

= 

2.350 

a  -  91.880 

P  =  86.074 

y  -  85.815  6  *  96.231 


FIGURE  1.  (continued) 


formed  on  the  RHF-optimized  geometries.  The  for¬ 
mation  of  intermediate  c/s-Pt(Cl)(PH3)2(SnCl3),  4, 
is  favored  by  an  amount  of  —37.5  kcal/mol  and 
the  occurrence  of  the  isomerization  cis- 
Pt(Cl)(PH3)2(SnCl3)  trans-Pt(Cl)(PH3)2(SnCl3) 
(4  5)  is  favored  by  -  7.0  kcal/mol.  The  MP2 

single-point  calculation  on  the  RHF-optimized  ge¬ 


ometry  for  the  transition  state  (TS)  structure  gave 
an  unexpected  result  -4.4  kcal/mol  for  the  differ¬ 
ence  between  the  energies  of  the  reactants  and  the 
transition  state  (see  Table  1  and  Fig.  2).  What  this 
result  shows  us  is  that  the  transition  state  is  more 
stable  than  are  the  reactants,  i.e.,  the  SnC^  inserts 
into  the  Pt — Cl  bond  without  a  barrier.  This  result 


TABLE  I _ 

Total  energy  values  obtained  for  the  stationary  points  on  the  potential  energy  surface  (Fig.  1). 


Energy  (au) 


Compound 

RHF^ 

MP2‘’ 

SnCIa  (1) 

-922.3586 

-922.6748 

R(CI)2(PH3)2  (2) 

-1722,8362 

-1723.4704 

TS 

-2645.1839 

-2646.1522 

c/s-Pt(CI)(PH3)2{SnCl3)  (4) 

-;2645.2359 

-2646.2050 

frans-Pt(CI)(PH3)2(SnCl3)  (5) 

-2645.2497 

-2646.2161 

®The  basis  set  used  Is  described  in  the  Method  of  Calculation  section. 
^  Single-point  calculation  at  the  RHF-optimized  geometry. 
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can  be  related  to  the  fact  that  the  formation  of  the 
TS  occurs  in  a  concerted  way,  where  the  Ft — Sn 
bond  formation  and  Ft — Cl  bond  breaking  occur 
at  the  same  time.  Therefore,  the  energy  involved  in 
the  Ft — Sn  bond  formation  is  sufficiently  high  to 
stabilize  the  TS  leading  to  an  insertion  without 
barrier. 


Nature  of  the  Bond 

The  nature  of  the  bond  in  transition-metal  com-* 
plexes  has  gained  much  attention  from  the  theoret¬ 
ical  point  of  view  since  the  appearance  of  the 
Dewar "Chatt”Duncanson  model  [14].  This  model 
considers  two  factors  responsible  for  stabilizing 
the  bond  in  transition-metal  complexes:  the  cr- 
donating  interactions  from  the  ligands  to  the  metal 
and  the  tt  back  donation  from  the  metal  to  the 
ligands. 

Several  methods  have  appeared  in  the  literature 
focusing  on  the  evaluation  of  the  energetic  contri¬ 
bution  to  the  bond  energy  as  well  as  investigating 
the  mixing  of  orbitals  in  the  region  of  interaction 
between  the  metal  and  ligand.  The  construction  of 
molecular  orbitals  in  terms  of  fragment  molecular 
orbitals  was  earlier  employed  by  Fukui  and  co¬ 
workers  [15]  in  a  method  called  coupled  fragment 
molecular  orbital  interaction  for  interacting  sys¬ 
tems.  Bagus  et  al.  [16]  developed  the  constrained 
space  orbital  variation  (CSOV),  where  the  wave 
function  of  the  complex  is  constructed  from  the 
molecular  orbitals  of  the  ligand  and  the  metal 
atom.  Ziegler  and  co-workers  developed  the  ex¬ 
tended  transition  state  (ETS)  method  [17],  based 
on  the  density  functional  formalism,  which  is  simi¬ 


lar  to  the  CSOV  method.  More  recently,  Frenking 
and  co-workers  developed  the  charge  decomposi¬ 
tion  analysis  (CDA)  [18]  using  fragment  molecular 
orbitals.  The  CDA  method  consists  of  using  linear 
combinations  of  fragment  orbitals  (LCFO)  of  prop¬ 
erly  chosen  fragments  A  and  B  for  the  interpreta¬ 
tion  of  the  interactions  in  a  molecule  AB.  The 
interaction  is  divided  into  three  main  contribu¬ 
tions:  (a)  the  mixing  between  the  occupied  orbitals 
of  A  and  empty  orbitals  of  B,  which  indicates  the 
magnitude  of  electron  donation  from  A  to  B  (A 
B),  (b)  the  mixing  between  the  occupied  orbitals  of 
B  and  the  empty  orbitals  of  A,  which  gives  the 
extent  of  back  donation,  i.e.,  the  electron  donation 
from  B  to  A  (A  <-  B),  and  (c)  the  mixing  between 
the  occupied  orbitals  of  A  and  the  occupied  or¬ 
bitals  of  B  which  indicates  the  extent  of  charge 
polarization  in  the  region  of  bonding.  Based  on 
these  points,  the  CDA  method  can  be  used  as  a 
quantitative  view  of  the  Dewar-Chatt-Duncanson 
model. 

To  analyze  the  nature  of  the  bond  in  complexes 
czs-Ft(Cl)(FH  3)2(SnCl3),  4,  and  trans- 

Ft(Cl)(FH3)2(SnCl3),  5,  we  used  the  CDA  method, 
as  implemented  in  the  CDA  2.1  program  [19].  The 
total  amount  of  donation,  back  donation,  and  bond 
energy  obtained  by  the  CDA  is  shown  in  Table  II. 
As  can  be  seen,  in  all  situations  analyzed,  the 
magnitude  of  donation  from  the  FH3  fragment  to 
the  Ft(Cl)(FH3)(SnCl3)  fragment  is  greater  than  is 
the  magnitude  of  back  donation,  which  indicates 
that  the  FH3  is  not  a  good  7r-acceptor.  The  other 
fact  that  contributes  to  the  low  extent  of  back 
donation  in  relation  to  the  electron  donation  is  that 
back  donation  is  favored  when  there  is  an 
electron-rich  metal,  which  is  not  the  present  case. 


TABLE  II _ 

Total  amount  of  donation,  back  donation,  and  bond  energy  for  different  chosen  fragments  A  and  B, 
obtained  by  the  CDA  method. 


Fragment 


Bond  energy 

S  donation  1  backdonatlon  (kcal  /  mol) 


A  B 

DU  3  D^ 


PH3" 

R(CI)(PH3)(SnCl3) 

0.597  e 

0.172  e 

52.0 

PH3'’ 

R(CI)(PH3)(SnCl3) 

0.563  e 

0.071  e 

26.0 

PHa" 

R(CI)(PH3)(SnCl3) 

0.614  e 

0.125  e 

46.2 

cis  to  SnClg  group  in  c/s-Pt(CI)(PH3)2(SnCl3),  4,  compound  (see  Fig.  1). 

^PHg  trans  to  SnClg  group  In  c/s-Pt{CI){PH3) 2(80013),  4,  compound. 

^PHg  trans  to  another  PH3  group  in  compound  frans-Pt(C()(PH3)2(SnCl3),  5,  compound  (see  Fig.  1). 
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The  platinum  atom  is  in  a  formal  oxidation  state  of 
+  2,  assuming  a  configuration.  Therefore,  de¬ 
spite  the  fact  that  PH  3  is  a  poor  77-acceptor  ligand, 
the  extent  of  back  donation  should  be  increased  if 
we  have  a  more  electron-richer  metal  atom. 

When  we  compare  the  extent  of  back  donation 
of  the  two  PH  3  groups  in  the  cis- 
Pt(Cl)(PH3)2(SnCl3)  complex,  4  (see  Fig.  1  and 
Table  II),  the  extent  of  back  donation  from  the  PH  3 
group  trans  to  the  SnCl3  is  diminished,  indicating 
that  the  SnClg  group  is  withdrawing  electron  den¬ 
sity  from  the  platinum  atom,  and  this  reduces  the 
electron  density  available  for  donating  to  the  PH  3 
ligand.  Based  on  this  argument,  the  extent  of  back 
donation  of  the  PH  3  trans  to  another  PH  3  group  in 
compound  trans-Fi(Cl)(FH2)2(SnCl^),  5  (see  Fig. 
1),  should  be  greater  than  the  one  found  to  the 
PH3  group  trans  to  SnCl3  in  complex  4  and  less 
than  the  extent  of  back  donation  of  the  PH  3  group 
cis  to  the  SnClg  group  in  compound  4  (see  Fig.  1 
and  Table  II).  The  results  of  CD  A  agree  with  these 
arguments  as  can  be  seen  in  Table  IL 

The  Pt — P  bond  energy  calculated  for  the  PH  3 
group  cis  to  the  SnCl3  group  (52.0  kcal/mol)  is  ca. 
26  kcal/mol  greater  than  is  the  Pt — P  bond  energy 
calculated  for  the  PH  3  group  trans  to  the  SnCl3 
group  (26.0  kcal/mol),  as  can  be  seen  in  Table  II. 
This  result  provides  a  semiquantitative  view  of  the 
trans  effect  of  the  SnCl3  group,  which,  of  course, 
could  change  if  a  correlated  calculation  is  used  to 
analyze  the  bond  energy. 


The  occurrence  of  cis  ->  trans  isomerization  in 
platinum  (II)  and  Palladium  (II)  phosphine  com¬ 
plexes  is  well  known  [20].  Three  mechanisms  can 
explain  this  reaction:  the  consecutive  displacement 
mechanism  [21],  the  Berry  pseudorotation  mecha¬ 
nism  [20,22],  and  the  dissociative  pathway  [23].  In 
addition,  changes  of  metal,  ligand,  solvent,  or  tem¬ 
perature  can  substantially  affect  the  equilibrium. 
Therefore,  the  czs-Pt(Cl)(PH3)2(SnCl3)2  ^  trans- 
Pt(Cl)(PH3)2(SnCl3)2  (4  5)  isomerization  reac¬ 

tion  needs  a  more  detailed  study  focusing  on  the 
solvent  and  ligand  influence.  A  theoretical  study 
on  this  isomerization  reaction  is  already  under¬ 
way.  Taking  into  account  the  strong  trans  influ¬ 
ence  of  the  SnCl3  (Table  II),  as  discussed  before, 
can  lead  to  a  possible  mechanism  for  the  isomer¬ 
ization  occurring  in  the  gas  phase.  This  may  be  a 
dissociative  pathway  through  T-shaped  three-coor¬ 
dinate  intermediates,  as  is  exemplified  in  Scheme  2 
below.  Some  kinetic  evidence  for  the  intermediacy 
of  a  three-coordinate  T-shaped  species  was  ob¬ 
tained  for  cis  and  frans-PdMe2(PR3)2  complexes 
[23]. 


Concluding  Remarks 

We  studied  the  insertion  reaction  of  SnCl2  into 
the  Pt — Cl  bond  and  the  resulting  complex  cis- 
Pt(Cl)2(PH3)2  using  ab  initio  MO  methods.  All  the 
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stationary  points  located  on  the  PES  were  fully 
optimized.  The  transition-state  structure  obtained 
indicates  that  the  reaction  proceeds  through  a 
three-center  transition  state.  The  optimized  struc¬ 
tures  are  in  good  agreement  with  experimental 
findings  for  similar  systems.  The  analysis  of  the 
nature  of  the  bonds  showed  that  the  SnCl3  ligand 
is  a  stronger  trans  director  than  is  the  PH  3  group, 
and  it  is  capable  of  weakening  the  Pt — P  bond 
trans  to  it  by  ca.  26  kcal/mol.  The  amount  of  back 
donation  of  the  PH  3  group  is  less  than  the  dona¬ 
tion,  indicating  that  the  PH  3  is  not  a  good  7r-accep- 
tor  ligand.  The  results  of  the  charge  decomposition 
analysis  gave  us  support  to  infer  with  the  cis- 
Pt(Cl)(PH3)2(SnCl3)  ^  trans-Ft(Cl)(FH^)2(SnCl^) 
isomerization  reaction  proceeds  through  a  disso¬ 
ciative  pathway  with  formation  of  T-shaped 
three-coordinate  intermediates.  We  think  that  the 
information  provided  about  this  cocatalyst  (strong 
trans  influence  of  SnCl3  and  the  formation  of  very 
stable  compounds  formed  when  tin  interacts  with 
platinum)  is  very  important  to  study  the  elemen¬ 
tary  reactions  for  the  hydroformilation  of  olefins 
using  this  heterobimetallic  complex.  Our  group  is 
currently  investigating  this  catalytic  cycle. 
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ABSTRACT:  Both  theoretical  and  experimental  studies  in  the  past  have  indicated  that 
the  charge  transport  in  doped  polyacetylene  is  due  primarily  to  traveling  charged  solitary 
waves  along  the  polymer  chain  backbone  accompanied  by  hopping  from  one  chain  to 
another.  The  conductivity  in  this  model  is  still  determined  by  a  band  gap.  The  nature  of 
the  ground  and  excited  states  of  the  doped  system,  however,  is  not  fully  understood. 
Previous  ab  initio  calculations,  based  on  transoctatetraene  simulating  transpolyacetylene 
interacting  with  a  single  iodine  atom  and  also  a  fairly  low  level  of  basis  set,  have  pointed 
to  the  possibility  that,  while  the  calculations  at  the  Hartree-Fock  level  puts  the  charge- 
transfer  state  below  the  ^'covalent"  state,  the  results  are  reversed  when  correlation  is 
included.  Apart  from  the  basis-set  inadequacy  and  the  small  size  of  the  polyene 
considered  in  this  study,  it  is  also  true  that,  in  solution,  iodine  exists  only  as  polyiodide 
ions  I3,  I5 ,  etc.  In  this  work,  we  address  some  of  the  above  deficiencies  of  the  model. 
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Introduction 

The  discovery  [1,  2]  that  poly  acetylene  can  be 
doped  to  near  metallic  conductivities,  using 
donors,  such  as  alkali  metals,  and  acceptors,  such 
as  I2  or  ASF5,  as  dopants  has  intensified  interest  in 
understanding  the  mechanism  of  charge  transport 
in  these  materials.  While  qualitatively  the  conduc¬ 
tivity  is  now  firmly  believed  to  result  from  charged 
solitary  waves,  as  first  proposed  by  Su,  Schrieffer, 
and  Heeger  (SSH)  [3],  the  description  of  the  ground 
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and  excited  states  participating  in  the  charge- 
transport  process  remains  a  theoretical  challenge. 

In  a  previous  work,  Das  et  al.  [4]  showed  that  if 
one  simulates  transpolyacetylene  (f-PA)  by 
transoctatetraene,  the  charge  environment  of  a  sin¬ 
gle  iodine  atom  interacting  with  the  polyene  goes 
through  a  complete  reversal  as  one  goes  from 
Hartree-Fock  to  a  correlated  calculation.  While  the 
Hartree-Fock  ground  state  is  a  charge-transfer 
state  with  iodine  acquiring  close  to  a  full  electron 
at  the  expense  of  the  polyene,  the  picture  changes 
radically  when  correlation  is  included;  the  ground 
state  now  is  essentially  a  neutral  iodine  interacting 
with  the  polyene,  while  the  excited  state  is  a 

CCC  0020-7608  /  97  /  050651  -04 


DAS  AND  DUDIS 


charge-transfer  state  CgHj^o*  Based  on  this 
finding,  it  was  proposed  that  the  charge  is  trans¬ 
ported  by  the  electrons  populating  these  excited 
states  via  excitons  representing  a  "'charged  soliton" 
traveling  along  the  length  of  the  polymeric  chains. 
A  quantitative  study  of  the  time  evolution  of  such 
processes  was  carried  out,  leading  to  a  high  diffu¬ 
sion  mobility  and,  consequently,  large  conduc¬ 
tivity. 

The  above  study,  however,  is  flawed  by  several 
deficiencies:  First,  the  polyene  considered  is  too 
small  with  potentially  large  end  effects.  Second, 
iodine  in  solution  is  found  to  exist  only  in  clusters 
such  as  1 3  and  1 5.  Also,  the  basis  set  used  in  the 
above  calculation  may  not  be  adequate  to  repre¬ 
sent  the  charge-transfer  states  via-a-vis  the  neutral 
states  even  though  the  correlation  level  considered 
may  have  been  right.  Moreover,  we  neglected  the 
effect  of  the  neighboring  polymer  units,  which  is 
expected  to  be  important  since  the  separation  be¬ 
tween  neighboring  units  (6-7A)  is  not  large  enough 
for  the  dopant  system  to  interact  solely  with  one 
unit.  In  what  follows,  we  analyze  all  the  above 
deficiencies  except  that  of  the  basis  set,  which, 
because  of  the  great  complexity  of  the  problem,  we 
defer  to  a  later  work,  retaining  for  the  present  the 
simplest  basis  used  in  our  previous  work,  namely, 
the  Hay-Wadt  ECP  basis  for  iodine  and  STO-3G 
for  the  rest. 


Choice  of  Our  Prototype  System 

Using  I3  instead  of  I  increases  the  complexity  of 
the  analysis  in  several  ways:  First,  we  have  a  much 
larger  number  of  valence  electrons  to  deal  with. 
Also,  the  determination  of  the  ground  and  excited 
states  has  to  take  into  account  the  fact  that  the 
geometry  of  1 3  is  very  different  from  1 3  in  addi¬ 
tion  to  the  large  geometry  change  the  polyene 
system  undergoes.  Since  1 3  is  really  12  +  I  and 
Hartree~Fock  gives  very  poor  results  for  the  bond¬ 
ing  in  1 2  (usually  predicting  no  binding  at  all),  it  is 
necessary  that  we  employ  all  excitations  involving 
all  the  valence  electrons.  At  the  Hay-Wadt  level  of 
the  ECP  basis,  the  various  quantities  of  interest 
pertaining  to  the  I2  +  I  problem  is  shown  in 
Table  I. 

We  shall  consider  C^2^u  ir^stead  of  CgFIi^  in 
this  work.  This  will  ensure  much  smaller  end 
effects  when  the  dopant  is  a  single  iodine  atom 
and  is  likely  to  be  adequate  for  I3. 


TABLE  I _ 

ECP  energies  of  atomic,  diatomic,  and  triatomic 
iodine  and  their  ions  at  the  level  of  Hay  -  Wadt 
ECP  basis  set. 


System 

Energy  (SCF) 
(Hartree) 

Energy  (MCSCF) 
(Hartree) 

-11.15723 

1  - 

- 1 1 .23469 

L 

-22.31185 

-22.34149 

-33.43640 

-33.48324 

-33.59687 

-33.61844 

I2  +  1 

-33.46908 

-33.49872 

Finally,  adding  to  the  above  complexity,  we 
have  also  to  include  in  our  discussion  the  interac¬ 
tion  of  the  iodine  molecule  with  both  the  adjacent 
polyene  systems  between  which  it  finds  itself. 
Thus,  it  will  be  necessary  to  calculate  or  estimate 
the  interaction  of  the  iodide  ion  if  and  when  it 
forms  not  only  with  the  polyene  that  it  shares  but 
also  the  neutral  one  on  its  other  side. 


General  Computational  Framework 

The  present  calculations  were  performed  using 
the  GAMESS  software  with  some  important  modi¬ 
fications.  In  our  earlier  work  on  the  interaction  of  a 
solitary  iodine  atom  with  transoctatetraene,  the 
active  space  selected  for  the  CASSCF  calculations 
consisted  of  the  eight  tt  valence  and  virtual  or¬ 
bitals  on  the  polyene  and  three  on  the  iodine  atom. 
The  geometry  optimization  using  these  CASSCF 
wave  functions  was  a  major  effort  but  it  was  still 
well  within  the  usual  time  and  "economic"  restric¬ 
tions.  In  the  present  case,  however,  the  active 
space  needs  to  be  enlarged  to  21  orbitals  (nine 
pertaining  to  the  iodine  atoms  and  12  to  the 
polyene  7r-space).  This  is  too  large  an  effort  to  be 
treated  in  a  straightforward  CASSCF  framework, 
the  number  of  configuration  running  to  millions. 
We  adopted  a  different  scheme  to  be  called  the 
modified  MP2  scheme  (MMP2)  to  extract  the  cor¬ 
relation  information. 

For  each  state,  we  carry  out  the  following  steps: 
(a)  We  first  construct  the  wave  function  for  the 
state  in  question  as  a  limited  CASSCF  involving 
only  the  Hartree-Fock  occupied  valence  orbitals  as 
the  active  space.  In  our  case,  this  simply  implies 
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inclusion  of  a  single  excitations  into  the  only  open 
shell  available.  The  main  purpose  of  using  the 
CASSCF  description  is  to  retain  the  nature  of  the 
input  orbitals  through  the  SCF  process  for  easy 
interpretability  both  for  the  ''neutral"  and 
"charge-transfer"  states,  (b)  The  bonding  and  anti¬ 
bonding  TT-orbitals  are  then  localized.  This  is  an 
important  step  and  the  details  are  found  in  [4]. 
This  reduces  the  magnitude  of  the  cross  terms  that 
are  neglected  in  the  present  MP2  process,  (c)  A 
first-order  Cl  is  then  performed  using  all  single 
excitations  into  the  given  virtual  space  from  the 
configurations  generated  in  the  first  step.  This  step, 
by  virtue  of  Brillouin's  theorem,  will  not  yield  any 
significant  contribution  for  the  state  that  has  been 
optimized  via  a  CASSCF  process  described  above. 
Ffowever,  this  will  affect  the  state  not  optimized, 
(d)  Now,  pairwise  correlation  contributions  are 
evaluated  for  each  double  excitation  for  both  the 
ground  and  excited  states.  In  all  the  calculations 
being  reported  here,  we  selected  for  our  virtual 
space  only  the  set  of  7r-antibonding  orbitals  be¬ 
longing  to  the  polyene. 


Results  and  Discussion 

In  Table  II,  we  compare  the  results  obtained 
from  rigorous  CASSCF  calculations  with  those 
based  on  our  approximate  MMP2  model.  Since  the 
quantities  of  interest  are  the  excitation  energies  of 
the  states  and  not  their  absolute  energies,  it  is 
evident  that  the  approximate  scheme  is  adequate 
for  our  purpose. 


In  Table  III,  the  interaction  potential  of  I3  ap¬ 
proaching  the  plane  of  C^2^u  vertically  is  shown 
in  the  ground  and  excited  states.  No  geometry 
reoptimization  is  carried  out  other  than  the  fact 
that  the  geometry  used  for  the  charge-transfer  state, 
which  is  the  ground  state  at  the  uncorrelated  level, 
is  that  of  the  cation  C  12^^14-  Similarly,  the  opti¬ 
mized  geometry  of  neutral  Ci2^u  used  for  the 
covalent  state,  which  is  the  midgap  excited  state  at 
the  Fiartree-Fock  level.  The  geometry  of  13  was 
frozen  at  the  minimum  geometry  of  the  ion  I3 . 

In  Table  IV,  we  show  the  RHF  interaction  ener¬ 
gies  of  1 3  in  its  linear  optimized  geometry  as  it 
approaches  the  plane  of  a  neutral  polyene  molecule 
C12H14.  In  Table  V,  we  show  a  balance  sheet  of 
energies  based  on  our  MMP2  calculations  for  I3. 
Again,  the  geometries  used  for  the  neutral  and 
charge-transfer  states  are  those  of  C^2^u 
Ci2H^4  ,  respectively.  The  charge-transfer-state  en¬ 
ergy  is  corrected  upward  because  of  a  net  repul¬ 
sion  from  the  neighboring  polyene  unit  assumed 


TABLE  III _ 

MMP2  energies  vs.  distance  (R)  of  I3  from  the  plane 
of  the  polyene  0^2^14’  length  of  I3  is  taken 
parallel  to  the  polyene  chain  axis. 


R 

(angstroms) 

Energy 

Ground  state 
(Hartree) 

Excited  state 
(Hartree) 

2,94 

-490.43777 

-490.38256 

3.44 

-490.45474 

-490.39861 

3.94 

-490.45388 

-490.40027 

4.44 

-490.44918 

-490.39996 

TABLE  II 


Comparison  of  rigorous  CASSCf  with  MMP2:  columns  2  and  4  represent  the  CASSCF  results  for  the  ground- 
state  and  first-excited-state  energies,  while  columns  3  and  5  are  the  corresponding  MMP2  results. 

R 

(angstroms) 

Energy 

Ground  state 

Excited  state 

CASSCF 

(Hartree) 

UMP2 

(Hartree) 

CASSCF 

(Hartree) 

MMP2 

(Hartree) 

2.5 

-316.23030 

-316.18606 

-316.17745 

-316.12908 

3.0 

-316.23979 

-316.20431 

-316.19560 

-316.15834 

3.5 

-316.23999 

-316.20677 

-316.19101 

-316.16276 

4.0 

-316.23893 

-316.20625 

-316.18479 

-316.15795 

4.5 

-316.23831 

-316.20577 

-316.17734 

-316.15097 
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TABLE  IV _ _ 

Energies  as  Ig"  approaches  the  neutral  C12H14;  R  Is 
the  perpendicular  distance  of  of  l3~  from  the  plane 
of  the  polyene  (the  length  of  I3  is  taken  parallel  to  the 
polyene  chain  axis). 


R 

RHF  energy 

(angstrom) 

(Hartree) 

2.94 

-490.36435 

3.44 

-490.37127 

3.94 

-490.37145 

4.44 

-490.37144 

-490.40382 

TABLE  V 

Balance  sheet  of  energies  of  the  charge-transfer 

and  covalent  states  for  the  I3  + 

system 

calculated  at  the  optimized  geometries  of  0^2^14 

and  C^2H.,4,  respectively. 

Energies 

Charge-transfer 

state 

Neutral  state 

(Hartree) 

(Hartree) 

MMP2  energy 
Correlation  error 

-490.45474 

-490.40027 

in  I3  and  I3  and 
Repulsive 

-0.02157 

-0.02964 

correction  from 
neighbor 

0.03255 

-0.00227 

Geometry 

correction 

-0.03268 

Total  energy 

-490.44376 

-490.46486 

o 

separated  from  the  first  by  6.88  A.  The  neutral-state 
energy  is  corrected  downward  since  the  lowest 
energy  geometry  of  1 3  is  readily  iodine  and  1 2  well 
separated. 

It  is  clear  that  within  the  approximation  of  the 
basis  set  used  in  this  work  as  well  as  subject  to  the 
omission  of  "'atomic"  correlation  corrections  in  the 
calculations  our  present  model  predicts  that  with 
I3  as  the  dopant  the  energy  of  the  charge-transfer 
state  is  slightly  higher  than  that  of  the  neutral 
state.  When  the  atomic  correlation  terms  are  in¬ 
cluded,  which  essentially  to  improve  the  electron 
affinity  of  iodine  by  nearly  1  eV,  these  two  states 
promise  to  be  near-degenerate. 
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ABSTRACT:  We  propose  a  two-parameter  model  Hamiltonian  to  analyze  the  low- 
energy  dynamics  of  endohedral  C^o  fulierene  complexes  such  as  Li ^@€50/  Na'^@C6o/ 
CO@C(^Q,  LiF@C60/  and  LiH@C5o-  The  simplicity  of  the  model  is  a  direct  consequence  of 
very  strong  constraints  imposed  by  both  high  symmetry  and  close  confinement  conditions, 
on  the  spherical  anisotropy  of  the  guest-cage  interaction.  In  the  parameter  space  region 
expanded  by  the  selected  systems,  the  guest  low-energy  dynamics  undergo  transitions 
from  almost  free  rotations  to  more  or  less  hindered  rotations.  These  dynamics  produce 
very  particular  features  in  the  low-temperature  rotational  spectra,  which  can  be  taken, 
therefore,  as  fingerprints  of  a  particular  guest-cage  interaction.  ©  1997  John  Wiley  &  Sons, 
Inc.  Int  J  Quant  Chem  65:  655-663,  1997 
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Introduction 

The  recent  discovery  of  the  fullerenes  has  open 
a  new  field  in  science  and  technology,  joining 
physicists  and  chemists  in  an  effort  to  understand 
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the  many  interesting  possibilities  that  these  new 
molecules  offer,  both  in  the  gas  and  the  fullerite 
solid  phases.  Among  these  possibilities  we  find 
those  related  with  the  use  of  these  carbon  aggre¬ 
gates  as  microscopic  cavities  to  encapsulate  atoms 
and  even  small  molecules.  Some  experiments  have 
succeeded  in  producing  traces  of  metallofullerenes 
(Li,  Na,  K,  Rb,  Cs,  Ca,  Ba,  La, . . .  )  [1,  2],  rare-gas 
fullerenes  [3, 4],  and  diatomic  fullerenes  (CO@C5g) 
[5]. 

The  solid  phases  of  these  endohedral  complexes 
(endohedrally  doped  fullerites)  become,  in  general, 
significantly  more  stable  than  the  fullerites  with 
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exohedral  doping.  On  the  other  hand,  the  very 
interesting  electric  and  magnetic  properties  that 
have  been  found  in  exohedral  fullerites  are  also 
expected  to  occur  in  their  endohedral  counterparts. 
Moreover,  very  interesting  applications  have  been 
suggested  for  these  particular  materials.  As  an 
example,  we  mention  Cioslowski  and  Nanay ak- 
kara's  proposal  [6]  for  endohedral  fullerites  of  po¬ 
lar  molecules  (CO,  LiF,  LiCl,  NaF,  NaCl),  which 
may  rotate  more  or  less  freely  depending  on  the 
dipole-dipole  interaction  and  on  the  temperature. 
These  endohedral  fullerites  would  ''constitute  a 
new  class  of  ferroelectric  materials  and  the  first 
practical  realization  of  ideal  electric  dipolar  lat¬ 
tices,"  which  could  show  low-temperature  phase 
transitions  between  dipole  ordered  and  disordered 
states.  The  transition  temperatures  estimated  by 
these  authors  are  in  the  range  25-60  K. 

All  these  potential  applications  have  stimulated 
many  theoretical  studies  to  elucidate  the  electronic 
structure  of  these  compounds  and  the  nature  of 
the  guest-cage  interaction.  The  traditional  theoret¬ 
ical  methods  of  molecular  physics  and  quantum 
chemistry  have  been  proved  to  be  very  useful  in 
establishing  new  results.  Besides,  great  experimen¬ 
tal  efforts  are  currently  devoted  to  obtain  endohe¬ 
dral  species  in  macroscopic  quantities,  and  a  very 
promising  technique  has  been  reported  recently 
[7].  In  the  meanwhile,  the  results  from  these  theo¬ 
retical  and  experimental  work  have  motivated  fur¬ 
ther  theoretical  studies  on  dynamical  aspects  such 
as  the  rotational-vibrational  motion  of  the  confined 
species.  These  dynamics  are  going  to  affect  some 
of  the  fullerite  properties.  For  instance,  a  more 
accurate  treatment  of  the  phase  transitions  pre¬ 
dicted  by  Cioslowski  and  Nanayakkara  would  re¬ 
quire  knowledge  of  this  motion.  Vibrational-rota¬ 
tional  spectra  are  a  primary  source  of  information 
on  these  dynamics,  but  experimental  data  of  this 
kind  are  not  yet  available. 

Ab  initio  calculations  for  different  trapped 
species  have  produced  some  sparse  data  on  the 
guest-cage  interaction;  they  provide  mainly  infor¬ 
mation  on  the  equilibrium  positions  and  vibra¬ 
tional  frequencies  for  the  confined  atom  or 
molecule.  When  the  interaction  is  of  physical  na¬ 
ture,  as  may  be  the  case  when  the  guest  species  is 
a  closed-shell  atom  or  molecule,  this  may  be  fairly 
estimated  using  semiempirical  electrostatic  argu¬ 
ments.  In  this  way  analytical  expressions  have 
been  proposed  [8-11],  which  reproduce  rather  ac¬ 
curately  [8,  9]  the  experimental  and  ab  initio  [12, 
13]  data.  All  this  information  on  the  nature  of  the 


guest-cage  interaction  has  made  possible  several 
attempts  to  simulate  theoretically  the  vibrational- 
rotational  spectra  of  some  endohedral  fullerenes. 

The  first  analyses  [14-18]  dealt  with  confined 
atoms  and  used  a  spherical  representation  of  the 
atom-cage  interaction  in  which  the  anisotropic 
terms  were  neglected.  Within  this  approximation 
the  rotational-vibrational  spectrum  for  on-center 
endohedral  fullerenes  is  reduced  to  that  of  an 
isotropic  anharmonic  three-dimensional  (3D)  oscil¬ 
lator  [16,  17],  and  the  corresponding  spectrum  for 
off-center  fullerenes  is  that  of  an  effective  diatomic 
molecule  [14,  15,  18].  The  spherical  representation 
of  the  atom-cage  interaction  is  in  general  well 
justified  for  on-center  atom  complexes  at  low  vi¬ 
brational  energies,  but  not  for  diatomic  guests  and 
off-center  atoms.  Classical  trajectory  studies  for 
Ne@C6o  [19]  and  classical  simulations  of  the  dipole 
spectra  of  Li’^CgtC^o  [20]  anticipated  the  relevance 
of  the  interaction  anisotropy.  This  has  been  con¬ 
firmed  systematically  by  our  group  in  a  series 
of  publications  [21-25]  in  which  we  simulate  di¬ 
pole  and  Raman  rotational-vibrational  spectra  for 
closed-shell  confined  atoms  and  diatomic  mole¬ 
cules.  We  have  found  that  the  most  significant 
manifestations  of  the  interaction  anisotropy  appear 
in  the  low  temperature  (T  50  K  and  lower)  spec¬ 
tra.  Low  anisotropic  interactions  are  found  for  the 
CO@C^o  Na^@C6o  complexes,  whose  spectra 
suggest  the  dominance  of  almost  free  rotations. 
The  more  anisotropic  interactions  of  Li^@C^Q, 
LiF@C60/  and  LiH@C6o  give  rise  to  spectra,  whose 
assignment  requires  the  existence  of  librations  and 
hindered  rotations. 

In  this  study  we  will  show  that  all  these  spectra 
can  be  understood  in  terms  of  only  two  scaled 
parameters,  which  determine  the  relative  magni¬ 
tude  of  the  anisotropic  part  of  the  guest-cage 
potential.  We  will  consider  isolated  complexes,  but 
many  of  our  conclusions  can  be  easily  extended 
with  little  change  to  the  corresponding  fullerites. 
For  completeness  we  will  also  review  some  of  our 
previous  results.  In  the  following  section  we  pre¬ 
sent  an  effective  Hamiltonian  describing  the  rota¬ 
tional  motion  of  endohedral  complexes  of  atoms 
and  diatomic  molecules.  The  third  section  is  dedi¬ 
cated  to  a  systematic  analysis  of  the  eigenvalue 
spectrum  of  this  Hamiltonian  as  a  function  of  the 
two  scaled  parameters.  In  the  fourth  section  we 
show  how  to  use  this  information  in  order  to 
assign  the  features  of  the  low-temperature  rota¬ 
tional  spectra.  A  further  discussion  and  the  main 
conclusions  are  presented  in  the  last  section. 
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Effective  Scaled 
Rotational  Hamiltonian 

The  different  time  scales  involved  in  the  dy¬ 
namics  of  endohedral  complexes  of  atoms  and 
diatomic  molecules  can  be  used  to  obtain  an  effec¬ 
tive  Hamiltonian  for  the  low-frequency  motion  in 
which  we  are  interested.  After  separating  the  elec¬ 
tronic  degrees  of  freedom  within  the  Born-Op- 
penheimer  approximation,  the  next  shorter  time 
scales  correspond  to  the  cage,  guest,  and  relative 
guest-cage  vibrational  motions  [21-23].  At  suffi¬ 
ciently  low  energies  (up  to  £  150  cm"^)  the 

complex  will  be  in  the  ground  state  of  these  mo¬ 
tions,  and  a  good  approximation  describing  the 
remaining  dynamics  can  be  provided  by  the  effec¬ 
tive  rotational  Hamiltonian  [22,  23] 

1  H 

^rot  =  r;  (1) 

where  K  =  J  —  1  is  the  cage  angular  momentum 
operator,  which  is  the  difference  between  the  total 
angular  momentum  J  and  the  angular  momentum 
1  associated  with  the  motion  of  the  confined  atom 
or  molecule.  4  ^  is  the  cage  moment 

of  inertia,  with  Mq  being  the  carbon  mass  and  R^ 
the  cage  radius  (3.55  A);  is  the  guest  moment  of 
inertia,  and  is  an  effective  potential  depending 
on  the  cage  (Euler  angles  a,  j8,  y)  and  guest  (0,  </>) 
orientations.  This  Hamiltonian  represents  two  ro¬ 
tors  coupled  by  kinetic  and  potential  terms.  and 
are,  in  principle  H-dependent  operators  due  to 
centrifugal  distortion.  However,  this  dependence 
has  been  found  to  be  unimportant  in  all  cases 
studied  so  far  [21-23],  and  only  very  fine  details 
depend  on  it;  thus  we  will  neglect  it  in  the  follow¬ 
ing  analyses. 

In  Ref.  [23]  we  established  that  this  Hamiltonian 
is  valid  not  only  for  confined  atoms  with  large 
enough  eccentricity  in  their  equilibrium  positions, 
but  also  for  diatomic  molecules  with  axial  equilib¬ 
rium  configurations;  in  this  case,  the  relative  di¬ 
atom-cage  vibrational  motion  must  also  be  in  its 
ground  state.  The  axial  equilibrium  configuration 
is  found  in  all  known  examples  of  endohedral 
complexes  of  diatomic  molecules. 

The  relatively  large  value  of  the  cage  moment  of 
inertia  makes  a  full  quantal  treatment  of  the 
cage  rotation  a  hard  task;  for  instance,  the  number 
of  /  values  involved  in  the  rotational  spectra,  even 


at  the  low  temperature  considered  here  would  be 
very  large  (/  up  to  300).  However,  this  same 
large  ^  value  helps  in  providing  further  approxi¬ 
mations.  The  simplest  one  assumes  an  infinite-mass 
cage,  and  leads  to  the  Hamiltonian 

+  (2) 

where  =  h^/2l^  is  the  effective  rotational  con¬ 
stant.  This  depends  on  the  equilibrium  configura¬ 
tion  of  the  guest  species,  which  is  practically  deter¬ 
mined  by  the  spherically  isotropic  part  of  the 
guest-cage  interaction  [21,  22].  The  values  cal¬ 
culated  for  our  endohedral  complexes  are  given  in 
Table  I. 

It  is  known  that  there  is  a  critical  temperature, 
T  250  K,  in  fullerites  for  the  transition  between 
nonrotating  to  almost  free  rotating  fullerene  mole¬ 
cules.  For  the  low  temperatures  considered  in  this 
work,  then,  the  motion  of  the  fullerene  molecules 
should  be  practically  frozen,  and  the  model  in  Eq. 
(2)  would  be  then  the  adequate  one.  In  this  case 
Vgff  should  also  account  for  additional  interactions 
due  to  the  crystal  lattice.  As  will  be  discussed 
later,  these  additional  terms  are  expected  to  be 
small  when  the  density  of  empty  cages  in  the 
fullerite  is  large  enough. 

If  not  fully  quantally,  the  effects  of  the  cage 
rotation  can  be  included  semiclassically  with  high 
reliability.  This  has  been  done  for  the  determina¬ 
tion  of  rotational  spectra  in  Refs.  [21-23].  It  is 
assumed  there  that  the  C^q  cage  rotates  freely  with 
a  classical  and  constant  angular  momentum.  One 
then  finds  that  the  main  features  of  these  rota¬ 
tional  spectra  are  already  predicted  by  the  Hamil¬ 
tonian  in  Eq.  (2).  So  we  will  concentrate  our  atten¬ 
tion  in  the  analysis  of  this  simple  model.  The 
particular  effects  of  the  cage  rotation  will  be  dis¬ 
cussed  in  the  fourth  section. 


TABLE  I _ _ 

Rotational  constant  and  scaled  potential  parameters 
for  different  endohedral  complexes. 


Guest 

Sg  (cm  T 

C| 

^10 

Li-^ 

1.45 

78.77 

80.22 

Na-** 

1.35 

7.89 

0.78 

CO 

1.78 

5.46 

0.22 

LiH 

1.82 

53.65 

30.76 

LiF 

1.27 

80.94 

31.58 
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The  reliability  of  our  theoretical  analysis  now 
depends  on  the  accuracy  of  cage-guest  effective 
potential  surface  which  belongs  to  the  totally 
symmetric  representation  of  the  4  icosahe- 
dral  group.  As  discussed  in  Ref.  [22],  confinement 
and  symmetry  constrain  enormously  the  form  of 
the  potential.  A  natural  expansion  of  the  potential 
surface  is  done  in  spherical  harmonics  (L,  M); 
then  the  4  symmetry  restricts  severely  the  L 
values  participating  in  such  an  expansion  (L  = 
0, 6, 10, 12, . . . );  furthermore,  if  electrostatic  forces 
(dispersion-repulsion,  cage  polarization,  etc.)  dom¬ 
inate  the  interaction  (as  in  the  closed-shell  ions  and 
molecules  considered  in  our  work)  the  strong  con¬ 
finement  will  make  such  interaction  rather  smooth. 
As  a  result  of  all  these  constraints,  it  turns  out  that 
practically  only  L  =  0, 6,  and  10  manifolds  are 
required  in  the  expansion  of  the  semiempirical 
expressions  for  (relative  errors  are  ^5%  at 
most).  The  1  =  0  term  is  a  constant;  the  L  =  6, 10 
manifolds  are  reducible  representation  of  the  4 
point  group.  When  reduced,  each  one  gives  just 
one  Ag  representation.  Therefore,  we  may  accu¬ 
rately  represent  the  effective  interaction  with  the 
expression 

Veff  =  Co  Vo  +  Co  Vo  +  CioVio,  (3) 

where  are  coefficients,  and  V^^  represents  the 
irreducible  representation  coming  from  the  L 
manifold;  these  functions  are  chosen  to  be  real  and 
normalized  as  fV^  dil  =  1.  Some  of  their  values 
are  shown  in  Figure  1. 

After  scaling  and  redefining  the  energy  origin, 
Eqs.  (2)  and  (3)  lead  to  the  expression 

H*,  =  l*^  +  Co*Vo  +  CfoVio,  (4) 

where  -  W/B„  1*  =  l/h,  C*  = 

C„/B^.  In  conclusion,  only  the  two  parameters  C* 
and  Cfo  determine  the  low-energy  rotational  dy¬ 
namics  in  our  model.  Their  values,  for  the  systems 
selected  in  our  work,  are  given  in  Table  I. 

A  similar  model  has  been  proposed  for  the 
rotational  dynamics  of  atom-spherical  top  van  der 
Waals  complexes  [26],  to  represent  situations  op¬ 
posite  to  ours,  namely  those  in  which  the  spherical 
top  is  much  lighter  than  the  atom.  The  meaning, 
range,  and  effect  of  these  model  parameters  is  thus 
quite  different  from  ours. 

Since  the  dimension  of  the  parameter  space  of 
Hamiltonian  (4)  is  just  two,  a  systematic  analysis 
of  the  effect  of  the  anisotropy  on  the  eigenvalue 


6“ 

FIGURE  1.  Values  of  the  Vg  and  terms  for  the 
guest-cage  interaction.  They  are  given  as  a  function  of 
the  polar  angle  0  for  azimutal  angle  </>  =  0.  The  z  axis 
has  been  chosen  along  the  C5  fullerene  symmetry  axis 
and  one  of  the  neighbor  C2  symmetry  axis  lies  on  xz 
plane.  The  first  maximum  in  corresponds  to  the 
middle  of  a  short  C— C  bond  and  the  second  one  to  a 
long  C — C  bond.  The  four  absolute  minima  in  both 
panels  correspond  to  the  hexagon  centers. 

spectrum  of  this  Hamiltonian  can  be  readily  car¬ 
ried  out.  This  will  be  presented  in  the  following 
section. 


Analysis  of  the  Rotational  Eigenvalue 
Spectrum  in  the  Parameter  Space 

Figure  2  presents  the  parameter  space  expanded 
by  the  endohedral  complexes  selected  in  our  work. 
A  different  symbol  indicates  the  position  of  each 
system.  We  will  now  study  the  evolution  of  the 
low-energy  eigenvalues  following  the  four  straight 
lines  drawn  in  this  figure.  The  corresponding  cor¬ 
relation  diagrams  are  given  in  Figures  3-6.  The 
spherical  harmonic  basis  set  has  been  used 

to  diagonalize  the  scaled  Hamiltonian  in  Eq.  (4). 
Here,  /  values  up  to  30  have  been  included  to  get 
converged  spectra.  In  Figure  3  we  notice  that  the 
eigenvalue  spectrum  for  Na''’@Cgo  corresponds  to 
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C6* 


FIGURE  2_  Parameter  space  of  the  model  Hamiltonian 
in  Eq.  (4).  Different  symbols  indicate  the  position  of  the 
endohedral  complexes  selected  in  our  work.  Along  the 
lines  drawn  on  this  figure  we  have  calculated  the  energy 
correlation  diagrams  In  Figures  3  to  6. 

that  of  a  slightly  perturbed  free  rotor.  In  particular, 
each  I  manifold  splits  according  to  the  different 
irreducible  representations  of  the  4  symmetry 
group,  with  practically  no  mixing  between  differ¬ 
ent  1  manifolds  (basically  the  same  spectrum  is 
obtained  for  CO@C6o  [24,  27]). 


FIGURE  3.  Energy  correlation  diagram  as  function  of 
the  C|  parameter  calculated  along  the  line  in  Figure 
2. 


FIGURE  4.  Energy  correlation  diagram  as  function  of 
the  parameter  calculated  along  the  Pg  In  Figure 
2. 


In  going  from  to  Li’^@C6o,  mixing 

between  different  I  manifolds  increases  and  a  sin¬ 
gle  "quasi-band-gap"  structure  emerges  at  the  bot¬ 
tom  of  the  energy  scale.  The  same  structure  ap¬ 
pears  in  the  eigenvalue  spectrum  of  LiH@C6o  and 
LiF@C6o  (see  Figs.  4  and  5),  and  of  any  system 
lying  in  parameter  space  expanded  by  these  three 


FIGURE  5.  Energy  correlation  diagram  as  function  of 
the  parameter  calculated  along  the  line  in  Figure 
2. 
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FIGURE  6.  Energy  correlation  diagram  as  function  of 
the  C|  parameter  calculated  along  the  line  in  Figure 
2. 


complexes.  Whenever  the  quasi-band  is  formed,  it 
always  contains  20  states  grouped  in  the  six  4 
symmetry  manifolds  ®  ©  T2„  0  0 

0H^.  This  number  coincides  with  the  number  of 
hexagon  wells  in  the  guest”cage  potential  surface. 
Thus  the  band  represents  trapping  of  the  guest 
species  in  the  hexagon  wells  (librational  motion); 
tunneling  connects  different  wells  and  accounts  for 
the  observed  bandwidth.  A  look  at  the  correspond¬ 
ing  eigenfunctions  confirms  these  conclusions: 
These  states  present  the  largest  amplitudes  around 
the  hexagon  wells  of  the  potential  [24,  25].  Above 
the  gap,  no  other  bands  can  be  recognized  in  the 
eigenvalue  spectra  of  any  system  lying  in  parame¬ 
ter  space  around  and  between  the  three  anisotropic 
complexes;  this  is  consistent  with  the  fact  that  the 
hexagon  well  is  not  deep  enough  to  support  more 
than  one  quantum  state;  thus  instead  of  additional 
bands,  a  kind  of  "'quasi-continuum"  of  difficult 
assignment  appears.  The  energy  at  which  the 
quasi-continuum  starts  represents  the  onset  of 
less-localized  motion  (hindered  rotations).  This  re¬ 
sult  is  confirmed  by  the  delocalized  character  of 
the  wave  functions  for  states  lying  in  this  energy 
region.  An  exception  to  this  rule  appears  in  a  set  of 
12  states,  separated  in  the  four  symmetry  species 
0  0  T2„  0  whose  wave  functions  pre¬ 

sent  localization  around  the  12  pentagon  centers. 
This  localization  is,  however,  not  as  perfect  as  the 
one  found  around  the  hexagon  wells;  some  mixing 


takes  place  with  delocalized  states.  Around  the 
pentagon  centers  the  guest-cage  potential  takes 
maximum  values;  a  local  minimum  may  exist  but 
not  deep  enough  to  support  a  state.  By  decreasing 
the  magnitude  of  the  Q  parameter  we  can  in¬ 
crease  the  depth  of  this  well,  which  may  now  be 
able  to  support  one  state.  This  is  what  can  be 
observed  in  Figure  6:  a  pentagon  band  emerges 
out  of  the  manifold  of  delocalized  states  at  low  C* 
values.  When  this  band  hides  in  the  delocalized 
manifold  of  states,  mixing  takes  place,  but  most  of 
the  pentagon  character  is  preserved,  except  at  the 
infrequent  avoided  crossings,  where  that  character 
may  be  shared  by  another  state.  The  whole  situa¬ 
tion  is  that  of  a  kind  of  "quasi-resonance  band." 
This  is  a  quasi-band  analog  of  the  separatrix  and 
saddle  point  quasi-resonances  found  in  simple  sys¬ 
tems  [28].  It  is  known  that  this  localization  phe¬ 
nomenon  is  quite  stable  and  robust  against  pertur¬ 
bations,  as  can  be  confirmed  in  our  case  by  the 
correlation  diagrams. 

At  high  enough  energies  mostly  free  rotations 
should  dominate  the  dynamics;  for  instance,  this 
happens  in  Li‘^@C6o  for  angular  momentum 
quantum  numbers  higher  than  /  ~  20. 


Examples  of  Rotational  Spectra 

The  features  of  the  eigenvalue  spectrum  and 
eigenfunctions  analyzed  in  the  previous  section 
are  going  to  determine  the  structure  of  the  low- 
temperature  rotational  spectra,  which  may  now  be 
considered  as  fingerprint  of  a  particular  guest-cage 
interaction.  Low-temperature  dipole  and  Raman 
rotational  spectra  for  these  endohedral  complexes 
have  been  calculated  in  Refs.  [21-25].  As  an  exam¬ 
ple  we  present  in  Figures  7  and  8  the  dipole 
spectra  (T  =  50  K)  for  the  Na‘^@C6o  and  Li"^ 
complexes  as  a  function  of  the  reduced  frequency 
oA  =  hco/B^,  These  spectra  are  the  result  of  a 
convolution  with  a  Lorentzian  whose  full  width  at 
half  maximum  is  A  co*  =  0.2.  While  panels  (a)  cor¬ 
respond  to  the  infinite-mass  cage  model  of  Eq.  (4), 
panels  (b)  present  the  effect  to  the  cage  rotation 
when  this  is  treated  semiclassically  as  indicated 
earlier.  Note  that  the  rotation  of  the  cage  enhances 
the  free  rotation  of  the  low-anisotropy  Na‘^@C6o/ 
and  broadens  the  librational  bands  of  the  more 
anisotropic  Li'^@C6o.  This  is  a  general  result  estab¬ 
lished  in  Refs.  [22,  23];  there  we  showed  that  this 
opposite  effect  is  the  result  of  the  competition 
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FIGURE  7.  Dipole  rotational  spectra  for  Na'^OCgo 
calculated  at  7  =  50  K.  Intensities,  in  arbitrary  units,  are 
given  as  a  function  of  scaled  frequencies.  Panel  (a)  was 
obtained  with  the  infinite-mass-cage  Hamiltonian  in  Eq. 
(2).  Panel  (b)  was  calculated  using  a  semiclassical 
approximation  for  the  cage  rotation  (Ref.  [23]).  In  all 
cases  a  Lorentzian  of  full  width  0.2  has  been  used  to 
convolute  the  spectrum. 

between  a  centrifugal  term  that  couples  the  guest 
rotation  to  the  cage  rotation  and  the  anisotropic 
terms  of  the  guest-cage  interaction.  When  cen¬ 
trifugal  coupling  dominates  over  anisotropic  po¬ 
tential  coupling,  the  latter  is  effectively  reduced; 


0  20  40  60  80 


FIGURES.  Same  as  Figure  7  for  Li  ■^Cgo- 


the  result  is  a  sharper  rotational  spectrum.  On  the 
contrary,  if  the  anisotropic  potential  coupling  dom¬ 
inates  as  in  the  librational  bands,  the  cage  rotation 
adds  an  extra  broadening,  which  physically  trans¬ 
lates  into  a  high  tendency  for  the  guest  species  to 
follow  the  cage  rotation;  as  was  shown  in  Refs.  [22, 
23],  this  rotational  broadening  can  be  accurately 
simulated  by  a  smoothing  of  the  infinite-mass-cage 
spectrum  with  a  convoluting  function  whose  width 
is  the  estimated  cage-rotational  broadening  1.5 
cm”^  at  T  =  50  K). 

The  spectrum  for  Na'^@C5o  confirms  the  almost 
free-rotor  nature  of  its  eigenvalue  spectrum.  It 
shows  the  2  (just  2  in  reduced  units)  oscillation 
typical  of  a  diatomic  molecule.  Practically  equal 
spectrum  is  obtained  for  CO@C6o.  On  the  other 
extreme,  the  spectrum  of  Li'^(a)C6o  shows  wide 
librational  bands.  The  less  intense  half  of  the  spec¬ 
trum  is  basically  due  to  transitions  within  the 
hexagon  quasiband  structure  of  the  eigenvalue 
spectrum;  the  right  and  more  intense  half  is  caused 
by  transition  out  of  this  band  into  the  states  lying 
above  the  gap.  Most  of  the  high-intensity  lines  can 
be  assigned  to  transitions  ending  in  one  of  the  12 
states  of  the  quasi-resonance  pentagon  band.  This 
is  due  to  the  fact  that  the  threshold  for  hindered 
rotation  is  determined  by  the  high  potential  re¬ 
gions  around  the  pentagon  centers.  The  complexes 
LiH@C60  hiF@C5o  show  similar  features. 


Discussion  and  Conclusions 

We  have  obtained  a  two-parameter  scaled  model 
Hamiltonian  which  may  represent  rather  accu¬ 
rately  the  low-energy  dynamics  of  endohedral 
complexes  of  off-center  atoms  and  small  diatomic 
molecules.  The  simplicity  of  the  model  is  a  direct 
consequence  of  the  high  symmetry  of  the  cage,  the 
strong  confinement  conditions,  and  the  mostly 
physical  nature  of  the  guest-cage  interaction  in 
the  selected  systems  Na'^@C50/  Li^@C6o/  CO@ 
C^o,  LiH@C5o,  LiF@C6o.  This  same  simplicity  has 
allowed  us  to  perform  a  systematic  analysis  of  the 
structure  of  the  eigenvalue  spectrum  for  the  model 
in  the  parameter  space  relevant  to  those  endohe¬ 
dral  complexes.  Energy  correlation  diagrams  as  a 
function  of  the  parameters  show  transitions  be¬ 
tween  basically  two  types  of  structures.  On  one 
side  we  have  a  parameter  region  around  Na^@C6o 
and  CO(®C6o  complexes  in  which  the  model 
eigenvalue  spectrum  and  the  corresponding  low- 
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temperature  rotational  spectra  are  like  those  of 
slightly  perturbed  free  rotors.  In  the  higher  param¬ 
eter  region  where  Li'^@C60/  LiH@C60/  and 
LiF@C6o  the  effects  of  the  guest-cage  interac¬ 
tion  anisotropy  becomes  more  relevant;  a  potential 
well  appears  around  the  hexagon  centers  deep 
enough  to  support  a  quantum  state,  which  causes 
the  appearance  of  a  single  quasi-band-gap  struc¬ 
ture  at  the  bottom  of  the  energy  scale,  whose  states 
show  large  amplitudes  at  the  hexagon  wells.  This 
band  should  therefore  be  associated  with  libra- 
tional  motion  within  these  wells.  Above  the  energy 
gap  we  find  the  threshold  of  hindered  rotation  and 
more  delocalized  states.  Embedded  in  this  region 
we  have  found  a  quasi-resonance  band  with  states 
showing  large  amplitudes  around  the  pentagon 
centers,  which  are,  in  general,  regions  of  unstable 
equilibrium.  By  changing  the  relative  magnitude 
of  the  model  parameters  one  can  stabilize  this 
regions  and  convert  the  quasi-resonance-band  into 
a  quasi-band  separated  by  energy  gaps  from  the 
rest  of  the  states.  We  have  seen  that  the  low 
temperature  rotational  spectra  of  the  system  in  this 
region  can  be  mostly  assigned  in  terms  of  transi¬ 
tions  within  the  hexagon  band  and  out  of  this 
band  into  the  pentagon  quasi-resonance-band. 

All  this  analysis  has  been  carried  out  for  a 
model  that  assumes  that  the  cage  rotation  is  frozen. 
However,  we  have  seen  that  this  rotation  does  not 
change  the  features  in  the  rotational  spectra  which 
are  related  to  the  anisotropy  of  the  guest-cage 
interaction,  on  the  contrary,  it  sharpens  the  transi¬ 
tion  between  the  low  and  high  anisotropy  features. 
Actually  one  could  realize  rotating  and  nonrotat¬ 
ing  cage  cases  using  endohedrally  doped  fullerites. 
First,  take  into  account  the  fact  that  fullerites  pre¬ 
sent  a  transition  (T  250  K)  between  a  phase  in 
which  the  rotation  of  the  individual  fullerene 
molecules  is  hindered  and  a  phase  in  which  this 
rotation  is  isotropically  free,  and  that  below  T  ^  90 
K  all  these  motions  are  frozen  [29].  If  the  density  of 
doping  is  low  enough,  one  can  disregard  the 
dipole-dipole  interaction  appearing  between 
nonempty  different  cages;  then  the  spherical 
anisotropy  of  the  crystal  field  acting  on  a  particu¬ 
lar  guest  is  going  to  be  practically  determined  by 
its  cage,  since  the  distance  between  cages  is  rela¬ 
tively  large  ( ^  10  A).  So,  if  we  lower  the  tempera¬ 
ture  sufficiently  (T  <  90  K),  we  can  realize  our 
low-energy  two-parameter  model.  Above  the  criti¬ 
cal  temperature  for  free  cage  rotation  (T  250  K) 


we  should  observe  the  effects  of  this  rotation  on 
the  rotational-vibrational  spectra,  but  the  tempera¬ 
tures  required  would  make  necessary  the  use  of  a 
model  which  treats  higher  frequency  motions  not 
included  in  our  simple  model.  The  best  low-tem¬ 
perature  experimental  realization  of  the  rotating 
case  could  be  done,  of  course,  using  gas-phase 
techniques. 

Low-temperature  spectra  become,  therefore,  fin¬ 
gerprints  of  a  particular  guest-cage  interaction, 
and  thus  a  primary  experimental  source  of  infor¬ 
mation  on  this  interaction.  This  interaction  thus 
determines  the  low-energy  rotational  dynamics  of 
the  confined  species,  on  which  many  of  the  proper¬ 
ties  of  endohedral  fullerites  may  depend,  e.g.,  the 
critical  temperatures  (25-60  K)  for  the  dipole- 
ordered  phases  predicted  by  Cioslowski  and  Na¬ 
nay  akkara  [6]. 

The  fact  that  the  guest-cage  interaction  in  com¬ 
plexes  studied  in  our  work  has  a  dominantly  phys¬ 
ical  nature  allowed  us  to  use  semiempirical  ex¬ 
pressions  for  it.  Strong  confinement  and  symmetry 
constraints  reduce  enormously  the  effects  of  possi¬ 
ble  inaccuracies  and  thus  increase  the  reliability  of 
the  calculations.  We  believe  that  these  results  could 
be  valid  as  well  for  interaction  of  more  chemical 
nature,  i.e.,  these  same  strong  constraints  may  re¬ 
duce,  also  in  this  case,  the  number  of  relevant 
parameters  to  just  the  same  two  ones  which  come 
from  the  L  =  6,  10  terms  of  the  expansion  of  the 
potential  in  spherical  harmonics.  In  this  way  on 
could  extend  our  analysis  to  off-center  complexes 
such  O@C60.  In  this  case,  an  epoxy-type  (C — O 
— C)  bond  has  been  suggested  under  the  short  C 
— C  fullerene  bond  [20].  A  situation  like  this  can 
be  produced  with  our  model  by  choosing  a  nega¬ 
tive  Cfo  coefficient.  Therefore,  chemical  interaction 
complexes  would  explore  other  regions  of  our 
two-parameter  space.  However,  the  higher  libra- 
tional  frequencies  expected  for  chemical  bonding 
may  make  necessary  the  joint  treatment  of  these 
modes  with  other  vibrational  modes  which  were 
excluded  in  our  model. 
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ABSTRACT:  We  extended  the  dynamic  response  theory  in  the  Moller-Plesset 
perturbation  theory  (MPPT)  based  on  the  quasi-energy  derivative  (QED)  method  for 
closed-shell  systems  to  that  for  open-shell  systems.  In  this  study  we  perform  the 
calculations  of  frequency-dependent  polarizabilities  ai  —  w;  w)  for  nondegenerate  open- 
shell  doublet  systems  (Li,  Na,  and  K  atoms  and  BeH,  MgH,  CaH,  CN,  and  NH2 
molecules)  in  the  second-order  Moller-Plesset  perturbation  theory  (MP2)  starting  with 
time-dependent  restricted  open-shell  Hartree-Fock  (TDROHF)  approximation.  ©  1997 
John  Wiley  &  Sons,  Inc,  Int  J  Quant  Chem  65:  665-677,  1997 


Introduction 

With  the  rapid  development  of  laser  spec¬ 
troscopy  in  experiment,  the  dynamic  re¬ 
sponse  properties  are  great  topics  in  quantum 
chemistry  now.  For  closed-shell  systems,  there  are 
results  of  calculations  of  frequency-dependent  po¬ 
larizabilities  and  hyperpolarizabilities  at  the  not 

Correspondence  to:  T.  Kobayashi. 


only  time-dependent  Hartree-Fock  (TDHF)  level 
[1“3]  but  also  correlated  level  [4-13]  of  theory. 
Sasagane,  Aiga,  and  Itoh  have  proposed  the 
quasi-energy  derivative  (QED)  method,  which  is  a 
more  general  method  to  define  the  dynamic  re¬ 
sponse  property  [14].  In  the  QED  method,  the 
frequency-dependent  polarizability  is  defined  as 


0)) 


d^W(t) 

—  co)  de^(o)) 
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where  Wit)  is  quasi-energy  and  eg(w)  is  the  B-di- 
rection  electric  field  strength  of  the  Fourier 

component.  Aiga,  Sasagane,  and  Itoh  [15]  have 
formulated  the  frequency-dependent  polarizabili¬ 
ties  and  hyperpolarizabilities  for  closed-shell  sys¬ 
tems  at  the  second-order  Moller-Plesset  perturba¬ 
tion  theory  (MP2)  level  based  on  the  QED  method 
and  Aiga  et  al.  [16]  have  calculated  them  with 
consideration  of  the  existence  of  poles  proper  to 
many-body  perturbation  theory  (MBPT)  pointed 
out  by  Hattig  and  Hefi  [17]. 

On  the  other  hand  with  respect  to  dynamic 
response  properties  for  open-shell  systems,  there 
have  been  only  a  few  reports,  in  contrast  with  the 
case  of  closed-shell  systems.  Hettema  and  Wormer 
[18]  have  calculated  the  frequency-dependent  po¬ 
larizabilities  and  determined  van  der  Waals  coeffi¬ 
cients  from  frequency-dependent  polarizabilities 
with  imaginary  frequency  by  the  time-dependent 
restricted  open-shell  Hartree-Fock  (TDROHF) 
method.  Kama  [19]  has  formulated  a  time-depen¬ 
dent  spin-unrestricted  Flartree-Fock  (TDUHF)  the¬ 
ory  and  has  performed  the  calculations  of  dynamic 
nonlinear  response  properties  (up  to  third  order 
[  pi-o)^;  ca2)])  by  the  TDUHF  method.  Pipin 

and  Bishop  [20]  have  calculated  the  dynamic  po¬ 
larizability  of  Li  atoms  by  the  sum-over-states 
(SOS)  method  based  on  the  combined  config¬ 
uration  interaction  (Cl)  Hylleraas  method. 
Very  recently,  Jaszuhski  and  Rizzo  [21]  have 
presented  the  dynamic  polarizabilities  a(-w;w) 
and  second  hyperpolarizabilities  for  the  dc-Kerr 
effect  yi-(o)  (o,0,0)  of  Li  atoms  by  the  time- 
dependent  multiconfiguration  self-consistent  field 
(TDMCSCF,  which  has  usually  been  referred  to  as 
MCTDHF)  method.  However,  the  frequency-de¬ 
pendent  polarizabilities  for  open-shell  systems  at 
the  correlated  level  of  theory  have  not  been  re¬ 
ported  yet  with  the  exception  of  Li  atoms.  The 
calculations  of  polarizabilities  for  open-shell  sys¬ 
tems,  except  for  Li  atoms  at  the  correlated  level  of 
theory,  have  been  performed  only  in  a  static  case 
by  numerical  energy  derivative  (finite-field) 
method  [22-25].  Unfortunately,  the  dynamic  re¬ 
sponse  properties  can  be  calculated  only  by  analyt¬ 
ical  approach.  However,  it  is  also  very  important 
to  estimate  both  the  frequency  dependency  and 
electron  correlation  of  polarizabilities  for  open-shell 
systems  at  the  same  time.  We  extended  the  dy¬ 
namic  response  theory  in  the  Moller-Plesset  per¬ 
turbation  theory  (MPPT)  based  on  the  QED  method 
for  closed-shell  systems  to  that  of  open-shell  sys¬ 
tems  [26]. 


In  this  work  we  perform  the  calculations  of 
frequency-dependent  polarizabilities  for  nonde¬ 
generate  open-shell  doublet  systems  (Li,  Na,  and  K 
atoms  and  BeH,  MgH,  CaH,  CN,  and  NH2 
molecules)  at  the  TDROHF  level  and  MP2  starting 
with  TDROHF  wave  function  (TDROHF  -F  MP2) 
level  based  on  the  QED  method  and  estimate  the 
electron  correlation  contribution  by  MP2.  There  are 
some  kinds  of  definitions  of  orbital  energy  in 
ROHF-based  perturbation  theory  with  respect  to 
electron  correlation.  We  employed  the  definition  in 
open-shell  perturbation  theory  1  (OPTl),  which 
has  been  proposed  by  Murray  and  Davidson  [27, 
28],  as  the  diagonal  blocks  of  Fock  matrix  (orbital 
energy  matrix).  These  are  the  first  calculations  of 
frequency-dependent  polarizabilities  for  open-shell 
systems  except  for  Li  atoms  at  the  correlated  level 
of  theory. 


Theory 

In  this  section  the  dynamic  response  theory  for 
open-shell  systems  at  the  MP2  level  of  theory 
based  on  the  QED  method  [26]  is  briefly  showed. 

For  convenience,  we  consider  the  high-spin 
open-shell  systems.  We  use  indices  {/,/},  {p^q}, 
{a,  £»},  and  {r,  s]  to  denote  closed  (doubly  occupied), 
open  (singly  occupied),  virtual,  and  arbitrary  or¬ 
bitals,  respectively,  and  abbreviate  "closed,"' 
"open,"  and  "virtual"  to  "C,"  "O,"  and  "V," 
respectively.  The  TDROHF  wave  function  is  ap¬ 
proximated  as  a  single  determinant: 

I^TDROHf(^))  ^  *”  4^p4^q  **■  I 

closed  open 

=  exp[K(f)]|ROHF>,  (2) 

where 

k(0  =  (3) 

rs 

and 

Ers  =  E  4a  ^  {(VC),  (OC),  (VO)}  .  (4) 

(T 

is  the  TDROHF  orbital,  is  the  generators  of 
unitary  group,  are  creation  and  annihila¬ 

tion  operators  with  the  spin  index  a.  Here  we 
introduce  the  following  notation: 

Ea  -  Rrs/  =  Xrs(0  (5) 
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and 

£_,  ^E;,,  k_,(0=-k*(0.  (6) 


defined  over  the  open  orbitals;  h  is  the  one-elec¬ 
tron  part  of  the  Hamiltonian,  can  be  ex¬ 

pressed  by  matrix  representation: 


Then  /c(0  can  be  rewritten  as 
K(t)  =  Y, 

i  A 

In  the  following  notation,  we  regard  |HF>  as 
IROHF),  Applying  the  time-dependent  variational 
principle  (TDVP)  [29,  30]  to  I^^tdrohf^^))  condi¬ 
tion  of  the  orthonormality  of  TDROHF  orbitals,  we 
obtain  TDROHF  equation  in  the  first-quantized 
form  in  case  that  the  definition  in  open-shell  per¬ 
turbation  theory  1  (OPTl)  proposed  by  Murray 
and  Davidson  [27,  28]  is  used  as  the  diagonal 
blocks  of  Fock  matrix  (orbital  energy  matrix), 

^  se(CO,V) 

re(C,0,V),  (8) 


where 


g’ 

& 

T 

w 

'  (F^)cc 

iF^°)co 

(F^)cv^ 

/f 

(F^°)oc 

(F^)oo 

(F°)ov 

•r 

,  (f^)vc 

o 

o 

(F^)vv, 

6^^  can  be  written  in  the  second  quantized  expres¬ 
sion  as 


(CO  =  U 


-  z  — 
dt 


®/<T  / 


dt 


|HF>,  (14) 


(oo)  e^^  =  {ci>_ 


pc  _  /. 


dt 


<i>. 


f  OPTl  =  [(1  -  p)Pp^  +  p^Pil  -p)  +  CpJ 

I 

+  E  P/F‘^°P/  +  (1  -  p)E^(l  -  p)-  (9) 

hi 


=  ^E<HFllfl^ 


qa-  f 


P^' 


e""  H  -  z— 
dt 


|HF>,  (15) 


In  Eq.  (9),  p^,  Pq,  and  p  are  defined  as 

Pc  =  E  Po  =  E  l<Ap><<^pl/ 

ZeC  p^O 

P=  E  l<^r><<^rl  =  EP/  (10) 

reC,0  / 

and  J(/)  summation  runs  over  all  occupied  (closed 
and  open)  space  and  F^,  and  F^^  are  given  by 

rO 

=  h  +  2J^  -  (11) 

rO 

F°  =  F^-—,  (12) 

K° 

pco  =  pc  p  (j3) 

where  and  are  the  Coulomb  and  exchange 
operators  defined  over  the  closed  orbitals  and 
and  K°  are  the  Coulomb  and  exchange  operators 


(yy)  p„,  =  U, 


1 


F^  -  i- 


=  rE<HF| 


dt 


*ba-> 


e"''  H-  f- 


(cy)  P,„  =  U, 


/ 

d 

Ui 

F^  -  i— 
dt 

= 

=  0, 


(oy)  =  {<!>, 


dt 


|HF>,  (16) 


dt 


4>a 


=  <HF| 

=  0, 


|e^  E, 

at 


|HF> 

(17) 

|HF> 

(18) 
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(CO)  = 


=  <HF| 

=  0. 


d 

u 

pco  _ 

dt 

<!>. 


|HF> 

(19) 


energy  W(t)  with  respect  to  V 

!<!>)  =  E  13), >  =  IDtdrohf)  +  E  !<&,>.  (25) 

/=0  /=1 
cc  oc 

W(f)  =  E  W,  =  Wtdkohf  +  E  W,,  (26) 


Equations  (17),  (18),  and  (19)  are  so-called  Brillouin 
conditions.  As  the  orbital  mixing  parameters  in  the 
same  orbital  blocks  have  no  effects  on  variational 
principles,  we  can  neglect  them.  Though  we  can 
get  canonical  TDROHF  orbitals  by  diagonalizing 
the  TDROHF  orbital  energy  matrix,  we  use  non- 
canonical  TDROHF  orbitals  because  of  avoiding 
the  possibility  of  singularity  in  the  orbital  mixing 
parameters  in  the  same  orbital  blocks  [15].  The 
zeroth-order  Hamiltonian  in  OPTl  starting  with 
TDROHF  wave  function  becomes 

^OPTl  ^  ^pOPTlQ)  _  (20) 

i 

The  time-dependent  Schrodinger  equation  in 
which  an  overall  time-dependent  phase  factor  is 
isolated  is  given  by 

|H-f^J|<l)>  =  |<I)>W(0,  (21) 

where  W(0  is  quasi-energy.  The  total  Hamiltonian 
H  is  assumed  to  be  written  as 

H  =  Ho  +  Hi,  (22) 

where  Hq  is  nonrelativistic  Born-Oppenheimer 
electron  Hamiltonian  of  the  isolated  system  and 
time-dependent  perturbed  Hamiltonian  is 

given  as 

n 

Hi=-  E  E  e„(ft),)M„exp(-/ft);0.  (23) 

i=  -n  =  y,  z 

is  an  a-component  of  electric  dipole  moment 
operator  and  e^(wy)  is  the  electric  field  strength  of 
Fourier  component.  We  assume  a  periodicity 
in  Hi  as  is  known  in  many-mode  Floquet  theory 
[31]. 

In  the  MPPT  starting  with  TDROHF  wave  func¬ 
tion,  Hamiltonian  H  is  partitioned  into 

H  =  F  +  y,  (24) 

where  F  ==  E/F^^^KO.  In  Eq.  (21),  we  introduce 
the  perturbation  expansions  of  |0)  and  quasi- 


which  result  in  the  nth-order  equations 

n-1 

=  El^/>W„_„  n>l,  (27) 

with  the  intermediate  normalization  condition 
=  1.  Writing  |<i>i)  as 
exp(K)\K}t^  and  multiplying  Eq.  (27)  with 
n  ^  I  frojji  the  left  by  <L|exp(-K),  we  obtain 
the  equation  for  determining  the  amplitude 
of  the  first-order  MP  wave  function: 

LGiKh  -  iii  +  (L\e-'<Ve'^\HP)  =  0,  (28) 

K 


where 


=  <L|G|K> 


^rs^rs  -  ^0 


R  ,  (29) 


\K)  and  <L|  are  all  doubly  excited  configuration 
state  functions  (CSFs)  from  |ROHF)  and  (ROHF|, 
respectively.  The  existence  of  Eq.  (28),  which 
cause  the  poles  proper  to  the  MBPT,  have  been 
pointed  out  by  Hattig  and  Hefi  [17].  Multiplying 
Eq.  (27)  with  n  =  2  from  the  left  by  (OqI  ^ 
<HF|exp(“K),  we  obtain  the  MP2  quasi-energy 
correction 


W2  =  E<HFk-'^WlK>f,..  (30) 

K 


We  introduce  and  ti,  which  are  Lagrange  mul¬ 
tipliers  for  equations  for  the  TDROHF  orbital 
rotation  parameters  and  for  the  amplitudes 
respectively.  Then  the  MP2  quasi-energy  La- 
grangian  L  is 


E  ~  I^TDROHF 

K 

+  LhlLG.KtK  -  Hl  + 

r  \  V  / 


+  E  Ca<hfi 

i  A 


dt 


|HF>. 


(31) 
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The  permanent  electric  dipole  moment  and 
frequency-dependent  polarizability  a^gC  — w;  w)  at 
the  ROHF  +  MP2  level  of  theory  based  on  the 
quasi-energy  derivative  method  are  defined  as 


dUt) 


I^A 


SeAO) 


a^gC-w;  ft))  =  - 


deA  —  o))  deAoi)) 


(32) 

(33) 


and 


=  ^  E  E  <HF|[ flL  Aa,^,  A]]  ^  |HF>< LI EJK} 
-  E  E<HF|[flL,[fl,.,,  A]]  JHFXLIX) 

/  e  C  a 

-\lL  E<HF|[4,,[a,,,A]]  JHFXLIK). 

^  p^O  a 

(38) 


and  explicitly  given  as 

=  (HFIMJHF)  +  E 

L,  K 

+  E  e„m^] ihf>,  (34) 

i  A 

=  <HF|[M^,K^(a;)]|HF> 

-  E<HF|[y<^  K^(-co),  K®(co)]lK>^f 

K 

-  E L|[ y k^(-(o),kH ct))]|HF> 

L 

L,K 

K 

+  —  0)) 

K 

-  E ^.®(hf| [e„j^\- ft),  ft))] |hf) , 

±  A 

(35) 

where  is  an  A  component  of  electric  dipole 
moment  operator,  and  we  have  used  the  symmet¬ 
ric  commutator 

[A;B,C]  =K[[A,B],C]  +  [[A,C],B])  (36) 

and  have  defined  the  following  notations: 

/^^(-o),  0))  =  ”  [M^,  K^io))]  “  [Mg,  k:^(-o>)] 
-h  [Hq;  /<^(“  w),  K^(a>)] 

+  0)[  K^(  —  0)),  K^(o))]  (37) 


In  Eqs.  (34)  and  (35),  and  are  the 

zeroth-order  of  Ig,  and  and  k'^(o>),  t^(o>), 
and  t^((o)  are  first-order  of  k,  and  fg  in  the 
perturbation  expansion  with  respect  to  electric 
field,  respectively.  These  amplitudes  and  multipli¬ 
ers  are  determined  by  the  time-averaged  varia¬ 
tional  condition  for  Lagrangian  8{L}j  =  0.  is 
the  zeroth-order  fluctuation  potential. 


Computational  Details 

The  matrix  elements  in  the  second  quantization 
language  appearing  in  Eqs.  (31)  and  (32)  are  calcu¬ 
lated  automatically  by  using  the  program  [32]  in 
the  algebraic  programming  language  REDUCE3. 
The  program  for  calculation  of  frequency-depen- 
dent  polarizability  for  open-shell  system  at  the 
TDROHF  +  MP2  level  of  theory  is  connected  to 
Sasagane's  TDROHF  program. 

BASIS  SETS 

The  primitive  Gaussian  basis  sets  for  Li,  Be,  and 
H  on  MgH  and  BeH  are  derived  from  Sasagane's 
smooth-tempered  basis  sets  (14s),  (10  s),  and  (6s), 
respectively.  A  series  of  exponents  in  smooth-tem¬ 
pered  basis  set  is  given  by  p,  y,  8)  = 

(k  =  1,2, ...,N).  We  used  the  opti¬ 
mized  parameter  sets  (a,  j8,  y,  6)  =  (0.023455420, 
2.3106846, 17.03123,  0.6247924)  for  Li,  (0.053619160, 
2.7557931,  6.0954213,  0.50286011)  for  Be,  and 
(0.079899519,  2.6231810,  8.3605059,  0.56192361)  for 
H  on  MgH  and  BeH.  The  diffuse/polarization 
functions  (5sl4p/12d),  (3s9p/7d),  and  (3s/7p)  are 
added  to  the  primitive  basis  sets  for  Li,  Be,  and  H 
on  MgH  and  BeH,  respectively,  and  contracted  to 
[14sl2pl0d]  for  Li,  [9s7p5d]  for  Be,  and  [6s5p]  for 
H  on  MgH  and  BeH.  The  basis  sets  for  Li,  Be,  and 
H  on  MgH  and  BeH  are  given  in  Table  I.  The 
primitive  basis  sets  for  Na,  Mg,  and  K  are  derived 
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TABLE  I _ 

Smooth-tempered  basis  sets  for  Li,  Be,  and  H. 


Li  [14s12p10d] 

Be  [9s7p5cf] 

H  [6s5p] 

Type 

Exponent 

Coefficient 

Type 

Exponent 

Coefficient 

Type 

Exponent 

Coefficient 

s 

21385.428 

0.00015000019 

s 

2996.0097 

0.00087628616 

s 

82.969412 

0.0025894488 

3194.5111 

0.0011938157 

442.62907 

0.0068262223 

12.476175 

0.019972614 

711.25708 

0.0063812155 

102.22305 

0.0334361 1 1 

2.8198849 

0.098435389 

203.21120 

0.025844243 

29.554163 

0.12406540 

0.80137208 

0.33188081 

67.847365 

0.088942149 

9.5663893 

0.34213598 

0.25901324 

0.6499241 1 

24.968615 

0.26721810 

3.2775362 

0.60474561 

s 

82.969412 

0.0018385256 

9.7649438 

0.69035061 

s 

2996.0097 

0.00069412747 

12.476175 

0.014297031 

s 

21385.428 

0.00014407673 

442.62907 

0.0054235614 

2.8198849 

0.072945077 

3194.5111 

0.0011474109 

102.22305 

0.026767969 

0.80137208 

0.27386902 

71 1 .25708 

0.0061324481 

29.554163 

0.10217944 

0.25901324 

0.72303267 

203.21120 

0.024909655 

9.5663893 

0.30120542 

p 

2.8198849 

0.0054112577 

67.847365 

0.086074721 

3.2775362 

0.66381458 

0.80137208 

0.037682410 

24.968615 

0.26250884 

s 

1.1554537 

1 .000000 

0.25901324 

0.20870216 

9.7649438 

0.69744586 

p 

9.5663893 

0.069383602 

0.089993840 

0.82404431 

s 

3.9668719 

1.0000000 

3.2775362 

0.21148033 

s 

0.089993840 

1.000000 

p 

24.968615 

0.065800904 

1.1554537 

0.80594128 

p 

2.8198849 

0.0049857914 

9.7649438 

0.19389662 

sp 

0.41323385 

1.000000 

0.80137208 

0.032937307 

3.9668719 

0.80737986 

sp 

0.14885938 

1.000000 

0.25901324 

0.20139597 

sp 

1 .6502041 

1.0000000 

d 

1.1554537 

0.024542010 

0.089993840 

0.83219299 

sp 

0.69673783 

1 .0000000 

0.41323385 

0.17376192 

sp 

0.032564898 

1.000000 

d 

3.9668719 

0.057990397 

0.14885938 

0.87145372 

sp 

0.012055978 

1 .000000 

1.6502041 

0.12960203 

spd 

0.053818796 

1 .000000 

sp 

0.0045209224 

1 .000000 

0.69673783 

0.87849098 

spd 

0.019493284 

1 .000000 

spd 

0.29691094 

1.0000000 

spd 

0.0070669971 

1.000000 

spd 

0.12726174 

1.0000000 

spd 

0.0025632172 

1.000000 

spd 

0.054744532 

1.0000000 

spd 

0.023602899 

1.0000000 

spd 

0.010190683 

1.0000000 

spd 

0.0044037687 

1.0000000 

spd 

0.0019040797 

1.0000000 

spd 

0.00082356019 

1.0000000 

spd 

0.00035628623 

1.0000000 

from  Partridge's  uncontracted  basis  sets  [33].  The 
Partridge's  primitive  basis  sets  (ISslOp)  for  Na 
and  Mg  are  contracted  to  [13s7p]  by  contracting 
(6  s)  with  the  six  largest  exponents  in  s  functions 
to  [Is]  by  using  LCAO  coefficients  of  Is  orbital  in 
ROHF  orbitals  as  the  contraction  coefficients  and 
contracting  (4p)  with  the  four  largest  exponents  in 
p  functions  to  [Ip]  by  using  LCAO  coefficients  of 
2p  orbital  in  ROHF  orbitals  as  the  contraction 
coefficients.  The  contracted  basis  sets  for  Na 
and  Mg  [13s7p]  are  added  by  diffuse  functions 
(2s5p)  with  exponents  obtained  from  the  smallest 
ones  divided  by  2.4.  Then  the  polarization  funct¬ 
ions  (10  are  added  for  Na  with  exponents 
(13.51565,  5.966856,  2.700045,  1.218512,  0.542187, 
0.227413,  0.0948,  0.0395,  0.0165,  0.00685).  For  Mg, 


the  polarization  functions  (5d)  are  added  with 
exponents  (7.753275,  1.614324,  0.302946,  0.0526, 
0.00913).  The  Partridge's  primitive  basis  set 
(20sl2p)  for  K  are  contracted  to  [13s7p]  by  con¬ 
tracting  (10s)  with  the  10  largest  exponents  in  s 
functions  to  [3s]  by  using  LCAO  coefficients  of  Is, 
2  s,  and  3  s  orbitals  in  ROHF  orbitals  as  the  contrac¬ 
tion  coefficients  and  contracting  (7p)  with  the  seven 
largest  exponents  in  p  functions  to  [2p]  by  using 
LCAO  coefficients  of  2p  and  3p  orbitals  in  ROHF 
orbitals  as  the  contraction  coefficients.  The  con¬ 
tracted  basis  set  for  K  [13s7p]  is  added  by  diffuse 
functions  (2s4p)  with  exponents  obtained  from  the 
smallest  ones  by  divided  by  2.5.  The  polarization 
functions  (5d)  are  added  for  K  with  exponents 
(4,086109,  0.845487,  0.15171,  0.0243,  0.00388).  Con- 


670 


VOL.  65,  NO.  5 


CALCULATION  OF  FREQUENCY-DEPENDENT  POLARIZABILITIES 


sequently,  the  basis  sets  for  Na,  Mg,  and  K  be¬ 
comes  [15sl2pl0d],  [15sl2p5d],  and  [15sllp5d], 
respectively.  For  H  on  CaH  and  NH2,  we  em¬ 
ployed  Sadlej's  polarized  basis  set  [34]  [3s2p] 
modified  by  Sasagane  [16,  36].  For  H  on  NH2,  the 
diffuse/polarization  functions  (ls/1  p)  which  have 
sp  exponent  0.010215  are  added.  For  C  and  N,  we 
used  Sadlej's  polarized  basis  sets  [6s4p2d]  modi¬ 
fied  by  Sasagane  with  diffuse/polarization  func¬ 
tions  (1  sip /Id)  in  which  s  exponents  are  0.014653 
for  C  and  0.020168  for  N  and  pd  exponents  are 
0.012198  for  C  and  0.017337  for  N.  The  basis  sets 
for  C,  N,  and  FI  on  CaH  and  NH2  are  shown  in 
Table  IT  For  Ca,  Sadlej's  original  polarized  basis 
set  [9s7p2d]  [35]  are  used. 

GEOMETRIES 

The  calculations  of  dipole  moments  and  dy¬ 
namic  polarizabilities  of  BeH,  MgH,  CaH,  CN,  and 
NH2  are  performed  at  the  experimental  equilib¬ 
rium  geometries  [37,  38].  For  BeH,  MgH,  CaH,  and 


CN,  =  1.3431  A,  =  1.7306  A,  R^aH  = 
2.002  A,  and  Rq^  =  1.1718  A.  For  NH2,  the  2  axis 
is  ^21)  -^NH  ~  1.024  A,  and  ~  103.36  . 


Results 

In  Table  III,  the  experimental  [39,  40]  and  other 
theoretical  [20,  21,  22-25]  values  of  static  polariz¬ 
abilities  of  Li,  Na,  and  K  atoms  are  given  with  our 
work.  To  our  knowledge,  no  experimental  values 
of  dynamic  polarizabilities  of  Li,  Na,  and  K  atoms 
have  been  reported.  In  Tables  IV,  V,  and  VI,  the 
results  of  calculations  of  static  and  dynamic  polar¬ 
izabilities  for  Li,  Na,  and  K  atoms  at  the  TDROHF 
and  TDROHF  -I-  MP2  levels  of  theory  are  pre¬ 
sented,  respectively.  The  electron  correlation  con¬ 
tribution  to  the  polarizabilities  estimated  by  the 
second-order  Moller-Plesset  perturbation  theory 
(MP2  contribution)  is  also  shown.  In  Table  IV, 
frequency-dependent  polarizabilities  for  Li  atom  in 


TABLE  II _ 

Modified  Sadlej’s  polarized  basis  sets  for  C,  N,  and  H. 


C  [7s5p3d] 

N  [7s5p3c/] 

H  [3s2p]® 

Type 

Exponent 

Coefficient 

Type 

Exponent 

Coefficient 

Type 

Exponent 

Coefficient 

s 

5240.6353 

0.02235882 

s 

8104.0716 

0.02229515 

s 

33.865014 

0.008761077 

782.20480 

0.1724634 

1216.0215 

0.1715770 

5.094788 

0.06542281 

178.35083 

0.8672313 

277.23428 

0.8679323 

1.158786 

0.2928504 

s 

50.815942 

0.3030337 

s 

78.904023 

0.2899865 

0.325840 

0.7272122 

16.823562 

0.7400231 

25.874419 

0.7525437 

s 

0.102741 

1.000000 

s 

6.175776 

0.6940676 

s 

9.346767 

0.6700486 

s 

0.032395 

1.000000 

2.418049 

0.3400776 

3.579794 

0.3669701 

p 

1.158786 

0.1864409 

s 

0.511900 

1 .000000 

s 

0.739610 

1.000000 

0.325840 

0.8730840 

s 

0.156590 

1 .000000 

s 

0.222617 

1.000000 

p 

0.102741 

0.9764826 

s 

0.047901 

1 .000000 

s 

0.067006 

1 .000000 

0.032395 

0.03443044 

s 

0.014653 

1 .000000 

s 

0.020168 

1 .000000 

p 

18.841800 

0.03923992 

p 

26.868987 

0.03948419 

4.159240 

0.2437874 

5.991227 

0.2486013 

1.206710 

0.8158682 

1 .750842 

0.8111297 

p 

0.385540 

1 .000000 

p 

0.560511 

1 .000000 

p 

0.121940 

1 .000000 

p 

0.175948 

1 .000000 

p 

0.038568 

1 .000000 

p 

0.055231 

1 .000000 

p 

0.012198 

1 .000000 

p 

0.017337 

1 .000000 

d 

1.206710 

0.2679750 

d 

1.750842 

0.2770034 

0.385540 

0.8199855 

0.56051 1 

0.8128187 

d 

0.121940 

0.6277424 

d 

0.175948 

0.5974958 

0.038568 

0.4964327 

0.055231 

0.5295971 

d 

0.012198 

1 .000000 

d 

0.017337 

1 .000000 

®For  H  on  NH2,  the  diffuse/ polarization  functions  {1s /1p)  which  have  sp  exponent  0.010215  are  added. 
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TABLE  III 


Experimental  and  theoretical  values  of  static  polarizabilities  of  Li, 

Na,  and  K  atoms  (In  a.u.). 

Li 

Na 

K 

ROHF  (this  work) 

170.13 

190.55 

416.27 

ROHF  +  MP2  (this  work) 

165.01 

165.88 

285.23 

PNO-CEPA 

164.5  [22] 

165.02  [23] 

287.6  [23] 

Cl  Hylleraas  [20] 

164.1 

MCSCF[21] 

164.91 

UMP4  [24] 

164.55 

UHF-CCSD(T)  [25] 

164.187 

Experimental  values 

164  ±  3  [39] 

159.2  ±  3.4  [39] 

292.8  +  6.1  [39] 

164.6  +  11.5  [40] 

305.0  +  21 .6  [40] 

TABLE  IV _ _ _ 

Frequency-dependent  polarizabilities  a(“0>;(o)  of  Li  atoms  (in  a.u.). 

Frequency  co 

0  0.005  0.010  0.015  0.020  0.025  0.030  0.035 


TDROHF 


a(  —  w;  0)) 

TDROHF +  MP2 

170.13 

171.07 

173.95 

a(— o);  co) 

Cl  Hylleraas  [20] 

165.01 

165.90 

168.63 

a(  —  o);  co) 

TDMCSCF[21] 

164.1 

165.0 

167.7 

a(  —  co;  co) 

MP2  contribution® 

164.91 

165.80 

168.53 

a(—  CO]  co) 

-5.121 

-3.10% 

-5.169 

-3.12% 

-5.319 

-3.15% 

178.97 

186.52 

197.22 

212.11 

232.92 

173.38 

180.52 

190.61 

204.61 

224.09 

172.4 

179.5 

189.6 

203.4 

222.8 

173.29 

-5.585 

-3.22% 

-5.996 

-3.32% 

-6.603 

-3.46% 

-7.494 

-3.66% 

-8.831 

-3.94% 

®MP2  contribution  is  obtained  after  subtracting  the  TDROHF  value  from  the  TDROHF  +  MP2  value.  The  percentage  is  obtained  by 
dividing  the  MP2  contribution  with  the  TDROHF  +  MP2  value. 


TABLE  V _ 

Frequency-dependent  polarizabilities  a(-to;  cd)  of  Na  atoms  (in  a.u,). 


Frequency  co 


0 

0.005 

0.010 

0.015 

0.020 

0.025 

0.030 

0.035 

TDROHF 

a(—  CO]  co) 

190.55 

191 .48 

194.32 

199.25 

206.58 

216.86 

230.90 

250.05 

TDROHF +  MP2 

a(—  CO]  co) 

165.88 

166.57 

168.66 

172.27 

177.58 

184.88 

194.64 

207.51 

MP2  contribution® 
a(  — co;  co) 

-24.67 

-24.91 

-25.65 

-26.98 

-29.01 

-31.97 

-36.26 

-42.54 

-14.87% 

-14.95% 

-15.21% 

-15.66% 

-16.33% 

-17.29% 

-18.63% 

-20.50% 

®MP2  contribution  is  obtained  after  subtracting  the  TDROHF  value  from  the  TDROHF  +  MP2  value.  The  percentage  is  obtained  by 
dividing  the  MP2  contribution  with  the  TDROHF  +  MP2  value. 
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TABLE  VI _ 

Frequency-dependent  polarizabilities  a(- co;  o>)  of  K  atoms  (in  a.u.). 


Frequency  co 


0 

0.005 

0.010 

0.015 

0.020 

0.025 

TDROHF 

a(  — (w;  (w) 

416.27 

420.50 

433.71 

457.71 

496.22 

556.55 

TDROHF  +  MP2 
a(  —  w;  0)) 

285.23 

286.71 

291.18 

298.59 

308.58 

319.49 

MP2  contribution® 
a(  —  w;  (o) 

-131.05 

-133.79 

-142.53 

-159.12 

-187.64 

-237.06 

-45.94% 

-46.66% 

-48.95% 

-53.29% 

-60.81% 

-74.20% 

^MP2  contribution  is  obtained  after  subtracting  the  TDROHF  value  from  the  TDROHF  +  MP2  value.  The  percentage  is  obtained  by 
dividing  the  MP2  contribution  with  the  TDROHF  +  MP2  value. 


Cl  Hylleraas  calculation  by  Pipin  and  Bishop  [20] 
and  those  at  the  TDMCSCF  level  by  Jaszuhski  and 
Rizzo  [21]  are  shown  with  our  work.  Jaszuhski  and 
Rizzo  used  [\3sllp9d5f]  basis  set  and  employed 
5s4p2d  as  the  orbitals  in  the  active  space  to  obtain 
the  results  of  dynamic  polarizabilities  for  Li  atom 
shown  in  Table  IV.  Compared  with  the  experimen¬ 
tal  and  other  theoretical  values,  the  static  polariz¬ 
abilities  of  Li,  Na,  and  K  atoms  at  the  ROHF  + 
MP2  level  give  good  corrections  to  those  at  the 
ROHF  level.  Moreover  the  frequency-dependent 
polarizabilities  of  Li  atom  at  the  TDROHF  +  MP2 
level  of  theory  are  slightly  higher  than  those  in  Cl 
Hylleraas  calculations  by  Pipin  and  Bishop  and 
TDMCSCF  calculations  by  Jaszuhski  and  Rizzo  but 
in  good  agreement  with  them.  The  calculations  of 
frequency-dependent  polarizabilities  for  Li  atom  at 
the  TDROHF  +  MP2  level  of  theory  are  expected 
to  give  reliable  results.  On  the  other  hand,  no  other 
theoretical  values  of  dynamic  polarizabilities  of  Na 
and  K  atoms  have  been  reported.  However,  as  the 
static  limits  of  polarizabilities  for  Na  and  K  atoms 
at  the  TDROHF  +  MP2  level  of  theory  have  very 
close  values  to  the  experimental  values,  the  dy¬ 
namic  polarizabilities  of  Na  and  K  atoms  are  also 
likely  to  give  good  results.  In  Tables  VII,  VIII,  IX, 
X  and  XI,  the  results  of  calculations  of  dipole 
moments  and  static  and  dynamic  polarizabilities 
for  BeH,  MgH,  CaH,  CN,  and  NH2  at  the  ROHF 
and  ROHF  -I-  MP2  levels  of  theory  are  presented 
with  MP2  contribution,  respectively.  To  our 
knowledge,  no  experimental  and  other  theoretical 
values  of  static  and  dynamic  polarizabilities  for 
BeH,  MgH,  CaH,  CN,  and  NH2  have  also  been 
reported.  So  we  cannot  compare  the  calculations  of 


frequency-dependent  polarizabilities  of  those 
open-shell  molecules  at  the  TDROHF  +  MP2  level 
of  theory  with  other  results. 

The  percentage  of  MP2  contribution  to  dynamic 
polarizabilities  a(  -  w;  (o)  of  Li,  Na,  and  K  atoms 
and  average  dynamic  polarizabilities  a(-ft);  (o)  of 
BeH,  MgH,  CaH,  and  CN  becomes  greater  as  the 
frequency  increases.  The  order  of  percentage  of 
MP2  contribution  to  dynamic  polarizabilities  is 
Li  <  Na  <  K  and  BeH  <  MgH  <  CaH.  The  per¬ 
centage  of  MP2  contribution  to  oy)  of  NH2 

shows  little  change  over  the  frequency  range  oy  = 
0-0.045  a.u.,  where  the  percentage  is  8.55-8.57%. 


Conclusion  and  Discussion 

We  presented  the  first  calculations  of 
frequency-dependent  polarizabilities  for  open-shell 
systems  except  for  Li  atoms  at  the  correlated  level 
of  theory.  Compared  with  experimental  [39,  40] 
and  other  calculations  [20,  21,  22-25],  the  static 
polarizabilities  of  Li,  Na,  and  K  atoms  at  the 
ROHF  +  MP2  level  give  good  corrections  to  those 
at  the  ROHF  level.  The  dynamic  polarizabilities  of 
Li  atoms  at  the  TDROHF  +  MP2  level  of  theory 
are  in  good  agreement  with  the  previous  calcula¬ 
tions  [20,  21].  As  no  experimental  and  other  theo¬ 
retical  values  of  static  and  dynamic  polarizabilities 
for  BeH,  MgH,  CaH,  CN,  and  NH2  have  also  been 
reported,  we  cannot  tell  whether  the  calculations 
of  frequency-dependent  polarizabilities  of  those 
open-shell  molecules  at  the  TDROHF  -F  MP2  level 
of  theory  are  reliable  or  not  at  this  time.  However, 
the  treatment  of  frequency-dependent  polarizabili¬ 
ties  by  TDROHF  +  MP2  method  is  expected  to  be 
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TABLE  VII _ 

Dipole  moments  (t  and  frequency-dependent  polarizabilities  «(-(»;  co)  of  BeH  (in  a.u.). 


Frequency  w 


0 

0.005 

0.010 

0.015 

0.020 

0.025 

0.030 

0.035 

0.040 

0.045 

ROHF 

f^z 

ROHF  +  MP2 

0.1096 

f^z 

TDROHF 

0.1069 

36.326 

36.378 

36.535 

36.802 

37.188 

37.703 

39.205 

39.205 

40.250 

41.551 

0)) 

24.806 

24.815 

24.840 

24.883 

24.943 

25.021 

25.116 

25.231 

25.365 

25.518 

a{  —  (o',  0))^ 

TDROHF  +  MP2 

32.486 

32.523 

32.637 

32.829 

33.106 

33.476 

33.950 

34.547 

35.288 

36.207 

a^^(  — w;  0)) 

36.290 

36.345 

36.510 

36.791 

37.197 

37.742 

38.448 

39.344 

40.470 

41 .886 

a^^(-(x)\  0)) 

25.578 

25.587 

25.614 

25.659 

25.723 

25.805 

25.907 

26.028 

26.170 

26.333 

a(  —  (o]  (o)^ 

MP2  contribution^ 

32.719 

32.759 

32.878 

33.080 

33.372 

33.763 

34.268 

34.905 

35.703 

36.702 

^xx(~^»  Cl>) 

-0.036 

-0.033 

-0.026 

-0.011 

0.009 

0.039 

0.081 

0.139 

0.220 

0.335 

-0.10% 

-0.09% 

-0.07% 

-0.03% 

0.02% 

0.10% 

0.21% 

0.35% 

0.54% 

0.80% 

0.772 

0.772 

0.774 

0.776 

0.780 

0.785 

0.790 

0.797 

0.805 

0.815 

3.02% 

3.02% 

3.02% 

3.03% 

3.03% 

3.04% 

3.05% 

3.06% 

3.08% 

3.09% 

a(—  a)‘,  0))^ 

0.233 

0.235 

0.241 

0.251 

0.266 

0.288 

0.317 

0.359 

0.415 

0.495 

0.91% 

0.92% 

0.94% 

0.98% 

1 .03% 

1.11% 

1.23% 

1 .38% 

1.59% 

1 .88% 

a—  1  ,  1 

a  —  +  3 

^MP2  contribution  is  obtained  after  subtracting  the  TDROHF  value  from  the  TDROHF  +  MP2  value.  The  percentage  is  obtained  by 
dividing  the  MP2  contribution  with  the  TDROHF  +  MP2  value. 


TABLE  VIII _ 

Dipole  moments  (x  and  frequency-dependent  polarizabilities  a(-a);  €o)  of  MgH  (in  a.u.). 


Frequency  co 

0  0.005  0.010  0.015  0.020  0.025  0  030  0.035  0.040  0.045 


ROHF 

/^z 

ROHF  +  MP2 

0.5862 

TDROHF 

0.5865 

o)\  (o) 

69.222 

a^zi-f^',  (*>) 

53.780 

a(  —  w; 

TDROHF  +  MP2 

64.075 

a^yi  —  oj',  0)) 

65.762 

0)) 

56.452 

a(—o)',  w)® 

MP2  contribution^ 

62.658 

^xx(“^j  a>) 

-3.460 

-5.26% 

(o) 

2.672 

4.73% 

-1.416 

-2.26% 

69.353 

69.749 

70.426 

53.817 

53.929 

54.117 

64.174 

64.476 

64.989 

65.880 

66.236 

66.845 

56.491 

56.609 

56.807 

62.750 

63.027 

63.499 

-3.473 

-3.513 

-3.580 

-5.27% 

-5.30% 

-5.36% 

2.674 

2.680 

2.690 

4.73% 

4.73% 

4.74% 

- 1 .424 

- 1 .449 

- 1 .490 

-2.27% 

-2.30% 

-2.35% 

71.406 

72.726 

74.441 

54.382 

54.728 

55.158 

65.731 

66.727 

68.013 

67.727 

68.917 

70.462 

57.087 

57.451 

57.904 

64.181 

65.095 

66.276 

-3.678 

-3.809 

-3.978 

-5.43% 

-5.53% 

-5.65% 

2.704 

2.723 

2.746 

4.74% 

4.74% 

4.74% 

- 1 .551 

-1.632 

- 1 .737 

-2.42% 

-2.51% 

-2.62% 

76.627 

79.395 

82.906 

55.676 

56.288 

57.002 

69.643 

71 .693 

74.271 

72.435 

74.935 

78.114 

58.449 

59.094 

59.844 

67.773 

69.655 

72.024 

-4.192 

-4.460 

-4.793 

-5.79% 

-5.95% 

-6.14% 

2.773 

2.806 

2.843 

4.74% 

4.75% 

4.75% 

-1.871 

-2.038 

-2.247 

-2.76% 

-2.93% 

-3.12% 

‘^MP2  contribution  is  obtained  after  subtracting  the  TDROHF  value  from  the  TDROHF  +  MP2  value.  The  percentage  is  obtained  by 
dividing  the  MP2  contribution  with  the  TDROHF  +  MP2  value. 
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TABLE  IX _ 

Dipole  moments  |x  and  frequency-dependent  polarizabilities  a(-(o;  ca)  of  CaH  (in  a.u.). 

Frequency  o) 

0  0.005  0.010  0.015  0.020  0.025  0.030  0.035 


ROHF 

fJiz 

ROHF  +  MP2 
f^z 

TDROHF 

—  w;  0)) 
azz(-o)\  (o) 
a(  —  co',  o))® 
TDROHF  +  MP2 

(o]  (o) 

a(“(y;  (o)^ 

MP2  contribution‘s 
a^^(  — (w;  (o) 

azz(-co',  (o) 


0.8351 

1.0168 

176.63 

177.48 

102.00 

102.19 

151.75 

152.39 

149.93 

150.48 

103.15 

103.34 

134.34 

134.77 

-26.70 

-27.00 

-17.81% 

-17.94% 

180.10 

184.68 

102.78 

103.78 

154.33 

157.71 

152.18 

155.12 

103.90 

104.86 

136.09 

138.37 

-27.92 

-29.56 

-18.35% 

-19.06% 

1.12 

1.08 

1 .08% 

1 .03% 

-18.24 

-19.35 

-13.40% 

-13.98% 

191.57 

201 .37 

105.23 

107.16 

162.79 

169.97 

159.46 

165.47 

106.24 

108.09 

141.72 

146.34 

-32.11 

-35.90 

-20.14% 

-21.70% 

1.02 

0.927 

0.96% 

0.86% 

-21.07 

-23.63 

-14.87% 

-16.14% 

215.10 

234.45 

109.65 

112.79 

179.95 

193.89 

173.56 

184.33 

110.46 

113.44 

152.53 

160.70 

-41.54 

-50.11 

-23.93% 

-27.19% 

0.810 

0.656 

0.73% 

0.58% 

-27.42 

-33.19 

-17.98% 

-20.65% 

1.16  1.15 

1.12%  1.11% 

E(-o)\o})^  -17.42  -17.62 

-12.97%  -13.07% 


‘^MP2  contribution  is  obtained  after  subtracting  the  TDROHF  value  from  the  TDROHF  +  MP2  value.  The  percentage  is  obtained  by 
dividing  the  W\P2  contribution  with  the  TDROHF  +  MP2  value. 

TABLE  X _ 

Dipole  moments  |x  and  frequency-dependent  polarizabilities  a(-co;  <o)  of  CN  (In  a.u.). 


Frequency  co 

0  0.005  0.010  0  015  0.020  0.025  0.030 


ROHF 

Mz 

ROHF  +  MP2 

-0.9123 

TDROHF 

-0.5995 

a^^(  —  cu;  a>) 

17.424 

azzi~o)-,  (o) 

20.008 

a(  —  w;  0))^ 

TDROHF  +  MP2 

18.285 

Co) 

18.736 

azz(-(x)\  (o) 

24.039 

a(—  CO',  (o)^ 

MP2  contribution^^ 

20.504 

axx(~<w5  o)) 

1.312 

7.00% 

Giz:^(-oy\  co) 

4.031 

16.77% 

a(~  CO’,  co)^ 

2.218 

10.82% 

17.466 

17.599 

17.835 

20.010 

20.017 

20.029 

18.314 

18.405 

18.566 

18.840 

19.169 

19.777 

24.045 

24.064 

24.094 

20.575 

20.800 

21.216 

1.373 

1.570 

1.943 

7.29% 

8.19% 

9.82% 

4.035 

4.046 

4.065 

16.78% 

16.81% 

16.87% 

2.261 

2.396 

2.650 

10.99% 

11.52% 

12.49% 

18.203 

18.755 

19.591 

20.045 

20.067 

20.092 

18.817 

19.192 

19.758 

20.782 

22.415 

25.174 

24.138 

24.194 

24.263 

21.900 

23.008 

24.870 

2.579 

3.660 

5.583 

12.41% 

16.32% 

22.18% 

4.092 

4.127 

4.170 

16.95% 

17.06% 

17.19% 

3.083 

3.816 

5.112 

14.08% 

16.58% 

20.56% 

‘SMP2  contribution  is  obtained  after  subtracting  the  TDROHF  value  from  the  TDROHF  +  MP2  value.  The  percentage  is  obtained  by 
dividing  the  MP2  contribution  with  the  TDROHF  +  MP2  value. 
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TABLE  XI  _ _ 

Dipole  moments  p  and  frequency-dependent  polarizabilities  a(-c»;  co)  of  NH2  (in  a.u.). 


Frequency  co 

0  0.005  0010  0.015  0020  0.025  0.030  0.035  0.040  0.045 


ROHF 

ROHF  +  MP2 
Mz 

TDROHF 

a>) 

(o) 

a(  — tt);  0))^ 

TDROHF  +  MP2 

0iyyi-0l)\  <^) 
a(  — £t>;  0))^ 

MP2  contribution*^ 
a^^{-~(o',  (o) 

OLyy(--O)',  (O) 

a(  —  (o)^ 


-0.7391 

-0.7052 

9.830 

9.834 

11.932 

11.933 

11.736 

11.738 

11.166 

11.168 

10.774 

10.779 

12.684 

12.685 

13.180 

13.181 

12.212 

12.215 

0.945 

0.945 

8.77% 

8.76% 

0.752 

0.752 

5.93% 

5.93% 

1.443 

1.444 

10.95% 

10.95% 

1.047 

1.047 

8.57% 

8.57% 

9.847 

9.870 

1 1 .935 

1 1 .940 

11.741 

11.747 

11.175 

11.186 

10.792 

10.815 

12.688 

12.693 

13.186 

13.195 

12.222 

12.234 

0.945 

0.945 

8.75% 

8.73% 

0.753 

0.754 

5.93% 

5.94% 

1.445 

1.447 

10.96% 

10.97% 

1.047 

1.049 

8.57% 

8.57% 

9.904 

9.949 

1 1 .946 

1 1 .955 

11.756 

1 1 .767 

1 1 .202 

1 1 .224 

10.848 

10.894 

12.701 

12.711 

13.207 

13.222 

12.252 

12.275 

0.945 

0.944 

8.71% 

8.67% 

0.755 

0.756 

5.94% 

5.95% 

1.451 

1.455 

10.99% 

11.01% 

1.050 

1.052 

8.57% 

8.57% 

10.009 

10.086 

11.965 

11.977 

11.780 

11.796 

11.251 

11.286 

10.953 

11.031 

12.723 

12.737 

13.240 

13.262 

12.306 

12.343 

0.944 

0.945 

8.62% 

8.56% 

0.758 

0.760 

5.96% 

5.97% 

1.460 

1.467 

11.03% 

1 1 .06% 

1.054 

1.057 

8.57% 

8.56% 

10.186 

10.317 

1 1 .991 

12.007 

11.814 

11.835 

1 1 .330 

1 1 .386 

11.131 

11.263 

12.754 

12.772 

13.288 

13.317 

12.391 

12.451 

0.945 

0.946 

8.49% 

8.40% 

0.763 

0.765 

5.98% 

5.99% 

1.474 

1.482 

11.09% 

11.13% 

1.061 

1.065 

8.56% 

8.55% 

OL  3  +  dyy  + 

°MP2  contribution  is  obtained  after  subtracting  the  TDROHF  value  from  the  TDROHF  +  MP2  value.  The  percentage  is  obtained  by 
dividing  the  MP2  contribution  with  the  TDROHF  +  MP2  value. 


valid  for  nondegenerate  open-shell  systems  in 
which  only  dynamical  pair  correlation  is  signifi¬ 
cant,  that  is,  TDROHF  wave  function  is  good  as 
a  reference  state  of  Moller-Plesset  perturbation 
theory. 

In  this  calculation,  OPTl  definition  is  used  as 
the  orbital  energy  matrix.  There  are  some  kinds  of 
open-shell  perturbation  theory  with  respect  to 
electron  correlation  by  spin-unrestricted  or  re¬ 
stricted  and  the  difference  of  definition  of  orbital 
energy.  It  is  necessary  to  estimate  the  effect  to 
response  property  by  such  differences  in  open-shell 
perturbation  theory.  In  this  study,  we  treated 
open-shell  doublet  systems  for  the  calculation  of 
dynamic  polarizabilities  at  the  TDROHF  +  MP2 
level  of  theory.  In  the  dynamic  polarizability  for 
open-shell  system  which  has  spin  multiplicity  more 
than  triplet,  the  MBPT  poles  corresponding  to  the 
difference  between  (open  +  open)  and  (closed  + 
closed)  orbital  energy  (e^  +  -  e^)  may  ex¬ 

ist,  and  they  may  be  smaller  than  the  first  excited 


energy  by  spin-restricted  open-shell  random-phase 
approximation  (RPA)  or  TDROHF  since  open  or¬ 
bital  energy  may  be  very  close  to  closed  orbital 
energy.  In  such  a  case,  the  first  dispersion  of  dy¬ 
namic  polarizability  exists  before  the  first  excited 
energy  by  RPA.  However,  such  a  MBPT  pole  has 
no  physical  meaning.  That  difficulty  may  be 
avoided  by  utilizing  the  arbitrarity  of  orbital 
energy. 
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ABSTRACT:  Ab  initio  calculations,  including  the  electron  correlation  effects  via  the 
Moller-Plesset  partitioning,  of  the  static  longitudinal  polarizability  (ot^),  first  and  second 
hyperpolarizabilities  ( and  Ji)  of  small  push-pull  molecules  have  been  carried  out 
with  the  6-31G  and  6-31G*  basis  sets.  Two  types  of  conjugated  segment  (polyacetylene 
and  polyyne)  were  used  as  linker  between  the  nitro  and  amino  fragments,  chosen  as 
acceptor  and  donor  groups.  For  small  chains,  the  triply  bonded  structures  lead  to  larger 
than  the  doubly  bonded  structures,  although  the  and  values  order  in  the 
opposite  direction.  The  second-order  Moller-Plesset  (MP2)  corrections  are  essential  to 
obtain  correct  estimates  of  the  hyperpolarizabilities  and  are  generally  much  larger  than 
the  MP3  and  MP4  contributions.  Geometry  optimizations  were  carried  out  with  both 
basis  sets  within  the  Hartree-Fock  and  MP2  schemes;  the  MP2  geometries  lead  to  smaller 
bond  length  alternations  and  thus  larger  (hyper)polarizabilities.  For  a  careful  choice  of 
the  geometry  is  more  crucial  than  the  use  of  polarization  functions  during  the  electronic 
state  calculations;  however,  for  the  hyperpolarizabilities,  both  the  geometry  and  the  basis 
set  are  of  importance.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  679-688,  1997 
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Introduction 

The  nonlinear  optical  (NLO)  properties  of 
push-pull  systems  have  been  widely  investi¬ 
gated,  both  theoretically  and  experimentally  in  the 
recent  years.  These  molecules  are  composed  of  a 
conjugated  segment  end-capped  by  donor  and  ac¬ 
ceptor  groups.  Numerous  end  groups  and  linkers 
were  tested  in  order  to  obtain  large  NLO  responses 
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(see  the  review  article  of  Kanis  et  al.  [1]).  As 
shown  by  Sim  et  al.  for  the  typical  case  of  p-nitro- 
aniline  the  electron  correlation  could  lead  to  signi¬ 
ficant  variations  of  the  predicted  NLO  responses 
[2].  Indeed  the  inclusion  of  the  second-order 
Moller-Plesset  (MP2)  correction  gives,  in  compari¬ 
son  to  the  Hartree-Fock  level,  variations  of  + 12%, 
+  88%  and  +100%  for  the  polarizability,  the  first 
and  second  hyperpolarizability,  respectively.  How¬ 
ever,  most  of  the  ab  initio  calculations  including 
the  electron  correlation  effects  have  focused  on 
small  systems,  and  we  refer  the  reader  to  the 
works  of  Shelton  and  Rice  [3]  and  Bartlett  and 
Sekino  [4]  for  complete  reviews  of  their 
(hyper)polarizability  evaluations. 

To  maximize  the  longitudinal  polarizability 
(a^^),  first  hyperpolarizability  ( j8^),  and  second  hy¬ 
perpolarizability  (ji),  a  large  electronic  delocaliza¬ 
tion  along  the  backbone  of  the  molecule  is  needed; 
so,  increasing  the  length  of  the  linker  in  push-pull 
molecules  leads  to  larger  a^,  and  values.  In 
the  case  of  fBi,  the  electronic  asymmetry  of  the 
fragment  is  also  of  importance.  Indeed,  it  was 
shown  experimentally  [5]  and  theoretically  [6,  7] 
that  Pi  of  a  symmetric  conjugated  segment  end- 
capped  by  donor  and  acceptor  groups  presents  a 
saturation  to  a  constant  when  the  length  of  the 
linker  increases.  This  is  explained  by  the  fact  that 
the  push-pull  strength  becomes  diluted  in  the  con¬ 
jugated  space  for  large  chains.  This  saturation  is 
even  reproduced  by  the  simple  two-state  picture, 
because  the  energy  difference  and  the  dipolar  tran¬ 
sition  moment  between  the  highest  occupied  and 
lowest  unoccupied  molecular  orbitals  (HOMO  and 
LUMO)  as  well  as  the  difference  of  dipole  moment 
between  the  ground  and  excited  states  become 
constant  for  very  large  chains  [1].  Nevertheless,  the 
saturation  regime  is  only  reached  for  extended 
chains;  for  example,  in  the  case  of  a  a,(o- 
nitro,amino-polyacetylene  systems,  18  carbon 
atoms  are  necessary  in  order  to  reach  the  inflection 
point  (where  the  increase  of  Pi  begins  to  slow 
down),  whereas  Pi  is  not  yet  constant  for  a  chain 
containing  28  carbon  atoms  [7],  So  the  formula. 
Pi  =  kN'^,  which  is  generally  used  to  fit  the  expo¬ 
nential  grow  of  Pi  with  chain  length  (N),  is  only 
valid  for  small  chain  length  and,  in  order  to  fit  the 
global  shape  of  the  evolution  of  Pi,  one  has  to  use 
more  elaborated  functions  [7,  8].  To  compensate 
this  drawback  of  a  smaller  Pi/N  (longitudinal  first 
hyperpolarizability  per  unit  cell)  for  large  chains. 


one  could  think  of  an  asymmetric  linker  which 
will  also  contribute  to  the  enhancement  of  /3^,  If 
the  use  of  such  a  segment  prevents  the  saturation 
of  Pi  when  chain  length  increases,  it  is  however  to 
the  prejudice  of  a  smaller  electronic  delocalization 
along  the  backbone  (as  shown,  for  small  oligomers, 
by  Tsunekawa  et  al.  [9,  10]),  and  thus  Pi  of 
medium  chains  is  smaller  than  for  the  correspond¬ 
ing  symmetric  linker  [7].  Another  idea  is  to  bring 
more  tt  electrons  to  the  chain  by  using  triply 
bonded  structures.  This  has  sometimes  been  pro¬ 
posed  to  optimize  the  NLO  properties,  although  it 
has  not  produced  experimental  [11]  or  theoretical 
[1,  12,  13]  larger  Pi  values  than  the  corresponding 
doubly  bonded  structures  with  the  exception  of 
the  smallest  systems. 

Correlated  semi-empirical  calculations  of  the 
evolution  with  chain  length  of  Pi  of  push-pull 
systems  have  been  performed  recently;  we  refer 
the  reader  to  Ref.  [13]  for  a  summary  of  these 
investigations.  However,  to  our  knowledge,  no 
work  has  addressed  the  electron  correlation  effects 
on  ab  initio  Pi  in  a  series  of  oligomers  of  increas¬ 
ing  size.  Only  a  few  recent  calculations  have  ad¬ 
dressed  the  electron  correlation  effects  upon 
and  ji  in  oligomeric  series.  First  calculations  on 
model  hydrogen  chains  have  been  carried  out 
[14-16]  and  have  shown  that,  in  the  case  of  an 
infinite  chain,  the  ratio  between  the  MP4  and  cou¬ 
pled  Hartree-Fock  (CHF)  varies  between  .97 
and  .90  with  the  6-311G**  basis  set,  while  the 
corresponding  ji  ratio  is  1.31.  For  polyacetylene, 
polyyne,  polypyrrole  the  ratios  between  the  MP2 
and  CHF  are  2.84,  8.0  and  2.2,  respectively,  with 
the  6-31G  basis  set  [17,  18], 

Our  study  is  organized  as  followed.  A  brief 
presentation  of  the  methods  is  given  and  followed 
by  a  detailed  discussion  of  the  results.  In  particu¬ 
lar,  comparisons  with  the  nonsubstituted  chains 
are  carried  out.  Finally,  we  summarize  and  con¬ 
clude. 


Methodology 

STATIC  HYPERPOLARIZABILITIES 

The  application  of  a  static  uniform  electric  field 
(E)  on  a  system  leads  to  a  reorganization  of  its 
charge  distribution.  This  reorganization  is  charac¬ 
terized  by  representing  either  the  dipole  moment 
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(|jl)  or  the  energy  (e)  as  a  power  series  in  E: 

1  1 

=  p,o  +  ot  E  +  — p:EE  +  —7  lEEE  +  ••• , 

(1) 

1  1 

^  =  ^0  “  M^o  *  E  -  —ol:  EE  -  — P  :EEE - , 

(2) 


where  the  subscript  characterizes  the  order  of  the 
electron-electron  interactions,  and  the  superscript 
describes  the  type  of  substitutions  used  in  the 
intermediate  states;  S  stands  for  single  substitu¬ 
tions,  D  for  double  substitutions,  T  for  triple  sub¬ 
stitutions.  64^*^^  is  the  energy  term  coming  from 
the  disconnected  quadruple  substitutions  and  the 
associated  renormalization.  For  (P^  =  a^,  or 
7l)/  similar  equations  may  be  written: 


where  eo  and  iuiq  are  the  energy  and  the  dipole 
moment  of  the  molecule  in  the  absence  of  an 
external  electric  field,  respectively;  a  is  the  electric 
dipole  polarizability  tensor  which  represents  the 
linear  response  of  the  dipole  moment  to  the  exter¬ 
nal  electric  field.  P  and  7  are  the  first  and  second 
hyperpolarizability  tensors  which  correspond  to 
the  nonlinear  phenomena.  When  the  molecule  is 
extended  primarily  in  one  direction,  the  longitudi¬ 
nal  component  of  the  (hyper)polarizabilities  is  the 
largest  because  the  electrons  are  primarily  delocal¬ 
ized  along  the  backbone  of  the  molecule.  This  is 
why  we  have  restricted  our  investigation  to  this 
component. 

If  the  Hellmann-Feymann  [19,  20]  theorem  is 
fulfilled,  i.e.,  if  the  wave  functions  are  optimized 
variationally  with  respect  to  all  parameters,  the 
quantities  in  Eqs.  (1)  and  (2)  (jjlq,  a, , . , )  are  equiv¬ 
alent.  This  theorem  is  satisfied  by  Hartree-Fock 
(HF)  wave  functions,  however  it  is  not  the  case  of 
the  Moller-Plesset  ones.  Therefore,  there  exist  two 
ways  (leading  to  different  numerical  values)  to 
compute  the  (hyper)polarizabilities;  either  via  a 
differentiation  of  the  field-dependent  dipole  mo¬ 
ment  or  via  a  differentiation  of  the  field-dependent 
energy,  the  last  one  being  chosen  in  the  present 
work. 

MOLLER-PLESSET  PARTITIONING 

Electron  correlation  corrections  have  been  con¬ 
sidered  within  the  Moller-Plesset  partitioning 
scheme  [21]  which  leads  to  successive  corrections 
(MP2,  MP3, . . . )  which  are  consistent  with  elec¬ 
tron-electron  interactions  through  second,  third, 
etc.  order.  This  technique  presents  the  advantage 
to  be  size-consistent,  i,e,,  to  lead  to  the  correct 
behavior  when  the  size  of  the  system  increases. 
The  total  MP4  energy  can  be  decomposed  into 
different  contributions: 


^MP2 

=  Phv  +  PF^ 

(4a) 

(4b) 

^MP4 

^  ^MP3  ^4' 

(4c) 

Pmpa  =  Pmp3  +  Pi  +  PF  +  Pi  +  PF'^^,  (4d) 

where  Pmp4  is  the  MP4  value  of  Pp,  while  P4  is 
the  fourth-order  correction  to  P^,  i.e,,  the  value  to 
be  added  to  lo  obtain  Pmp4‘  Equations  4(a-d) 
permit  to  evaluate  the  relative  importance  of  each 
type  of  excitations  in  the  total  Pj^p4  value.  It  is 
important  to  stress  that  all  these  terms  are  size 
consistent.  The  inclusion  of  the  electron  correlation 
requires  the  definition  of  an  active  space,  i.e.,  a 
space  where  the  electron  substitutions  occur.  In 
our  case,  this  space  is  the  whole  orbital  space 
except  for  the  core  orbitals.  When  truncating 
Moller-Plesset  schemes  in  that  way,  they  are  no 
more  size  consistent  but,  has  shown  in  previous 
studies  [22],  the  computed  NLO  properties  are 
almost  unaffected  by  this  approximation. 


FINITE-FIELD  PROCEDURE 

The  NLO  properties  of  the  various  molecules 
have  been  computed  by  adopting  the  finite-field 
(FF)  method.  This  choice  has  been  dictated  by  the 
lack  of  analytical  techniques  for  computing  the 
(hyper)polarizabilities  within  the  Moller-Plesset 
schemes.  In  particular,  we  have  considered  the 
field  dependence  of  the  energy.  So,  the  finite- 
difference  expressions  of  the  static  (hyper)polariz- 
abilities  are  given  by: 


dWE^) 

Je,=o 


(5a) 


-MP4 


=  6 


HF 


+  eP  -h 


eP  -f-  eF  +  eP  +  ej  -I- 


(3) 


2e(0)  -  6(-2%)  -  ^  ^ 

=  lim  - ~ - ,  (5b) 

Et-o  (2‘Ei.) 
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/3l" 


0,k  ^  _ 


d(2%f 


£,  =  0 


(5c) 


=  lim 

E,->0 


e(-2‘^+^Ei,)  -  2€(-2'‘Ej^) 

+  2e(2*^Ei,)  -  e(2'^''^Ei,) 

2(2%f 


(5d) 


<?^e(2%) 

d(2%)" 


(5e) 


=  lim 

El^O 


£,-0 

e(-2''^'EL)  +  4e(2‘^EL)  -  6e(0) 
+  4e(-2*^Ei.)  -  e(2‘^+iEL) 


(2'=E,r 


(50 


where  the  basic  field  amplitude,  £^,  is  a  chosen 
constant,  k  (/:  =  0, 1, . . . )  defines  the  lowest 
nonzero  field  amplitude  used  in  each  calculation. 
The  field  amplitudes  are  following  a  power  of  two 
pattern  to  permit  the  use  of  the  Romberg  proce- 
dure  (see  below).  The  0  superscript  stands  for  the 
zeroth  iteration  within  the  Romberg  procedure. 
These  equations  are  equivalent  to  those  given  by 
Sim  et  al.  [2],  if  one  takes  into  account  the  different 
definitions  of  /3  and  y  in  Eq.  (2).  Indeed,  Sim  and 
co-workers  took  an  experimentalist's  convention, 
developing  the  energy  as:  e  =  Cq  -  |uiq  ‘  E  -  fa: 
EE  -  fPiEEE - . 

Only  the  energies  corresponding  to  different 
electric  fields  are  necessary  to  use  these  FF  formu¬ 
las;  however,  this  technique  presents  two  major 
drawbacks:  First,  the  energy  has  to  be  known  at 
several  different  electric  fields  for  the  high-order 
hyperpolarizabilities;  second,  the  errors  associated 
with  the  numerical  derivative  formulas  may  not  be 
negligible.  In  practice,  the  electric  fields  at  which 
the  energy  is  computed  have  to  be  small  enough  to 
satisfy  the  ^  0  condition,  to  avoid  SCF  conver¬ 
gence  problems,  and  to  minimize  high-order  hy¬ 
perpolarizability  contaminations  but  large  enough 
to  obtain  a  sufficient  number  of  significant  digits 
in  the  numerical  differentiation  which  is  especially 
critical  for  the  second  hyperpolarizability. 

The  accuracy  of  the  FF  results  is  improved  by 
the  use  of  the  Romberg's  procedure  [23],  which 
removes  the  higher-order  hyperpolarizability  con¬ 
taminations.  Starting  with  the  values  obtained  by 
using  different  k  in  the  Eq.  (5),  one  could  system¬ 
atically  improve  the  hyperpolarizabilities  values 


by  applying  the  iterative  formula  [16]:* 
4,ppp~'^'^  ~~ 


where  p  is  the  number  of  Romberg  iterations.  This 
improves  the  NLO  property  estimates  to  the  preju¬ 
dice  of  additional  computational  costs.  During  our 
calculations,  we  have  chosen  the  following  basic 
field  amplitude:  =  8.10"^  a.u.  (1  a.u.  of  electric 

field  =  5.1422  10^^  V  m"^)  and  k  takes  the  values 
0,  1,  2,  3,  and  4.  This  number  of  electric  fields 
allows  us  to  compute  up  to  and  yl'^.  As 

expected,  is  the  most  difficult  quantity  to  calcu¬ 
late  due  to  the  high  order  of  the  numerical  deriva¬ 
tive  used  in  Eq.  (5);  while  the  values  obtained 
after  a  unique  Romberg's  procedure  are  almost 
clear  of  any  inaccuracies.  In  addition  to  this  careful 
choice  of  the  electric  field  amplitudes,  the  SCF 
convergence  criterion  has  been  tightened  to  10“^^ 
a.u.  in  the  Gaussian94  program  [24],  which  permits 
us  to  meet  a  10"^  a.u.  precision  on  the 
10"^  a.u.  on  Pi,  and  ^  10^  a.u.  on  y^. 


Results  and  Discussion 

Two  conjugated  linkers  [polyacetylene  (PA)  and 
polyyne  (PY)]  and  two  double-^  basis  sets  are 
used  in  this  work:  6-31G  and  6-31G*,  The  6-31G* 
basis  is  biased  toward  the  representation  of  elec¬ 
tron  correlation  rather  than  (hyper)polarizabilities, 
and  generally  it  gives  too  small  values  for  the 
(hyper)polarizabilities.  It  is  difficult  to  predict 
which  basis  set  will  provide  the  more  accurate 
correlated  (hyper)polarizabilities  estimates.  It  was 
shown  in  previous  studies  that  the  addition  of  p 
functions  on  the  hydrogens  of  conjugated 
molecules  is  not  crucial  in  order  to  obtain  accurate 
estimates  of  the  correlated  longitudinal 
(hyper)polarizabilities  values  [18,  22].  This  is  re¬ 
lated  to  the  fact  that  the  NLO  responses  of  these 
systems  are  mainly  due  to  the  7r-electron  delocal¬ 
ization.  For  all  systems,  the  geometry  has  been 
fully  optimized  at  the  Hartree-Fock  and  at  the 
MP2  levels  of  approximation  using  thresholds  cor¬ 
responding  to  the  tight  keyword  in  the  Gaussian94 
program  [24].  For  the  sake  of  simplicity,  we  do  not 
present  here  the  geometrical  parameters  obtained 
by  those  optimizations.  We  refer  to  the  work  of 
Tsunekawa  and  Yamaguchi  [10]  for  a  description 

*  In  the  Eq.  (8)  of  Ref.  [16],  one  should  replace  the  numerator 
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of  the  results  of  the  RHF/6-'31G  (RHF  =  restricted 
HF)  geometries  of  the  doubly  bonded  structures. 

Our  results  are  displayed  in  Tables  I,  11,  and  III 
for  the  static  longitudinal  polarizability,  first  and 
second  hyperpolarizabilities,  respectively.  Figures 
1,  2,  and  3  show  the  ratios  between  the  different 
components  of  the  total  MP4  correction  and  the 
HF  values;  these  results  have  been  obtained  for  the 
RHF/6-31G  optimized  structures  with  the  6-31G 
basis  set. 

POLARIZABILITY 

From  Table  I,  it  is  clear  that,  for  small  chains,  it 
is  mostly  the  size  rather  than  the  nature  of  the 
conjugated  segment  that  determines  the  ampli¬ 
tude.  Indeed,  the  differences  in  values  noticed 
when  using  triple  bonds  instead  of  double  bonds 
is  relatively  small.  Nevertheless,  the  PA-type  linker 
gives  always  the  largest  a^.  is  more  sensitive 
to  the  nature  of  the  conjugated  linker  than  a^p. 
For  small  chains,  this  MP2  correction  is  always 
positive,  i.e.,  it  leads  to  an  increase  of  the 


opposite  being  true  for  longer  chains.  The  relative 
importance  of  the  MP2  correction  de¬ 

creases  with  chain  length  9,  6  and  4%  for  the 
PA  monomer,  dimer,  and  trimer,  respectively),  see 
Figure  1.  However,  it  is  not  clear  that  this  trend 
will  go  on  for  longer  chains.  The  MP3  and  MP4 
corrections  are  smaller  than  the  MP2  one  for  PA 
oligomers;  however,  for  the  PY  dimer  is  the 
largest  component  of  the  total  Moller-Plesset  cor¬ 
rection.  If  is  negative  (and  decreases  with  in¬ 
creasing  chain  length,  as  shown  on  Fig.  1),  the  sign 
and  the  ratio  varies  a  lot  with  the  nature 

and  length  of  the  conjugated  linker.  In  general  the 
Hartree~Fock  and  MP4  results  are  in  quite  good 
agreement,  and,  in  the  case  of  the  PA  trimer,  the 
correction  is  so  negligible  that  it  is  not  interesting 
to  compute  aj^p4  compared  to  a^^p. 

The  ratios  ai^p4/af^P2  found  by  us  are  very 
close  to  the  ones  computed  by  Toto  and  co-workers 
[18]  on  nonsubstituted  PA  chains.  For  the  dimer 
and  trimer  they  found  0.981  and  0.969,  respec¬ 
tively,  while  we  have  0.982  and  0.970  using  the 
6-31G  basis  and  the  RHF/6-31G  optimized  struc- 


TABLE  I 


Static  longitudinal  polarizability  (a.u.)  of  NHg  —  R  — NOg.® 


R 

Geometry 

Basis 

«HF 

“MP2 

O^MPS 

^MP4 

X 

o 

II 

X 

o 

HF/6-31G 

6-31 G 

70.27 

76.09 

76.22 

75.42 

MP2/6-31G 

6-31 G 

75.89 

83.31 

83.23 

81.43 

HF/6-31G 

6-31 G* 

70.10 

76.33 

75.38 

75.66 

MP2/6-31G 

6-31 G* 

75.53 

82.22 

81.94 

81.63 

HF/6-31G* 

6-31 G 

66.54 

71.74 

71.37 

71.44 

MP2/6-31G* 

6-31 G 

71.69 

78.06 

77.91 

77.04 

HP/ 6-31 G* 

6-31 G* 

66.63 

72.29 

71.08 

71.93 

MP2/6-31G* 

6-31 G* 

71.50 

78.22 

77.01 

77.31 

C\J 

X 

o 

II 

X 

o 

HF/6-31G 

6-31 G 

144.45 

152.99 

150.93 

150.24 

MP2/6-31G 

6-31 G 

156.31 

167.00 

164.66 

161.81 

HF/6-31G* 

6-31 G* 

134.81 

144.01 

139.67 

142.52 

MP2/6-31G* 

6-31 G* 

146.88 

158.26 

154.05 

155.53 

CO 

X 

o 

II 

X 

o 

HF/6-31G 

6-31 G 

239.49 

246.74 

235.98 

239.25 

MP2/6-31G 

6-31 G 

263.21 

271 .92 

260.49 

260.32 

HF/6-31G* 

6-31 G* 

221.26 

233.16 

220.36 

MP2/6-31G* 

6-31 G* 

248.79 

263.47 

250.50 

(CH  =  CH)4 

HF/6-31G 

6-31 G 

358.96 

354.49 

c=c 

HF/6-31G 

6-31 G 

67.80 

71.57 

70.69 

72.46 

MP2/6-31G 

6-31 G 

74.60 

78.38 

78.00 

78.99 

HF/6-31G* 

6-31 G* 

65.28 

70.46 

68.53 

71.11 

MP2/6-31G* 

6-31 G* 

71.65 

77.15 

75.38 

77.67 

CVJ 

o 

III 

o 

HF/6-31G 

6-31 G 

136.06 

137.37 

135.68 

141.80 

MP2/6-31G 

6-31 G 

151.14 

150.28 

150.39 

155.23 

o 

III 

o 

OJ 

HF/6-31G 

6-31 G 

221 .89 

217.27 

^1  a.u.  of  polarizability  =  1.6448  x 

J-’. 
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ture.  However,  the  values  of  aj^p2/apjp  are  differ¬ 
ent,  as,  for  nonsubstituted  PA,  the  polarizability 
always  decreases  when  the  electron  correlation  is 
included,  while  for  substituted  chains  it  is  only  the 
case  for  the  tetramer  (and  most  probably  for  longer 
chains).  For  the  dimer,  trimer,  and  tetramer  Toto  et 
al.  [18]  found  0.855,  0.831,  and  0.816,  respectively, 
while  we  have  1.06, 1.03,  and  0.987  using  the  6-31G 
basis  and  the  RHF/6-31G  geometry.  Our  chains 
are  of  course  too  small  to  show  a  convergence 
behavior. 

The  triple  excitations  contribute  largely  to  the 
total  MP4  correction;  and  in  the  case  of  the  trimer, 
a4  is  even  larger  than  a4,  as  shown  by  Figure  1.  So 
the  MP4(SDQ)  values  are  not  more  accurate  than 
the  MP3  ones.  This  conclusion  has  been  reached 
many  times  before  for  conjugated  molecules 
[18,  22], 

When  optimizing  at  the  MP2  level,  the  bond 
length  alternation  (BLA)  decreases,  i.e.,  the  differ¬ 
ence  of  bond  length  between  single  and  double  (or 
triple)  bonds  decreases  and  increases  ( ~  +  7%). 
The  addition  of  d  polarization  functions  on  the 


second-row  atoms  during  the  optimization  process 
leads  to  an  increase  of  the  BLA  and  a  decrease  of 
Ul  ('^  -6%).  In  that  respect,  it  is  striking  to  note 
that  for  NH2 — CH=CH — NO2,  the  difference 
between  the  MP4/6-31G//MP2/6-31G  (finite- 
field  calculation  based  on  MP4  energies  computed 
on  the  MP2  optimized  geometry  with  the  6-31G 
basis  set)  and  MP4/6-31G*//MP2/6-31G*  is 
mostly  due  to  the  variation  of  geometry  and  not  to 
the  addition  of  d  functions  during  the  FF  calcula¬ 
tions. 

FIRST  HYPERPOLARIZABILITY 

Again  for  the  first  hyperpolarizability,  the  MP2 
contribution  to  the  total  MP4  correction  is  by  far 
the  largest,  and  the  ratio  Pmpi/Phf  is  relatively 
stable  for  a  given  conjugated  linker,  showing  only 
small  decrease  with  chain  length.  For  the  PA-like 
chains,  this  ratio  is  2.70,  2.44,  2.26,  and  2.25  for  the 
monomer,  dimer,  trimer,  and  tetramer  at  the 
RHF/6-31G  level  (Fig.  2).  It  is  striking  to  note  that 
for  the  infinite  chain  this  ratio  is  undetermined 


TABLE  II _ 

Static  longitudinal  first  hyperpoiarizabiiity  (a.u.)  of  NHg  —  R  —  NOg.® 


R 

Geometry 

Basis 

PmP2 

PmP3 

PmP4 

X 

0 

11 

X 

0 

HF/6-31G 

6-31 G 

-313.4 

-846.9 

-814.1 

-824.0 

MP2/6-31G 

6-31 G 

-379.2 

-1095.7 

-1017.2 

-1058.4 

HF/6-31G 

6-31 G* 

-246.4 

-650.6 

-600.1 

-643.9 

MP2/6-31G 

6-31 G* 

-308.5 

-826.5 

-769,7 

-848.0 

HF/6-31G* 

6-31 G 

-263.8 

-706.9 

-680.8 

-688.0 

MP2/6-31G* 

6-31 G 

-323.9 

-902.2 

-853.2 

-874.1 

HF/6-31G* 

6-31 G* 

-204.8 

-544.4 

-500.5 

-537.7 

MP2/6-31G* 

6-31 G* 

-256.5 

-699.1 

-634.1 

-689.4 

(CH  =  CH)2 

HF/6-31G 

6-31 G 

-1494.1 

-3649.9 

-3713.5 

-3700.4 

MP2/6-31G 

6-31 G 

-1798.1 

-4639.8 

-4630.9 

-4627.1 

HF/6-31G* 

6-31 G* 

-1025.7 

-2449.0 

-2340.2 

-2495.9 

MP2/6-31G* 

6-31 G* 

-1266.2 

-3139.3 

-3022.1 

-3184.7 

(CH  =  CH)3 

HF/6-31G 

6-31 G 

-4272.8 

-9635.1 

-9710.6 

-9771.8 

MP2/6-31G 

6-31 G 

-5219.2 

-12438.7 

-12513.5 

-12397.7 

HF/6-31G* 

6-31 G* 

-2951.5 

-6558.7 

-6137.6 

MP2/6-31G* 

6-31 G* 

-3811.4 

-8844.8 

-8567.7 

{CH  =  CH)4 

HF/6-31G 

6-31 G 

-8168.6 

-18459.1 

c=c 

HF/6-31G 

6-31 G 

-524.7 

-828.5 

-822.8 

-754.8 

MP2/6-31G 

6-31 G 

-653.6 

-1112.5 

-1083.2 

-988.1 

HF/6-31G* 

6-31 G* 

-341.9 

-528.9 

-500.2 

-490.8 

MP2/6-31G* 

6-31 G* 

-456.4 

-720.5 

-687.9 

-661.3 

OJ 

0 

III 

0 

HF/6-31G 

6-31 G 

-1550.6 

-2436.4 

-2360.0 

-2233.4 

MP2/6-31G 

6-31 G 

-1999.1 

-3311.8 

-3199.5 

-2931.5 

CO 

0 

III 

0 

HF/6-31G 

6-31 G 

-3057.0 

-4651.8 

a.u.  of  first  hyperpolarizability  =  3.2063  x  10  J 
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TABLE  III _ 

Static  longitudinal  second  hyperpolarizability  (10^  a.u.)  of  NH2  —  R  —  N02.^ 


R 

Geometry 

Basis 

Thf 

7mP2 

7mP3 

7mP4 

X 

o 

X 

o 

HF/6-31G 

6-31 G 

742 

1409 

1419 

2116 

MP2/6-31G 

6-31 G 

1030 

2035 

2060 

3207 

HF/6-31G 

6-31 G* 

618 

999 

1014 

1414 

MP2/6-31G 

6-31 G* 

939 

1600 

1640 

2317 

HF/6-31G* 

6-31 G 

765 

1555 

1466 

1988 

MP2/6-31G* 

6-31 G 

866 

1716 

1689 

2429 

HF/6-31G* 

6-31 G* 

668 

1217 

1139 

1484 

MP2/6-31G’" 

6-31 G* 

728 

1248 

1230 

1707 

(CH==CH)2 

HF/6-31G 

6-31 G 

6660 

17369 

18092 

21632 

MP2/6-31G 

6-31 G 

8652 

24123 

24689 
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FIGURE  1.  Ratios  between  the  Moller-Plesset  corrections  and  the  Hartree-Fock  static  polarizability  values  for 
a,cu-nitro,annino  polyene  chains.  These  results  have  been  obtained  with  the  6-31 G  basis  set  on  the  RHF/6-31G 
optimized  structures. 
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FIGURE  2.  Ratios  between  the  Moller-Plesset  corrections  and  the  Hartree-Fock  static  first  hyperpolarizability  values 
for  a,ft>-nitro,annlno  polyene  chains.  These  results  have  been  obtained  with  the  6-31 G  basis  set  on  the  RHF/6-31G 
optimized  structures.  The  P2  /  Phf  has  been  divided  by  a  factor  2  for  the  purpose  of  visibility. 


because  the  PA  has  a  zero  /3^.  For  comparison  the 
ratio  found,  by  a  similar  method,  is  1.62  for  the 
p-nitro-aniline  [2]  and  —4.48  for  the  polymeth- 
ineimine  (PMI)  dimer  [22].  (These  PMI  oligomers 
are  similar  to  PA,  but  half  of  the  carbon  atoms  are 
replaced  by  nitrogen  atoms.)  This  ratio  slightly 
increases  when  the  geometry  optimization  is  car¬ 
ried  out  at  the  MP2  level,  as  a  result  of  the  larger 
electronic  delocalization  along  the  backbone  but  is 
almost  the  same  for  both  basis  sets.  For  PA,  with 
the  6-31 G  basis  set,  we  have  \  \  P4I  but  their 

relative  sign  changes  as  a  function  of  the  geome¬ 
try,  the  basis  set,  as  well  as  the  length  of  the 
conjugated  segment.  However,  with  the  same  ba¬ 
sis  set,  in  PY  chains  the  opposite  is  true:  |  ^3^!  < 
Ij84l.  The  contribution  of  the  triple  excitations  to 
the  total  MP4  correction  is  generally  large:  1/3/1  > 
|jS4l.  Moreover  it  is  striking  to  note  the  relative 
stability  with  chain  length  of  most  of  the  ratios 
drawn  at  Figure  2. 

Note  the  importance  of  the  electron  correlation 
corrections  in  the  ordering  of  the  doubly  versus 
triply  bonded  structures.  Indeed,  for  the  monomer 
and  dimer,  the  j6MP4  values  order  in  the  opposite 
direction  to  ones;  while  the  ordering  given  by 
the  MP2  or  MP3  technique  changes  as  a  function 
of  the  basis  set  and  the  geometry  used.  However, 


for  the  trimer,  it  becomes  clear,  even  at  HF  level, 
that  the  PA  linker  leads  to  larger 

Under  a  geometry  optimization  at  the  MP2  level, 
the  values  increase  significantly  (from  +20  to 
+  40%)  compared  to  the  values  obtained  on  the 
corresponding  HF  optimized  geometry.  The  addi¬ 
tion  of  polarization  functions  on  the  carbon  atoms 
leads  to  a  decrease  of  Pi  (from  -30  to  -40%). 
This  basis  set  effect  can  be  divided  into  two  com¬ 
ponents:  a  direct  one,  coming  from  the  variation  of 
the  computed  property,  and  an  indirect  one,  com¬ 
ing  from  the  change  on  geometry.  Although,  for 
NH2 — CH=CH — NO2  both  components  are  im¬ 
portant  (which  was  not  the  case  for  the  polarizabil¬ 
ity)  and  have  the  same  sign,  the  sum  does  not 
reproduce  exactly  the  full  6-31 G*  results,  showing 
that  they  are  not  purely  additive. 

SECOIVD  HYPERPOLARIZABILITY 

In  the  computation  of  7^,  similarly  to  the 
MP2  correction  is  of  more  importance,  and  gener¬ 
ally,  one  can  notice,  on  Figure  3  that  7^  >  7Hp. 
Moreover,  the  ratio  7mp2/Thf  increases  with  chain 
length,  which  goes  in  opposition  to  the  nonsubsti- 
tuted  PA  chains.  For  the  monomer,  dimer,  trimer, 
and  tetramer  of  nonsubstituted  PA,  To  to  and  co- 
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FIGURE  3.  Ratios  between  the  Moller-Plesset  corrections  and  the  Hartree-Fock  static  second  hyperpolarizability 
values  for  a,a>-nitro,amino  polyene  chains.  These  results  have  been  obtained  with  the  6-31 G  basis  set  on  the 
RHF/6-31G  optimized  structure. 


workers  [18]  found  the  ratio  to  be  3.05,  2.20,  2.00, 
and  1.92,  respectively,  while  we  have  1.90,  2.61, 
2.66,  and  2.71  using  the  6-31G  basis  and  the 
RHF/6-31G  geometry.  For  small  chains  74  has  a 
magnitude  comparable  to  yf  for  the  CH=CH 
fragment,  but  the  relative  importance  of  the 
fourth“Order  correction  decreases  with  increasing 
chain  length.  Indeed,  y^/y^v  decreases  when  shift¬ 
ing  from  the  dimer  to  the  trimer,  while  yf/ynp 
still  increases  (see  Fig.  3).  y^f  changes  of  sign 
according  to  the  linker  studied  but  is  smaller  by 
one  order  of  magnitude  than  both  yf  and  y^.  In 
the  case  of  y^  the  triple  excitations  do  not  con¬ 
tribute  largely  to  the  total  MP4  correction,  and 
thus  the  relatively  cheap  MP4(SDQ)  method  may 
be  accurate. 

The  substituted  PY  monomer  and  dimer  present 
large  yj^  values  in  comparison  to  the  correspond¬ 
ing  substituted  PA  molecules.  This  means  that  the 
largest  polarizability  does  not  systematically  lead 
to  the  largest  second  hyperpolarizability;  although 
for  long  chains  this  is  going  to  be  true.  The  ratio 
7mp2/Thf  (each  being  computed  at  the  corre¬ 
sponding  6-31G  optimized  geometry)  of  the  substi¬ 
tuted  (nonsubstituted)  PY  dimer  is  2.78  (2.27,  see 
Ref.  [17]). 

The  effects  of  the  geometry  and  the  basis  set  on 
yi  are  comparable  to  the  ones  noticed  for  and 
i.e.,  the  MP2  geometry  leads  to  larger  y^  val¬ 
ues;  the  addition  of  polarization  function  to  smaller 


one.  Like  /3^,  the  amplitude  of  yi  depends  on  the 
geometry  and  the  basis  set  used  during  the  FF 
calculations. 


Conclusions  and  Outlook 

We  have  carried  out  Moller-Plesset  calculations 
of  the  static  polarizability,  first  and  second  hyper¬ 
polarizabilities  of  polyacetylene  and  polyyne  link¬ 
ers  end-capped  by  nitro  and  amino  groups.  The 
MP2  correction,  which  leads  to  significant  in¬ 
creases  of  the  hyperpolarizability  estimates,  is  gen¬ 
erally  much  larger  than  the  MP3  and  MP4  ones, 
except  in  the  cases  of  the  second  hyperpolarizabil¬ 
ity  of  small  PA  chains  and  of  the  polarizability  of 
the  PY  dimer.  If  the  absolute  values  of  NLO  prop¬ 
erties  are  very  different  for  substituted  or  nonsub¬ 
stituted  chains,  the  ratios  Pufi/^hf  espe¬ 

cially  Fmp4/^mp2)  quite  close,  especially  for 
medium-size  oligomers.  PA  linkers  always  lead  to 
larger  static  (hyper)polarizabilities  than  the  PY 
chains,  except  in  the  case  of  y^  of  small  oligomers. 
For  and  Pmfi/Phf  decreases  with  increas¬ 
ing  chain  length,  while  for  y^  this  ratio  increases. 

Under  optimization  at  the  MP2  level,  the  bond 
length  alternation  decreases,  and  thus  the 
(hyper)polarizabilities  increase;  while  the  addition 
of  d  polarization  functions  on  the  second-row 
atoms  decreases  the  NLO  estimates. 
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In  our  calculations  we  have  neglected  the  vibra¬ 
tional  contribution  to  the  (hyper)polarizabilities; 
however,  these  contributions  may  be  of  impor¬ 
tance,  as  recently  reviewed  [25].  For  example,  it 
was  shown  that  the  static  vibrational  contributions 
to  is  more  than  twice  the  electronic  contribu¬ 
tions  for  the  small  NO2 — (CH=CH)^ — NH2 
chains  at  the  RHF/6-31G  level  within  the  double 
harmonic  approximation.  To  our  knowledge,  no 
Moller-Plesset  correction  of  the  vibrational  first 
hyperpolarizability  has  been  computed  yet  for  such 
push-pull  systems,  and  it  is  still  impossible  to 
know  how  the  relative  weight  of  the  vibrational 
contributions  will  change  when  electron  correla¬ 
tion  corrections  are  included. 

Calculations  including  the  electron  correlation 
at  the  MP2  level  of  substituted  PA  oligomers  of 
medium  and  large  size  are  now  carried  out  in  our 
laboratory  in  order  to  address  the  saturation 
behavior. 
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ABSTRACT:  Static  electronic  and  vibrational  longitudinal  first  hyperpolarizabilities 
( (0)  and  PiiO))  of  a  series  of  mono-  and  disubstituted  benzenes  were  calculated  at  the 
Hartree-Fock  6-31G  level  by  using  coupled  Hartree-Fock  and  the  double  harmonic 
oscillator  approximations,  respectively.  Although  the  ISi(0)/Pi(0)  ratio  is  slightly  larger 
than  unity  and  rather  constant  with  respect  to  the  substituent(s),  it  turns  out,  for  the 
monosubstituted  compounds,  that  the  variations  of  /3f (0)  upon  substitution  can  be  most 
accounted  for  by  the  mesomeric  effects,  whereas  for  Pi(0),  the  inductive  effects  are  also 
of  importance.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  689-696,  1997 


Introduction 

Quantum  chemical  evaluations  of  the  polariz¬ 
abilities  (a),  first  and  second  hyperpolariz¬ 
abilities  ( jS  and  y),  and  their  interpretation  in 
terms  of  structure-property  relationships  help  in 
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the  design  of  new  compounds  for  nonlinear  optical 
(NLO)  applications  [1].  Recently,  particular  atten¬ 
tion  has  been  given  to  the  vibrational  contributions 
to  these  properties  [2].  In  fact,  although  the  appli¬ 
cation  of  an  external  electric  field  induces  simulta¬ 
neous  distortions  of  the  electron  and  nuclear  mo¬ 
tions,  in  the  clamped  nucleus  or  canonical  treat¬ 
ment,  one  considers  that  the  field  acts  sequentially 
on  the  electron  and  nuclear  motions  [3-4].  This 
leads  to  the  distinction  between  the  electronic  (a^ 
/3^  and  yO  and  vibrational  (a"",  and  y^)  (hy- 
per)polarizabilities.  The  present  study  copes  with 
the  evaluation  of  the  electronic  and  vibrational 
static  longitudinal  first  hyperpolarizabilities  [  131(0) 
and  (0)]  of  a  series  of  mono-  and  disubstituted 
benzenes.  Since  the  first  investigations  have 
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demonstrated  that  can  be  substantial  for  conju¬ 
gated  molecules  in  the  static  limit  [2],  it  can  be 
optimized  to  maximize  the  dc-Pockels  (dc-P)  and 
optical  rectification  (OR)  responses.  At  optical  fre¬ 
quencies,  the  vibrational  component  to  the  second 
harmonic  generation  (SHG)  response  is  negligible. 
A  similar  strategy  has  already  been  followed  for  20 
years  now  for  the  electronic  counterpart  as  re¬ 
viewed  by  Kanis  et  al.  [5]. 

Zerbi  and  co-workers  [6-8]  found  that,  for  many 
second-order  conjugated  molecules,  the  calculated 
static  electronic  and  vibrational  hyperpolarizabili¬ 
ties  are  very  similar.  Using  experimental  data,  they 
concluded  that  is  close  to  unity.  In  the 

latter  case,  the  vibrational  first  hyperpolarizability 
is  evaluated  from  infrared  and  Raman  absorption 
spectra,  whereas  electric  field-induced  SHG 
(EFISHG)  measurements  give  the  electronic  coun¬ 
terpart  which — to  be  correct — would  have  been 
required  to  be  extrapolated  to  zero  frequency. 
Consequently,  they  argued  that  such  a  coincidence 
is  not  fortuitous  and  that  and  (5^  are  just  two 
manifestations  of  the  same  underlying  physical 
phenomenon.  They  made  the  connection  with  the 
strong  electron-phonon  coupling  along  the  coordi¬ 
nate  which  is  associated  with  the  variation  of  the 
bond-length  alternation  (BLA).  A  first  quantitative 
demonstration  of  their  assumption  was  recently 
elaborated  on  by  Castiglioni  et  al.  [9].  However,  in 
addition  to  its  restriction  to  the  two-state  approxi¬ 
mation  for  computing  whereas  /3^  is  evalu¬ 
ated  at  the  double  harmonic  oscillator  approxima¬ 
tion  by  neglecting  all  the  modes  but  the  BLA 
mode,  their  approach  neglects  an  important  contri¬ 
bution  [10].  The  rationalization  of  the  ratio 

value  is  even  made  more  difficult  by  a  few  recent 
results  concerning  push-pull  conjugated  materials 
which  demonstrate  the  dependence  of  the 
ratio  upon  both  the  chain  length  and  the  nature  of 
the  linker  [2]. 

This  lack  of  satisfactory  quantitative  justifica¬ 
tion  for  the  relationship  between  and  in 
these  conjugated  systems  makes  the  subject  very 
attractive.  In  the  present  study,  the  relationship 
between  and  [particularly  between  ^^(0) 
and  I3i(0)]  is  tackled  from  another  side,  i.e.,  the 
P[(0)  and  Pi(0)  quantities  are  analyzed  in  terms  of 
both  the  inductive  and  the  mesomeric  substituent 
constants.  Then,  these  relationships  to  simple 
physical  concepts  are  compared.  A  similar  ap¬ 
proach  was  followed  by  Hehre  et  al.  [11]  to  study 
the  charge  distributions  and  nuclear  magnetic 
shifts  in  a  series  of  monosubstituted  benzenes  and 


in  m-  and  p-substituted  fluorobenzenes.  In  addi¬ 
tion,  a  few  investigations  of  second-order  NLO 
materials  have  demonstrated  that  a  satisfactory 
linear  dependence  relates  (3^  to  the  parameters 
(see  below  for  their  description).  These  experimen¬ 
tal  or  theoretical  investigations  of  mono-  and  di- 
substituted  benzenes  [12],  stilbenes  [12,  13], 
polyenes  [14],  and  polymethineimines  [14]  showed 
that  this  approach  could  be  used  by  the  organic 
chemists  to  elaborate  new  promising  materials  for 
NLO  and  to  predict  their  properties  from  the  na¬ 
ture  and  positions  of  the  donors  and  acceptors. 
This  work  is  organized  as  follows:  The  next  section 
briefly  summarizes  the  methodological  and  com¬ 
putational  aspects,  the  third  section  presents  the 
results  and  elaborates  an  interpretation,  whereas 
we  conclude  in  the  fourth  section. 


Methodological  and  Computational 
Aspects 

The  /3^(0)  values  were  evaluated  at  the  coupled 
Hartree-Fock  (CHF)  [15]  level  with  the  split-va¬ 
lence  6-3 IG  basis  set  [16].  This  approach,  which  is 
equivalent  to  a  Hartree-Fock  finite-field  proce¬ 
dure,  takes  explicitly  into  account  the  field-in¬ 
duced  electron  reorganizational  effects.  On  the 
other  hand,  the  vibrational  counterpart,  (0),  is 
obtained  at  the  RHF/6-31G  level  within  the  dou¬ 
ble  harmonic  oscillator  approximation  [17-19]  and 
reads 


(0)  =  [  =313 

a 


(1) 


where  0, 0  indicates  that  no  electrical  or  mechani¬ 
cal  anharmonicity  is  included.  The  sum  runs  over 
all  the  vibrational  normal  modes  of  frequency 
The  idF^/dQ^)^  with  a^,  which  are  the 

derivatives  of  the  property  with  respect  to  the 
normal  coordinate  Q^,  are  evaluated  at  the  equilib¬ 
rium  geometry. 

For  optical  processes,  both  arid  are  mod¬ 
ulated,  although  in  very  different  ways.  Indeed, 
the  frequency  dispersion  of  f3l  has  been  shown  to 
follow  the  relation  [20] 

p[(  —  0)^;  0)^,  0)2)  =  jS^(O) [1  -F  A (ol  +  B(ol  +  “*  ]  / 

(2) 
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where  o)^  +  0)2  and  +  coj  +  <w|.  A 

and  B  are  parameters  depending  upon  the 
molecule,  but  only  B  depends  upon  the  NLO  pro¬ 
cess.  On  the  other  hand,  by  adopting  the  enhanced 
[21]  or  infinite-frequency  [22]  approximation  which 
relies  on  the  fact  that  the  optical  frequencies  are  at 
least  one  order  of  magnitude  larger  than  the  vibra¬ 
tional  ones,  one  obtains 

(  (x))  <y,0)a>->oo  ^^{0/  (x)f  Ci))a}->oo 

(3) 

(-2w;  oj,  O})(o^oo  =  0.  (4) 

In  other  words,  for  SHG,  is  negligible,  whereas 
for  the  dc-P  and  OR  effects,  it  amounts  to  1/3  of 
the  static  quantity.  This  point  is  obviously  of  im¬ 
portance  for  optimizing  the  responses  according  to 
the  NLO  process. 

All  the  (3  calculations  were  performed  with  the 
Gaussian94  program  [23]  on  the  RHF/6-31G-opti- 
mized  structures.  A  tight  convergence  threshold 
on  the  residual  forces  was  adopted  to  meet  a 
satisfactory  accuracy  in  computing  the  i^Pi/^Qa)o 
values.  The  6-31 G  atomic  basis  set  was  employed 
because  it  is  known  to  be  reasonably  accurate  for 
evaluating  the  hyperpolarizabilities  of  push-pull 
conjugated  systems  [5].  Addressing  electron  corre¬ 
lation  corrections  is  left  for  future  investigations. 
For  these  second-order  NLO  materials,  a  few  cal¬ 
culations  have  already  revealed  that  the  inclusion 
of  electron  correlation  effects  leads  to  a  15-50% 
increase  in  whereas,  to  our  knowledge,  it  has 
not  yet  been  addressed  for  jS/.  Further  computa¬ 
tional  details  on  these  calculations  can  be  found  in 
[24].  The  molecules  have  been  oriented  such  that 
the  Cl  and  C4  atoms  belong  to  the  z-  or  longitudi¬ 
nal  axis.  Since  our  purpose  consists  of  relating  /3  ^ 
and  to  the  substituent  constants,  only  the 
=  (3^  tensor  component  was  analyzed.  It  is  strik¬ 
ing  to  note  that  for  weak  donor  or  acceptor  groups 
is  not  necessarily  the  dominant  tensor  compo¬ 
nent,  whereas  in  the  case  of  the  resonant  D/A 
pairs,  Pi  is  always  dominant. 


Results  and  Discussion 

The  jjbi,  PICS),  and  P/O)  values  are  listed  in 
Tables  I  and  II  for  the  mono-  and  disubstituted 
benzenes,  respectively,  whereas  Table  III  gives  the 
various  a  parameters  employed  in  this  study.  As 
shown  in  the  previous  investigations  [12-14],  dif¬ 
ferent  types  of  or  parameters  have  proven  to  be 


TABLE  I _ 

RHF/6-31G  longitudinal  component  of  the  dipole 
moment  and  the  static  electronic  and  vibrational 
first  hyperpolarizability  as  a  function  of  the 
substituent  in  monosubstituted  benzenes. 


R 

B'l 

PI 

PI 

OCH3 

0.044 

126.1 

96.8 

OH 

-0.225 

143.4 

43.1 

N(CH3)2 

0.688 

297.4 

180.8 

NH2 

0.573 

259.2 

112.5 

F 

-0.965 

108.3 

-59.1 

CH3 

0.130 

44.8 

23.4 

CH2CH3 

0.116 

-12.4 

8.0 

CHCH2 

-0.059 

-37.5 

-61.1 

CCH 

-0.264 

-23.9 

40.7 

SH 

-0.752 

28.5 

16.1 

Cl 

-1.076 

134.3 

-210.1 

SIH3 

-0.363 

-119.1 

-236.9 

CN 

-1.914 

-56.1 

-56.0 

CF3 

-1.606 

35.0 

-205.3 

CHO 

- 1 .262 

-197.3 

-358.8 

COCH3 

-0.939 

-197.7 

-229.2 

CO2H 

-0.745 

-205.5 

-308.1 

NO2 

-2.294 

-269.7 

-311.3 

All  the  values  are  given  in  au.  (1.0  au  of  dipole  moment  = 
8.4783610"®°  cm  =  2.541765  Debye;  1.0  au  of  first  hyper¬ 
polarizability  =  3.2063  x  10"^®  C®  m®  J" ®  =  8.641  X 
10"®®  esu). 


adequate  for  reproducing  the  P^  variations.  Most 
often,  the  para  substituent  constants  ,  a/) 

are  employed.  The  empirical  cr^  parameters  were 
originally  related  by  Hammett  to  the  variation  of 
the  dissociation  constant  of  p-substituted  acids, 
the  benzoic  acids  in  water  at  25°C  being  the  refer- 

TABLE  II _ 

RHF  /  6-31 G  longitudinal  component  of  the  dipole 
moment  and  the  static  electronic  and  vibrational 
first  hyperpolarizability  as  a  function  of  the 
substituent  in  disubstituted  benzenes 
(all  values  in  au). 


R,R' 

B'L 

PI 

PI 

CHO,  CH3 

1.478 

366.6 

518.1 

CHO,  F 

0.282 

335.0 

344.6 

CHO,  OH 

1.146 

517.7 

607.8 

CHO,  NH2 

2.124 

928.0 

1014.8 

NO2,  CH3 

2.548 

500.7 

492.3 

NO2,  F 

1.329 

408.9 

298.5 

NO2,  OH 

2.222 

672.4 

704.6 

NO2,  NH2 

3.230 

1262.8 

1351.6 
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TABLE  III _ 

The  inductive  (<t,)  and  mesomeric 
substituent  constants. 


R 

0CH3 

0.29 

-0.44 

OH 

0.25 

-0.41 

-1.17 

N(CH3)2 

0.12 

-0.53 

NH2 

0.11 

-0.48 

-1.41 

F 

0.51 

-0.33 

-0.58 

CH3 

-0.03 

-0.10 

-0.28 

CH2CH3 

-0.02 

-0.12 

CHCH2 

0.06 

-0.07 

CCH 

0.20 

0.03 

SH 

0.29 

-0.20 

Cl 

0.48 

-0.21 

SiHg 

0.05 

0.09 

CN 

0.57 

0.08 

CF3 

0.42 

0.07 

CHO 

0.25 

0.27 

0.79 

COCH3 

0.26 

0.15 

COjH 

0.32 

0.29 

NO2 

0.68 

0.19 

0.56 

The  (T^  are  the  unbiased  values  associated  with  nonconju- 
gated  reaction  centers,  whereas  the  are  the  dual 

values  which  express  the  enhanced  mesomeric  interaction 
of  a  donor  (a^)  with  an  acceptor  or  of  an  acceptor 
with  a  donor.  Values  taken  from  [25]. 

ence  [25].  Obviously,  these  parameters  involve 
both  inductive  and  mesomeric  effects,  whereas  the 
latter  ones  are  overestimated  for  medium/strong 
donor  substituents  because  they  interact  reso¬ 
nantly  with  the  acceptor  acid  group.  This  is  why 
unbiased  (a^^)  and  dual  (a^ ,  <j“)  parameters  are 
required  to  express  the  extreme  behaviors.  Indeed, 
on  the  one  hand,  the  parameters  characterize 
substituents  which  do  not  interact  with  the  reac¬ 
tive  center,  whereas,  on  the  other  hand,  the 
(cr~)  parameters  include  the  enhancement  of  the 
donor  (acceptor)  character  due  to  its  resonant  in¬ 
teraction  with  an  acceptor  (donor)  substituent  in 
the  para  position  [25].  A  further  refinement  of  this 
parametrization  is  accomplished  by  the  splitting  of 
the  different  into  mesomeric  or  resonant  (aj^) 
and  inductive  or  electrostatic  (aj)  parts.  The  aj  do 
not  depend  upon  the  presence  or  not  of  other 
interacting  substituents,  whereas  there  exist  unbi¬ 
ased  and  dual  cr^  constants  [25].  In  our  case,  the 
distinction  between  the  ctj  and  cr^  effects  is  of 
importance.  We  have  considered  the  unbiased 
values  for  the  monosubstituted  benzenes,  whereas 
for  the  disubstituted  constants,  the  dual 
parameters  have  been  employed.  Two  important 


comments  are  needed  before  deducing  cr-relations 
and  comparing  these  with  previous  studies.  First, 
as  mentioned  by  Exner  [25],  there  are  errors  in  the 
substituent  constants  which  are  not  always  un¬ 
equivocally  determined.  These  errors  originate 
from  both  the  experimental  data  and  their  subse¬ 
quent  theoretical  manipulations  such  as  these  in 
deriving  the  constants.  Second,  the  cr^^“  values 
are  often  considered  to  be  equivalent  to 
values  because  the  resonant  effect  is  dominant, 
neglecting  the  importance  of  the  inductive  effects 
such  as  in  the  case  of  the  NO2  group. 

Figure  1  shows  that  the  ratio  is 

rather  constant  with  respect  to  the  nature  and 
number  of  the  substituents,  the  deviation  being 
associated  mainly  with  the  monosubstituted  com¬ 
pounds.  The  slightly  larger  amplitude  of  13^(0) 
with  respect  to  ^3^(0)  was  first  analyzed  in  terms  of 
their  dependence  upon  (Figs.  2  and  3).  The 
least-squares  fittings, 

P[(0)  =  -51.5  -  5725a^  R  -  0.93  (5) 

13^(0)  -  -134.3  -  524.60-^^  R  =  0.84,  (6) 

show  that  the  variations  in  ^j^(O)  can  be  better 
accounted  for  by  the  mesomeric  effects  than  can 
the  corresponding  variations  in  jS/'(0).  Indeed,  the 
correlation  coefficient  is  closer  to  unity  for  P[(0) 
and  the  constant  term  of  the  line,  which  obviously 
should  be  equal  to  zero,  is  smaller.  The  removal  of 
this  constant  term  leads  to  poorer  fittings  of  which 
the  correlation  coefficient  is  equal  to  0.88  and  0.28 
for  (31(0)  and  respectively.  The  correspond¬ 

ing  standard  deviation  of  the  errors  is  equal  to  66 
and  136  au.  This  good  description  of  the  I3[(Q) 
variations  in  terms  of  is  in  agreement  with  the 
fact  that  the  charge-transfer  contribution  to 
I3[(0)  is  known  to  be  by  far  the  largest  component, 
the  remaining  term  ( 13^^^)  being  associated  to  the 
inductive  effects.  The  largest  deviations  of  the  cal¬ 
culated  [3[(0)  from  the  linear  fitting  estimates  con¬ 
cern  the  CF3  and  NO2  groups,  i.e.,  substituents 
with  small  with  respect  to  aj.  This  dominance 
of  is  even  more  pronounced  for  the  D/A- 
disubstituted  compounds  [26-27].  In  what  con¬ 
cerns  /3l(0),  the  CH3,  CCH,  Cl,  and  CN  sub¬ 
stituents  present  the  largest  deviations  from  the 
linear  fitting.  In  a  similar  way,  for  the  least- 
squares  fitting  to  is  poor: 

=  -0.80  -  2.040-^^  R  =  0.64.  (7) 
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FIGURE  1.  Plot  of  the  RHF/ 6-31 G  jS/"(0)vs.  )8®(0)  for  the  mono- and  disubstituted  benzenes. 
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FIGURE  3.  RHF  /  6-31 G  double  harmonic  Pl^ (0)  values  as  a  function  of  a^. 


With  the  relation  to  being  insufficient  in  sev¬ 
eral  instances,  the  role  of  needs  to  be  assessed. 
To  make  it,  one  can  assume  that  any  property,  P, 
depends  linearly  upon  both  o-j  and  cr^  according 
to 

P  =  PlO-l  +  PR(fR  ■  (8) 

The  Pi  and  pj^  factors  which  depend  upon  the 
investigated  property  are  determined  by  a  least- 
squares  fitting  procedure.  One  then  obtains 

PliQ)  =  -127.5(7;  -  530.7a-;®  (9) 

j^L^CO)  =  -376.1a;  -  424.1a;®  (10) 

pLi^  =  -2.88a;  -  1.56a;®.  (11) 

These  results  support  our  first  idea  that  the  induc¬ 
tive  effects  have  a  larger  impact  on  P^dS)  than  on 
PldS),  whereas  they  are  dominant  for  pui.  They  also 
indicate  that  for  such  monosubstituted  benzenes 
PldS)  and  Pl{d)  are  not  determined  by  the  same 
underlying  phenomena.  The  standard  deviation  of 
the  errors  which  measures  the  goodness  of  these 
fittings  is  equal  to  61,  87,  and  0.27  au  for  Pliff), 
Pi(0),  and  piif  respectively,  showing  a  clear  im¬ 
provement  over  the  simple  linear  fitting  for  Pi(0) 
and  pbi^,  whereas  for  j8/(0),  both  fittings  are  of 
equivalent  quality.  Most  of  the  significant  devia¬ 
tions  from  Eqs.  (9)-(ll)  can  be  attributed  to  the 
particular  nature  of  the  substituents.  For  the  CHO 


and  SiH3  groups,  the  Pi(0)  values  are  poorly 
reproduced  as  could  be  understood  from  their 
large  intrinsic  contributions.  For  Cl  but  also  SiH3, 
a  part  of  the  deviation  can  certainly  be  attributed 
to  the  lack  of  d-functions  on  the  Cl  and  Si  atoms. 
Moreover,  the  ^8^(0)  estimates  for  the  cyano-  and 
acetylenobenzenes  are  of  relatively  poorer  quality 
than  for  the  other  substituents.  In  the  case  of  p[(0X 
the  estimates  for  the  CF3-  and  CH2CH3-sub- 
stituted  benzenes  are  particularly  bad  due  to  both 
a  large  deviation  and  a  wrong  sign.  Obviously, 
improvements  of  these  fitting  relations  would  be 
obtained  by  adopting  higher-level  quantum  chem¬ 
ical  methods,  i.e,,  including  electron  correlation 
corrections  and  using  a  larger  atomic  basis  set. 

Nevertheless,  these  global  double  linear  rela¬ 
tionships  deserve  to  be  considered  in  the  design  of 
NLO  materials  and  particularly  for  their  help  in 
determining  the  factors  which  influence  the  vibra¬ 
tional  and  electronic  first  hyperpolarizabilities.  In¬ 
deed,  there  was  no  a  priori  reason  to  expect  a 
linear  relation  between  the  PiiOYs  and  the  sub¬ 
stituent  constants  which  are  related  to  variations 
of  chemical  reaction  constants. 

Then,  an  interesting  point  consists  of  estimating 
from  Eqs.  (9)-(ll)  the  jSf(O),  PiiO),  and  pi  values 
of  the  substituted  compounds  and  comparing  these 
to  the  calculated  ones.  For  the  disubstituted  ben¬ 
zenes,  the  substituent  effect  is  represented  by  the 
difference  of  their  individual  constants,  the  coop- 
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erative  or  resonant  effects  being  accounted  for  by 
taking  in  the  place  of  One  can  directly 
see  inconsistencies  between  these  estimates  and 
the  calculations.  In  particular,  the  j8^(0)'s  of  the 
NH2“Substituted  compounds  are  much  larger  than 
these  of  the  parent  OH-substituted  systems  al¬ 
though  the  corresponding  changes  in  are 

rather  small.  In  addition,  for  the  p[{0)  of  the  CH3- 
and  F-substituted  systems,  these  estimated  values 
overestimate  the  calculated  ones.  It  is  our  feeling 
that  these  prediction  errors  are  related  to  the  diffi¬ 
culty  of  knowing  Indeed,  there  is  no  reason 
why  these  values  would  be  the  same  for  the 
solvolysis  of  cumyl  chlorides  in  90%  aqueous  ace¬ 
tone  and  for  ^  quantities.  In  other  words,  contrary 
to  the  unbiased  constants,  the  dual  parameters  can 
be  easily  considered  dependent  upon  the  strength 
of  the  resonantly  interacting  group.  In  this  regard, 
it  seems  obvious  that  the  NH2  group  resonantly 
interacts  much  more  strongly  with  the  acceptor 
group  than  does  the  OH  group. 

The  nonnegligible  deviations  from  the  linear 
relation  obtained  by  Ulman  [13]  and  the  interpre¬ 
tation  given  by  Cheng  et  al.  [12]  of  the  cr-depen- 
dence  of  the  EFISHG  values  demonstrate  also 
that  the  dual  parameters  are  not  determined  in  an 
absolute  way.  Indeed,  Cheng  et  al.  obtained  linear 
relations  with  different  slopes  for  the  para-sub¬ 
stituted  nitro-  and  methoxybenzenes.  On  the  other 
hand,  Jacquemin  et  al.  [14]  obtained  a  very  good 
correlation  constant  for  polyenes  and  polyme- 
thineimines  substituted  by  strong  or  weak  donors 
and  acceptors.  However,  it  is  striking  to  note  that, 
in  the  latter,  the  same  standard  geometry  was 
adopted  for  the  linker  in  order  to  address  the 
electronic  effects  of  the  substituents,  neglecting, 
therefore,  the  consequences  on  the  linker  geome¬ 
try.  Accordingly,  the  NO2/NH2  pair  significantly 
transforms  the  aromatic  benzene  ring  into  a  par¬ 
tially  quinonoid  structure  [24]. 


Synopsis  and  Outlooks 

This  RHF/6-31G  study  points  out  that  the 
I3ii0)/P[(0)  ratio  for  mono-  and  disubstituted  ben¬ 
zenes  is  rather  constant  with  respect  to  the  nature 
of  the  substituent(s)  and  is  slightly  larger  than 
unity.  With  the  lack  of  electron  correlation  and 
anharmonicity  corrections  preventing  us  from 
making  a  better  estimate,  these  results  are  in 
agreement  with  the  observation  of  Zerbi  and  co¬ 


workers  [6-8]  who  found  experimentally  and  the¬ 
oretically  that  PiiO)  and  (Sli  —  lo);  o),  o))  are  of  the 
same  amplitude.  However,  the  analysis  of  the  de¬ 
pendence  of  Pi(0)  and  131(0)  upon  the  mesomeric 
and  inductive  substituent  constants  in  the  mono- 
substituted  benzenes  exhibits  different  behaviors. 
^[(0)  can  mostly  be  described  by  the  unbiased 
mesomeric  constants  (o-^),  whereas  its  vibrational 
counterpart  also  depends  upon  the  inductive  con¬ 
stants  (cTj).  Moreover,  the  intrinsic  contribution  of 
the  substituent  is  generally  larger  for  than 

for  (31(0). 

Nevertheless,  it  is  important  to  note  that,  at 
optical  frequencies,  this  Pi(0)/Pl(0)  ratio  de¬ 
creases  to  about  0.3  as  a  consequence  of  both  the 
dispersion-induced  increase  of  I3[  and  the  de¬ 
crease  of  Pi  by  a  factor  of  3  as  determined  in  the 
infinite-frequency  double  harmonic  oscillator  level 
of  approximation.  On  the  grounds  of  the  recent 
results  concerning  a,  (o  nitroamino  polyenes,  poly- 
ynes,  and  polythiophenes  which  show  that  the 
P^(0)/I3l(0)  ratio  ranges  between  1.1  and  2.8  ac¬ 
cording  to  the  length  and  nature  of  the  conjugated 
linker,  further  investigations  of  these  substituent 
effects  would  deserve  to  be  carried  out  to  address, 
from  the  chemical  point  of  view,  the  value  of  the 
I3l!(0)/Pl(0)  ratio. 
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ABSTRACT:  Explicit  expressions  of  polarizability  and  hyperpolarizability  are 
formulated  by  a  path  integral  method  in  terms  of  the  generalized  coherent  states  of  the 
Thouless  parametrization.  A  novel  scheme  is  shown  for  calculating  the  molecular 
properties  including  total  energy,  and  the  Monte  Carlo  method  using  the  metropolis 
algorithm  is  utilized  for  performing  the  actual  calculations.  As  a  simple  example,  the 
hydrogen  molecule  is  examined  to  demonstrate  performances  of  the  present  scheme.  We 
discuss  improvement  of  the  accuracy  of  the  (hyper)polarizabilities  depending  on  the 
number  of  the  division  for  the  partition  function  and  also  the  behaviors  of  the 
(hyper)polarizability  for  the  external  magnetic  field.  We  compare  the  present  results  with 
those  obtained  by  the  conventional  molecular  orbital  theory.  ©  1997  John  Wiley  &  Sons, 
Inc.  Int  J  Quant  Chem  65:  697-707,  1997 


Introduction 

In  the  context  of  quantum  chemical  calculations 
of  the  static  (hyper)polarizabilities,  we  have 
two  main  approaches,  i.e,,  the  perturbation  tech¬ 
niques  and  the  derivative  techniques.  In  the  per- 

Correspondence  to:  H.  Nagao. 

*  Present  address:  Institute  for  Molecular  Science,  Myodaiji, 
Okazaki  444,  Japan. 


turbation  techniques,  the  perturbation  energy  is 
usually  expressed  as  the  summation  over  elec¬ 
tronic  states  (sum  over  states  [SOS]).  The  hyperpo¬ 
larizability  has  been  calculated  by  the  perturbative 
approach  with  various  approximate  models  [1-5]. 
The  most  useful  advantage  of  the  SOS  technique  in 
terms  of  the  analysis  of  the  nonlinear  optical  re¬ 
sponse  is  that  it  allows  one  to  obtain  a  physical 
picture  as  to  which  excited  states  play  a  significant 
role  in  the  response.  On  the  other  hand,  the  deriva¬ 
tive  techniques  rely  on  the  fact  that  the  static 
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(hyper)polarizability  tensor  components  can  be  ob- 
tained  as  first-  (second-,  third-)  order  derivatives 
of  a  given  dipole  moment  component  with  respect 
to  the  electric  field.  Those  derivatives  can  be  eval¬ 
uated  numerically  or  analytically  within  the  cou¬ 
pled  Hartree-Fock  scheme.  The  finite-field  (FF) 
approach  is  used  for  the  numerical  procedure, 
originally  proposed  by  Cohen  and  Roothaan  [6], 
where  the  molecular  Hamiltonian  explicitly  in¬ 
cludes  a  finite  electric-field-dependent  term.  Sev¬ 
eral  Hartree-Fock  self-consistent-field  or  post- 
Hartree-Fock  calculations  need  to  be  performed 
for  different  numerical  values  of  the  electric  field. 
The  total  energy  or  dipole  moment  as  a  function 
of  the  different  field  values  is  then  differentiated 
numerically. 

Many  calculations  of  static  (hyper)polarizabili- 
ties  of  several  interesting  materials  have  been  re¬ 
ported.  These  results  suggest  that  it  is  important  to 
calculate  them  considering  the  effect  of  electron 
correlation  using  sufficiently  large  basis  sets  [7].  In 
particular,  the  second  hyperpolarizability  value,  y, 
is  very  dependent  on  these  factors.  However,  an 
accurate  and  practical  computational  method  for 
determining  the  sensitive  physical  properties  to 
the  electron  correlation  of  quantum  many-body 
systems  is  still  under  study. 

In  the  last  decades,  there  have  been  several 
attempts  to  apply  the  path  integral  method  to 
calculating  physical  properties,  with  success  for 
simple  systems.  The  path  integral  was  first  intro¬ 
duced  by  Feynman  [8-10].  The  theory  has  been 
successfully  applied  to  many  problems  [11-15]  in 
quantum  mechanics.  Because  the  path  integral  is 
expressed  as  multidimensional  integrals  with  a 
huge  number  of  variables,  we  cannot  directly  com¬ 
pute  this  integration  by  ordinary  ways  of  numeri¬ 
cal  integrations.  Therefore,  a  statistical  method 
such  as  the  Monte  Carlo  (MC)  method  [16-22]  is 
attempted  for  the  integrations.  The  standard  ap¬ 
proach  to  the  Monte  Carlo  simulation  of  field 
theory  has  been  established  through  Feynman's 
path  integral  formulation.  It  is  necessary  to  work 
at  an  imaginary  time  so  that  the  integrands  of  the 
path  integrals  do  not  oscillate.  It  is  also  necessary 
to  work  with  systems  having  a  finite  extent  in  both 
space  and  imaginary  time.  This  is  equivalent  to 
finite  temperature  statistical  mechanics. 

Recently,  we  also  applied  the  path  integral 
method  to  the  many-electron  systems  in  quantum 
chemistry  [23-27].  The  path  integral  formulation 
of  the  partition  function  was  presented  in  terms  of 
the  LCAO  coefficients  or  the  Thouless  parameteri¬ 


zation.  We  presented  numerical  calculations 
[28-31]  of  the  ensemble  average  of  the  total  energy 
and  the  magnetization  by  means  of  the  MC 
method.  We  already  reported  briefly  the  numerical 
results  of  the  calculations  of  the  (hyper)polarizabil- 
ity  for  a  H3  system  [32]. 

In  this  article,  we  show  dependencies  of  the 
(hyper)polarizability  on  the  external  magnetic  field 
for  a  hydrogen  molecule  using  our  path  integral 
expressed  by  the  generalized  coherent  state,  where 
our  model  Hamiltonian  includes  the  ab  initio 
Hamiltonian  and  the  electric-  and  magnetic-field- 
dependent  terms.  We  also  discuss  relations  be¬ 
tween  (hyper)polarizability  and  spin  states. 


Theory 


To  apply  a  path  integral  method  to  molecular 
systems,  the  path  integral  formulation  of  the  parti¬ 
tion  function  is  derived  by  using  the  generalized 
coherent  states.  The  Slater  determinant  |Z)  is  writ¬ 
ten  in  terms  of  complex  coefficients  as 


|Z>  -  detd  +  Z^Z)" 


X  exp 


(r,p  —  a,(B  p  —  1  h  =  l 


\<f>) 


(1) 


=  detd  +  Z+Z) 
K  / 

X  n  n 

p—a,  ^  h  —  \ 


M-N„ 


a=a,p  p=\ 


E 


|o>, 

(2) 


where  |<^>  is  a  Slater  determinant  composed  of  N 
orbitals  (hole-orbitals),  means  the  number  of 
hole-orbitals  with  a  spin  a,  and  N  is  +  N^.  M 
is  the  number  of  the  base.  The  p  labels  denote  the 
particle  orbitals  that  are  orthogonal  to  the  hole 
orbitals  h,  and  Z  is  a  matrix  whose  elements  are 
Z^jf.  («,>)  is  the  creation  (annihilation)  operator 
for  the  z-th  orbital.  These  operators  also  obey  the 
anticommutation  relation. 

The  partition  function  H  is  written  as 

H  =  Tr[e”^^].  (3) 

The  symbol  Tr  indicates  a  trace  operation,  and 
13  =  (fcgT)"^  with  the  Boltzmann  constant  and 
a  temperature  of  T  Kelvin. 

A  path  integral  representation  of  the  partition 
function  in  terms  of  the  generalized  coherent  states 
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yields 

H  =  /  n  d^(Z,)<Z,|Z,_i> 


X  exp 


1=1 


with 


where 


|Zo>  = 


,  (4) 


(5) 


hfj  =  f  drcpUr) 


72  K 


2m  .t'l  Ir  -  RJ 


•P/aW/ 


(12) 


=  j  dr  dt'  (pf,(r)^fp(r')i^:471i‘P'tp(r')'P/aW/ 

(13) 


and 


ixfj  =  j  dr(pf^it)iJ,(pj^ir).  (14) 


M-N,  Np 

dHz)  =  cu  n  n 

ap  p  =  l  h  =  l 


dz;f;  dz;g 

liri 


xldetd  +  Z+Z)l'^  (6) 


and 


H(Z*,Z,_i)  =  <ZjH|Z,_i>.  (7) 


r  =  jS/n.  n  is  the  number  of  the  divisions,  which  is 
used  as  the  parameter  of  the  actual  calculation  of 
physical  properties.  From  expression  (4),  we  will 
derive  an  expression  of  the  ensemble  average  of  an 
operator  such  as  the  total  energy  or  the  dipole 
moment. 


MODEL  HAMILTONIAN 

To  apply  the  path  integral  expressed  by  the 
generalized  coherent  states  to  the  quantum  chem¬ 
istry,  we  consider  the  ab  initio  Hamiltonian  in¬ 
cluding  electric-  and  magnetic-field  dependent 
terms  for  molecular  systems.  In  the  Born- 
Oppenheimer  approximation,  the  Hamiltonian  is 
written  as 

H  =  Ho  +  H,  +  Hu,  (8) 

where 


In  Eq.  (9),  the  first  term  represents  the  kinetic 
and  the  nuclear-electron  interaction  part  and  the 
second  one  represents  the  electronic  interaction 
part.  Hj  and  Hjj  are  the  electric-field-dependent 
term  and  the  Zeeman  Hamiltonian,  respectively,  fx 
is  the  dipole  moment,  and  F  indicates  the  electric 
field,  ixq  and  indicate  the  magnetic  moment 
and  the  static  external  magnetic  field,  respectively. 
Here,  we  introduce  an  orthogonal  basis  set  {<p/(r)} 
such  as  a  Hartree-Fock  (HF)  molecular  orbital 
(MO).  Therefore,  becomes  the  creation 

(annihilation)  operator  for  the  z-th  MO.  indi¬ 
cates  a  z-component  of  the  spin  angular  momen¬ 
tum  at  the  f-th  site.  In  Eq.  ^3),  is  a  spatial 
coordinate  of  the  a-th  nucleus,  and  the  number 
of  the  nucleus.  Z^  indicates  the  positive  electron 
charge  of  the  a-ih  nucleus. 

We  derive  the  matrix  elements  (Zj^lHlZ^.^)  of 
the  Hamiltonian  in  Eq.  (8): 

H(Z*,Z,_i)  =  Ho(Z*,Z,_P  +  H,(Zt,Z,_P 

+  H,j{Z*,Z^_P.  (15) 


ENSEMBLE  AVERAGE 

The  ensemble  average  of  any  operator  O  can  be 
calculated: 


M 

Ho  =  I  E 

or=  a,  jS  i,  j=l 

I  M 

+  2 


E 

E 

^i]kr^t(T^ip^kp^l 

(9) 

<7,  p=  a,  /8  i, 

j,k,l=l 

H;=  - 

E  F- 

(10) 

cr 

=  a,  /3 

M 

H„=  - 

-2/^oH 

0 

(11) 

i=i 


«0»  = 


Tr[Oe“^"] 

Tr[e-^«] 


(16) 


/  n  d^(Z,)<Z,|Z^_i> 

”  m  =  l _ 

■'  k=l 

(17) 
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In  this  article,  we  numerically  calculate  the  ensem¬ 
ble  average  of  the  total  energy  ((H)}  or  that  of 
the  dipole  moment  ((  fi)). 

It  is  impossible  to  directly  calculate  the  ensem¬ 
ble  average  of  physical  properties  in  Eq.  (17)  with 
the  matrix  elements  of  the  Hamiltonian  in  Eq.  (15), 
because  a  huge  amount  of  integral  variables  is 
generated.  Therefore,  the  above  integral  is  approxi¬ 
mated  by  other  numerical  methods  such  as  the 
Monte  Carlo  method. 


Static  (Hyper)polarizabilities 

We  consider  a  Hamiltonian  written  as 

H  =  H'  -  F  •  /X,  (18) 

where  H'  is  the  addition  of  the  ab  initio  Hamilto¬ 
nian  and  the  Zeeman  Hamiltonian.  If  the  second 
term  is  the  perturbation  term  applied  to  the  molec¬ 
ular  system,  the  (hyper)polarizabilities  are  derived 
in  terms  of  the  perturbation  procedure.  Let  us 
consider  the  molecule  symmetric  with  respect  to 
the  x-direction.  Then,  the  expression  of  the  ensem¬ 
ble  average  of  the  dipole  moment,  (( fJi^)(  (v  = 
X,  y,  z),  becomes 


« 


Tr[ 


(19) 


Similar  to  the  above  procedure,  we  finally  have  an 
explicit  expression  of  the  hyperpolarizability  y^xxx' 


«03,» 


=  z  fax  f'd\' 

•'o  ■'o 

X  Tr[ 


-  L  /%ATr[£-<»-«V.r“F.I 

xF,/Tr[e-^»']f^fA  f'dX' 

•'o  •'o 


XF,^F,yTr[e-^»], 


((Jxx 


1 

6  dF^ 


(22) 

(23) 


It  may  be  easily  shown  that  these  expressions 
are  equal  to  those  obtained  from  the  conventional 
perturbation  theories  of,  e.g.,  the  SOS  method  and 
so  on. 


Numerical  Example 


According  to  the  procedure  of  the  linear  response 
theory,  we  obtain  an  expression  for  to  the 

first  order  in  the  dipole  moment  /x^: 


«Oi,» 


E 


Tr[e-^«'] 


E 

v=x,y,z 


In  this  section  we  discuss  the  performances  of 
numerical  calculations  of  the  ensemble  average  of 
the  physical  property  by  a  Monte  Carlo  method 
based  on  this  method.  A  novel  numerical  scheme 
is  presented.  The  simplest  example  of  a  hydrogen 
molecule  is  examined.  Results  by  this  scheme  are 
compared  with  those  obtained  from  the  conven¬ 
tional  MO  method. 

CALCULATING  SCHEME  AND  MONTE  CARLO 
PROCEDURE 


(20) 


Therefore,  the  polarizability  is  derived  by 


^<Oi.» 

dFx 


(21) 


Figure  1  illustrates  a  numerical  scheme  to  calcu¬ 
late  the  ensemble  average  of  the  physical  property 
by  a  Monte  Carlo  method.  We  need  the  matrix 
elements,  and  between  orbitals  in  each 

term  of  Eqs.  (12)  and  (13).  Because  our  initial 
coherent  state  is  constructed  by  the  HF  orbitals, 
matrix  elements  based  on  the  HF  orbitals  must  be 
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FIGURE  1,  Scheme  for  calculating  physical  properties. 

generated.  First,  the  HF  molecular  orbitals  and  the 
molecular  integrals  based  on  a  basis  set  are  ob¬ 
tained  from  HF  calculations  using  the  H0ND095.3 
program  package  [33].  Second,  we  convert  them 
into  the  molecular  integrals  based  on  the  HF  or¬ 
bitals  orthogonal  to  each  other.  Finally,  the  ensem¬ 
ble  average  of  the  physical  property  is  calculated 
using  the  above  molecular  integrals,  the  number  of 
the  division  n  for  the  partition  function,  and  a 
temperature  based  on  a  metropolis  algorithm.  The 
random  number  with  the  density  |det(l  + 
of  the  probability  is  generated  by  the  rejection 
method.  To  avoid  the  negative  sign  problem,  a 
reweighting  method  is  attempted  [34]. 

EXAMPLE 

As  a  numerical  example,  first,  the  total  energy 
is  calculated  for  a  hydrogen  molecule.  The  results 
are  compared  with  those  obtained  from  the  con¬ 
ventional  molecular  orbital  theory.  Next,  the  calcu¬ 
lation  results  of  the  (hyper)polarizabilities  are 
shown  for  the  hydrogen  molecule.  Here,  the  de¬ 
pendence  of  the  values  on  the  number  of  the 
divisions  is  discussed.  Dependence  on  the  external 
magnetic  field  is  also  discussed. 


Total  Energy  and  Dipole  Moment  Without 

External  Magnetic  Field 

We  consider  a  hydrogen  molecule  with  an 
atomic  distance  of  1.46  au  as  shown  in  Figure  2. 
All  calculations  were  carried  using  the  4-31G  basis 
set.  Figure  3  and  Table  I  show  the  ensemble  aver¬ 
age  of  the  total  energy  vs.  the  number  of  the 
divisions  for  the  partition  function  at  temperature 
of  1  K.  For  10^  divisions,  the  ensemble  average  of 
the  total  energy  becomes  —1.1406  +  0.0033  au.  As 
shown  in  Table  I,  the  CCSD  total  energy,  which  is 
equivalent  to  the  full-CI  energy  in  the  case  of  the 
two-electron  system,  obtained  by  Gaussian  92  [35] 
is  —1.15127  au.  The  total  energy  is  expected  to  be 
equal  to  that  of  full-CI  when  the  number  of  divi¬ 
sions  becomes  large.  We  also  show  the  ensemble 
average  of  the  dipole  moment  under  the  same 
condition  in  Figure  4.  These  results  suggest  that 
for  the  infinite  divisions  for  the  partition  function 
at  lower  temperature  calculations  by  the  MC 
method  cause  errors.  We  expect  that  the  ensemble 
average  of  the  dipole  moment  value  for  each  divi¬ 
sion  number  n  will  vanish  at  higher  temperature. 
Note  that  the  accuracy  of  calculations  by  the  MC 
method  is  generally  proportional  to  the  square 
root  of  the  number  of  the  divisions.  It  means  that 
precise  calculations  by  the  MC  method  takes  us  a 
long  time.  For  this  reason,  the  calculations  for  10^ 
divisions  were  carried  out  to  obtain  qualitative 
effects  of  these  physical  properties.  It  is  necessary 
to  investigate  further  these  effects  with  larger 
number  of  the  divisions,  n,  and  temperature  for 
the  quantitative  study  of  these  physical  properties. 


z 


FIGURE  2.  Employed  model  system  of  a  hydrogen 
molecule. 
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FIGURE  3.  Total  energy  vs.  the  number  of  divisions  for  the  partition  function. 


(Hyper)polarizabilities  Without  External 
Magnetic  Field 

From  Eqs.  (21)-(24),  and  are  calcu¬ 

lated  in  terms  of  the  MC  method.  The  present 
calculations  were  performed  using  the  4-31G  basis 
set  under  a  condition  of  a  temperature  of  1  K.  The 


and  y^cxxx  values  are  also  calculated  for  the 
H2  molecule  by  the  FF  method  with  Gaussian  92. 
These  calculations  were  made  using  the  numerical 
derivative  of  the  total  energy  with  respect  to  the 
applied  electric  field.  The  results  at  HF  and  other 
correlation  levels  are  shown  in  Tables  II“IV.  The 
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TABLE  I  _ 
Total  energy. 


MC  method 

MO  method 

n 

Total  energy  (au) 

Method 

Total  energy  (au) 

1 

-1.1256  ±  0 

HF 

-1.126 

10 

-1.1264  +  0.0009 

MP2 

-1.143 

100 

-1.1285  ±  0.0014 

MP3 

-1.149 

1000 

-1.1336  ±  0.0024 

MP4DQ 

-1.150 

10,000 

-1.1406  +  0.0033 

MP4SDQ 

-1.150 

CCSD 

-1.151 

polarizability  for  each  division  number  in  the 
MC  method  is  shown  in  Figure  5  and  Table  11.  For 
the  division  number  of  10^,  the  value  becomes 
6.675  ±  0.180  au.  On  the  other  hand,  the  CCSD 
method  gives  6.407  au.  As  for  the  total  energy 
value,  we  can  also  expect  that  by  using  a  larger 
division  number  the  polarizability  value  will  vary 
from  that  with  the  HF  calculation  to  that  with  the 
full-CI.  Similarly,  the  hyperpolarizability  Jxxxx 
shows  a  tendency  similar  to  other  physical  proper¬ 
ties,  which  becomes  the  exact  value  at  the  limit  of 
infinite  division  number  as  expected  from  Figure  6 
and  Table  III. 

(Hyper)polarizabilities  Under  External 
Magnetic  Field 

We  calculate  the  (hyper)polarizability  of  the  hy¬ 
drogen  molecule  under  the  external  magnetic  field. 
These  calculations  were  carried  out  using  4-31G 
basis  set  under  a  condition  of  division  number  of 
10^  and  temperature  of  1  K.  First,  the  variation  of 
the  value  with  the  magnetic  field  is  shown  in 
Figure  7  and  Table  IV.  When  the  magnetic  field  is 
in  the  region  from  0.0  to  1.5  au,  the  value 


TABLE  II  _ 
Polarizability. 


MC  method 

FF  method 

n 

“xx  (au) 

Method  (au) 

1 

6.962  ±  0.000 

HF 

6.924 

10 

6.988  ±  0.121 

MP2 

6.665 

100 

6.845  ±  0.151 

MP3 

6.516 

1000 

6.785  ±  0.091 

MP4DQ 

6.440 

10,000 

6.675  ±  0.180 

MP4SDQ 

6.449 

CCSD 

6.407 

TABLE  III _ 

Hyperpolarizability. 


MC  method  FF  method 


n 

7xxxx  (au) 

Method 

•Txxxx  (au) 

1 

-22.603  +  0.000 

HF 

-22.00 

10 

-19.456  ±  1.976 

MP2 

-17.39 

100 

-12.492  ±  3.150 

MP3 

-12.79 

1000 

-10.979  ±  1.207 

MP4DQ 

-9.59 

10,000 

-9.707  ±  1.069 

MP4SDQ 

CCSD 

-10.21 

-8.72 

becomes  about  6  au.  However,  when  the  magnetic 
field  is  over  2.0  au,  the  value  suddenly  de¬ 
creases  to  about  3  au.  This  result  can  be  explained 
as  follows:  As  shown  schematically  in  Figure  8, 
triplet  energy  level  splits  into  three  distinct  levels 
by  the  external  magnetic  field  (Zeeman  splitting). 
Over  the  critical  magnetic  field,  the  lowest  one  of 
the  three  levels  becomes  lower  than  the  singlet 
energy  level  (spin  crossover).  The  sudden  decrease 
in  the  polarizability  described  above  may  reflect 


TABLE  IV _ 

Polarizability:  (a)  polarizability  vs.  magnetic  field; 
(b)  polarizability  for  the  singlet  or  triplet  spin  state. 

(a) 


MC  method 


HZ 

“xx  (au) 

0.0 

6.899  +  0.036 

0.5 

7.175  ±  0.429 

1.0 

6.576  ±  0.295 

1.5 

6.763  ±  0.610 

2.0 

3.207  +  0.505 

2.5 

3.370  +  0.295 

3.0 

3.303  +  0.155 

(b) 

FF  method 

«xx  (au) 

Method 

Singlet 

Triplet 

HF 

6.924 

3.322 

MP2 

6.665 

3.297 

MP3 

6.516 

3.288 

MP4DQ 

6.440 

3.285 

MP4SDQ 

6.449 

3.285 

CCSD 

6.407 

3.283 
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FIGURE  5.  Polarizability  vs.  division  number. 


this  transition  of  the  ground  state.  This  explanation 
is  also  supported  by  the  results  of  independent 
calculations  of  values  for  singlet  and  triplet 
states  by  the  FF  method  shown  in  Table  IV.  Calcu¬ 
lated  results  for  the  hyperpolarizability 
shown  in  Figure  9  and  Table  V.  When  the  mag¬ 
netic  field  is  in  the  region  from  0.0  to  1.0  au,  the 


Jxxxx  value  is  around  —11  au.  On  the  other  hand, 
the  Jxxxx  value  becomes  larger  at  the  region  over 
the  magnetic  field  of  1.0  au.  The  singlet  and  triplet 
Jxxxx  values  by  the  FF  method  are  —  8.72  and 
42.24  au,  respectively,  as  shown  in  Table  V.  The 
increase  in  the  Jxxxx  values  also  can  be  considered 
to  be  the  result  of  the  transition  of  the  ground  state 
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H 


FIGURE  7,  Polarizability  vs.  static  magnetic  field. 


TABLE  V _ 

Hyperpolarizability:  (a)  Hyperpolarizability  vs. 
magnetic  field;  (b)  hyperpolarizabilities  for  the 
singlet  and  triplet  spin  states. 


(a) 

MC  metho 

Hz 

Txxxx 

0.0 

-11.492  ±  1.214 

0.5 

-13.672  ±  4.288 

1.0 

-11.004  ±  2.940 

1.5 

-8.797  ±  4.677 

2.0 

-1.922  ±  6.991 

2.5 

5.047  +  10.684 

3.0 

8.239  ±  14.021 

(b) 

FF  method 

Txxxx  (au) 

Method 

Singlet 

Triplet 

HF 

-22.00 

39.47 

MP2 

-17.39 

41.24 

MP3 

-12.79 

41.95 

MP4DQ 

-9.59 

42.15 

MP4SDQ 

-10.21 

42.02 

CCSD 

-8.72 

42.24 

between  different  spin  states.  These  results  quali¬ 
tatively  show  that  the  (hyper)polarizability  under 
the  external  magnetic  field  is  explained  in  terms  of 
the  relative  stability  of  the  singlet  and  triplet  states. 
Note  that  the  value  by  the  MC  method  in 

the  region  over  the  magnetic  field  of  2.5  au  is  less 
than  that  of  the  triplet  state  by  the  FF  method 
because  of  using  the  division  number  of  10^.  For 
more  precise  calculations,  the  division  number 
should  be  larger  than  10^, 


Concluding  Remarks 

We  presented  a  path  integral  formulation  and 
the  scheme  to  calculate  the  ensemble  average  of 


Energy 
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FIGURE  9.  Hyperpolarizability  vs.  static  magnetic  field. 


physical  properties  in  terms  of  the  Monte  Carlo 
method.  We  applied  our  method  to  calculations  of 
total  energy,  dipole  moment,  and  static 
(hyper)polarizability  of  a  hydrogen  molecule.  The 
qualitative  dependence  of  those  physical  proper¬ 
ties  on  the  number  of  divisions  for  the  partition 
function  was  shown,  which  indicates  that  various 
properties  will  converge  to  an  exact  value  at  the 
limit  of  larger  division  number.  The  qualitative 
dependence  of  physical  properties  on  the  external 
field  is  also  shown.  It  is  found  that  the  static 
(hyper)polarizability  value  varies  around  the  spin 
crossover.  Our  method  can  be  a  useful  tool  for 
investigating  such  phenomena  as  the  property 
change  under  the  transition  process.  In  next  step, 
we  will  reform  our  scheme  to  yield  more  quantita¬ 
tive  results  for  investigating  larger  systems,  chemi¬ 
cal  catastrophe  (dissociation),  and  so  on. 
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ABSTRACT:  Monte  Carlo  simulations  of  liquid  formamide,  N-methylformamide 
(MF),  and  N,  IV-dimethylformamide  (DMF)  have  been  performed  in  the  isothermal  and 
isobaric  ensemble  at  298  K  and  1  atm,  aiming  to  investigate  the  C — H---0  and 
N — H  O  hydrogen  bonds.  The  interaction  energy  was  calculated  using  the  classical 

6-12  Lennard-Jones  pairwise  potential  plus  a  Coulomb  term  on  a  rigid  six-site  molecular 
model  with  the  potential  parameters  being  optimized  in  this  work.  Theoretical  values 
obtained  for  heat  of  vaporization  and  liquid  densities  are  in  good  agreement  with  the 
experimental  data.  The  radial  distribution  function  [RDF,  g(r)]  obtained  compare  well 
with  R-X  diffraction  data  available.  The  RDF  and  molecular  mechanics  (MM2) 
minimization  show  that  the  C — H  •••  O  interaction  has  a  significant  role  in  the  structure 
of  the  three  liquids.  These  results  are  supported  by  ab  initio  calculations.  This  interaction 
is  particularly  important  in  the  structure  of  MF.  The  intensity  of  the  N — H  O  hydrogen 

bond  is  greater  in  the  MF  than  formamide.  This  could  explain  some  anomalous  properties 
verified  in  MF.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  709-717,  1997 

Key  words:  Monte  Carlo  simulation;  amides;  hydrogen  bond;  radial  distribution 
functions 


Introduction 

Liquids  have  particularly  interested  the  chem¬ 
istry  community  due  to  their  fundamental 
role  in  chemical  and  biological  processes.  There¬ 
fore,  the  study  of  their  properties  have  attracted  a 

International  Journal  of  Quantum  Chemistry,  Vol.  65,  709-717  (1997) 
©  1997  John  Wiley  &  Sons,  Inc. 


good  deal  of  attention,  in  spite  of  the  difficulties 
involved.  In  the  last  decade,  computational  meth¬ 
ods  such  as  molecular  dynamics  and  Monte  Carlo 
(MC)  have  become  popular  tools  in  this  task  [1-4]. 
These  methodologies  have  shown  to  be  specially 
useful  to  investigate  some  structural  aspects  re¬ 
lated  with  the  liquid  state.  Nevertheless,  the  suc¬ 
cess  of  the  investigation  is  strongly  dependent  on 
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the  availability  of  intermolecular  potential  func¬ 
tions  that  yield  a  reasonable  model  for  the  liquid. 
In  recent  years,  we  have  undertaken  a  theoretical 
investigation  on  the  liquid  properties  emphasizing 
the  liquid  structure  and  solute-solvent  interac¬ 
tions  in  binary  mixtures  [5].  The  concept  of  hydro¬ 
gen  bond  (H  bond)  has  been  fundamental  in  un¬ 
derstanding  the  interaction  between  molecules, 
especially  in  aqueous  systems.  However,  as  yet, 
many  aspects  of  this  type  of  nonbonded  interac¬ 
tion  are  not  well  known,  in  spite  of  all  the  work 
that  has  been  done  in  that  area.  One  point  of 
dispute  is  the  possibility  of  the  C — H  -  **0  interac¬ 
tion  be  considered  as  H  bond  [6].  Taylor  and 
Kennard  have  provided  conclusive  evidence  of  the 
existence  of  this  interaction  in  crystals,  in  which 
the  lengths  of  the  C  •  •  •  O  distance  are  between  3.0 
and  4.0A,  and  the  C— H---0  angle  ranges  be¬ 
tween  90'"  and  180°  [7].  The  energy  of  this  interac¬ 
tion  is  small  (2.0-20  kj/mol)  [6],  but  there  is  little 
doubt  that  it  has  important  implications  in  chem¬ 
istry,  biochemistry,  and  crystal  engineering,  espe¬ 
cially  when  strong  O — H  •  *  •  O  and  N — H  •••OH 
bonds  are  absent  [6,  8].  C — H---0  bonds  may 
even  distort  the  geometry  of  the  strong  H  bond  in 
specific  instances.  Anomalies  in  strong  H  bond 
patterns  of  certain  crystal  structures  are  explained 
on  the  basis  of  the  interplay  of  strong  and  weak  H 
bonds  [8].  In  general,  the  strength  and  effective¬ 
ness  of  C — H  '••  O  H  bond  depends  on  the  C — H 
carbon  acidity  [9]  and  on  the  O  atom  basicity  [10], 
which  is  enhanced  by  cooperative  effects.  The  cri¬ 
teria  for  the  significance  of  a  particular  C — H  •  •  •  O 
geometry  have  also  been  the  subject  of  some  dis¬ 
cussion,  Recent  analysis  has  shown  that  angular 
characteristics  of  the  larger  C — H  •  •  •  O  contacts 
(C---0  3.5-4.0  A)  and  their  effects  on  crystal 
structures  resemble  those  with  shorter  contacts 
(3.0-3.5  A)  [6].  It  should  be  emphasized  that  the  C 
— H  •••  O  bond  is  not  really  a  van  der  Waals  con¬ 
tact  but  is  essentially  electrostatic.  Therefore,  since 
its  strength  falls  off  slower  with  distance  than  the 
usual  van  der  Waals  interaction,  it  is  conceivable 
at  distances  equal  to  or  larger  than  the  van  der 
Waals  limit  [8].  Thus,  even  long  C  •••  O  separations 
(about  4.0  A)  must  be  considered,  and  one  can 
expect  that  these  weak  H  bonds  have  orienting 
effects  on  molecules,  influencing  in  liquid  struc¬ 
ture. 

Another  type  of  H  bond  that  plays  a  master  role 
in  biological  chemistry  is  the  N — H---0  =  C  H 
bond.  However,  some  aspects  of  its  structure  are 


not  yet  well  understood.  Bertolasi  and  co-workers 
have  shown  that  the  range  of  the  N  •  •  •  O  distance 
is  between  2.7  and  3.3  A,  while  the  N — H---0 
angle  ranges  between  144°  and  175°  [11].  The  au¬ 
thors  have  concluded  that  the  resonance  is  the 
most  important  factor  determining  the  strength  of 
the  interaction  and  is  responsible  by  the  strong  H 
bonds  formed. 

The  main  goal  of  this  study  is  to  investigate  the 
patterns  of  this  interactions  in  liquid  phase  using 
computer  simulations.  For  that  purpose  we  have 
performed  MC  simulations  [12]  of  the  simplest 
liquid  amides,  like  formamide,  N-methylforma- 
mide,  and  N,N-dimethylformamide  (DMF).  Those 
molecules  contain  a  carbonyl  C — H  group  that 
could  form  C — H  •  •  •  O  bond,  and  two  of  them  also 
have  a  0  =  C — N — H  group  that  could  form  an  N 
— H---0  =  C  bond.  In  the  latest  case  the  main 
feature  is  the  capability  of  this  group  to  present 
resonance.  Besides,  it  is  possible  to  investigate  the 
role  of  the  substitution  on  the  donor  capability  of 
the  amine  group. 


Molecular  Models 

A  MC  investigation  using  optimized  potential 
for  liquid  simulations  (OPLS)  force  field  parame¬ 
ters  [13]  on  a  five-site  model  and  a  molecular 
dynamics  simulation  with  test-particle  model  po¬ 
tential  [14]  on  a  six-site  model  has  been  published 
for  formamide.  Monte  Carlo  [13]  and  molecular 
dynamics  [15]  simulations  have  been  performed 
for  a  five-site  model  of  DMF  using  OPLS  poten¬ 
tials.  More  recently,  molecular  dynamics  simula¬ 
tion  on  the  three  liquids  has  been  published  using 
six-sites  model  in  which  the  structure  factors  for 
the  liquids  were  explored  and  compared  with  the 
experimental  ones  [16].  Having  in  mind  our  main 
purpose  in  this  work,  we  have  optimized  a  six-site 
model  for  those  three  liquids  (the  methyl  groups 
are  treated  as  a  monatomic  site)  such  that  the 
H---0  interaction  in  C^ — H'*-0  can  be  studied. 
This  parametrization  is  also  important  to  investi¬ 
gate  the  behavior  of  the  aldehydic  hydrogen  in  the 
amide-water  solutions.  In  order  to  clarify  the 
site-site  intermolecular  correlation  obtained  by  the 
plots  of  the  radial  distribution  function,  we  have 
investigated  the  dimer  formation  in  the  gas  phase 
at  molecular  mechanics  (MM2)  level  [17].  The 
molecules  were  modeled  as  rigid  structures  and 
therefore  intramolecular  relaxation  effects  are  not 
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considered.  As  it  is  known  these  molecules  do  not 
present  rotation  around  the  C — M  bond  due  to  the 
resonance  verified  in  the  peptide  bond.  The  molec¬ 
ular  geometry  parameters  for  the  three  molecules 
were  taken  from  the  Jorgensen  and  Swenson  (JS) 
study  [13].  The  potential  parameters  were  opti¬ 
mized  for  formamide  and  DMF  and  accordingly 
used  in  MF.  The  Lennard-Jones  parameters  used 
for  aldehydic  hydrogen  were  those  developed  by 
Hagler  and  Liphson  [18]  while  the  rest  were  opti¬ 
mized  using  those  reported  by  JS  [10]  as  a  starting 
trial  set.  The  comparison  with  the  experimental 
values  of  density  and  heat  of  vaporization 
was  the  criterion  used  to  optimize  these  parame¬ 
ters.  The  charge  distribution  was  optimized  to 
yield  a  dipole  moment  around  15“20%  greater 
than  the  one  in  the  gas  phase.  This  procedure  has 
been  used  as  a  criterion  to  compensate  for  neglect¬ 
ing  polarization  effects  in  fluid  simulations  [13, 19, 
20].  In  formamide  we  have  started  with  the  charges 
published  by  Hagler  and  Liphson  [18]  (the  point 
charges  on  amine  sites  are  very  similar  to  those 
used  by  JS  [13]).  For  the  DMF  we  have  used  the 
aldehydic  charges  obtained  for  formamide  and  the 
charges  published  by  JS  for  the  amine  group  [13]. 
The  optimization  of  the  charges  was  made  such 
that  the  aldehydic  and  amine  groups  were  set  to 
be  zero.  I  kept  the  charge  distribution  on  the  alde¬ 
hydic  group  the  same  for  all  studied  molecules 
and  changed  only  the  charges  on  the  amine  group. 
All  the  approximations  above  are  criteria  to  mini¬ 
mize  the  number  of  potential  parameters.  In  Table 
I,  I  present  the  potential  parameters  used  for  the 
simulations  of  the  three  amides.  The  results  from 
this  work  attest  the  quality  of  the  Lennard-Jones 
potential  parameters  reported  by  JS  [13], 

A  calculation  of  the  partial  charges  was  made 
with  6-31 G*  basis  set  using  the  CHELPG  proce¬ 
dure  included  in  the  Gaussian  92  program  [21]  as 
discussed  elsewhere  [22],  maintaining  fixed  our 
molecular  geometries.  In  Table  II,  I  present  the 
results  obtained  in  this  work. 

It  is  noticed  that  the  charges  calculated  with  the 
CHELPG  method  are  somewhat  different  from 
those  I  have  used  in  the  simulation.  The  main 
difference  is  in  the  charges  of  the  carbonylic  car¬ 
bon  and  the  nitrogen.  However,  the  dipole  mo¬ 
ments  are  quite  similar  in  both  calculations.  There¬ 
fore,  I  conclude  that  the  dipole  moment  plays  an 
important  role  in  the  experimentally  observed  AH^ 
and  liquid  densities.  It  is  interesting  to  notice  that 
the  derivation  of  partial  charges  from  molecular 
orbital  calculations  depends  on  the  various  as¬ 


sumptions  in  the  molecular  orbital  theories,  while 
that  optimized  by  MC  simulations  depends  on  the 
fitting  of  experimental  results  with  the  intermolec- 
ular  force  field  used. 


Computational  Aspects 

INTERMOLECULAR  POTENTIAL  FUNCTION 

To  determine  the  potential  topology,  the  energy 
between  molecules  a  and  b  has  been  repre¬ 
sented  by  a  sum  of  Coulomb  and  Lennard-Jones 
potentials  centered  on  the  sites: 

=  L  [  -  Bij/rfj  +  ,  (1) 

where  is  the  distance  between  site  i  in  a  and 
site  j  in  b  and  and  qj  are  fractional  point 
charges  located  on  i  and  j  molecular  sites  (it  has 
been  supposed  that  the  electrostatic  contribution 
to  the  intermolecular  potential  is  suitable  repre- 


TABLE  I _ 

Potential  parameters  for  the  amides  used  In 
this  work. 


Sites 

<7 

(T  (A) 

s  (kcal  /  mol) 

Formamide  ( 

/x  =  4.2D) 

c 

0.34 

3.75 

0.105 

0 

-0.46 

2.96 

0.21 

H(C) 

0.12 

2.75 

0.038 

N 

-0.830 

3.25 

0.17 

H(N) 

0.415 

0.00 

0.00 

MF(ix,= 

4.3  D) 

c 

0.34 

3.75 

0.105 

0 

-0.46 

2.96 

0.21 

H(C) 

0.12 

2.75 

0.038 

N 

-0.70 

3.25 

0.14 

Me 

0.285 

3.80 

0.13 

H(N) 

0.415 

0.00 

0.00 

DMF(/a  = 

=  4.4D) 

C 

0.34 

3.75 

0.105 

0 

-0.46 

2.96 

0.21 

He 

0.12 

2.75 

0.038 

N 

-0.57 

3.25 

0.14  (0.17) 

Me 

0.285 

3.80 

0.13(0.17) 

The  values  between  brackets  are  those  reported  by  Jor¬ 
gensen  and  Swenson  [13].  The  unit  of  /x  in  Debye  (the 
values  are  those  ones  obtained  with  the  charges  presented). 
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TABLE  II _ 

Partial  charges  located  on  the  amides  sites 
obtained  with  6-31 G*  basis  functions. 


Formamide 
(11  =  4.34) 

MF 

(iu,  =  4.34) 

DMF 

( At  =  4.35) 

C  =  0.752 

C  =  0.565 

C  =  0.463 

0=  -0.630 

0=  -0.586 

0=  -0.595 

H(C)  =  -0.003 

H(C)  =  0.017 

H(C)  =  0.051 

N  =  -0.970 

N  =  -0.544 

N  =  -0.143 

H(N)  =  0.401 

H(N)  =  0.303 

CH3  =  0.153 

H{N)  =  0.449 

CH3  =  0.246 

CH3  =  0.072 

Unit  of  fjb  in  Debye. 


sented  by  fractional  charges  located  on  the  molecu¬ 
lar  sites).  For  each  site  fc,  the  parameters  and 
Bi,!,  were  given  by  A,,,,  =  and 

where  and  are  the  Lennard-Jones  parame¬ 
ters  for  the  kih  site.  Parameters  A^j  and  for  a 
nondiagonal  interaction  [/,;]  were  obtained  using 
the  geometric  combining  rules  Ajj  =  (A^Aj^y'^^ 
and  Bij  =  [23].  This  approach  has  been 

frequently  used  in  this  field  of  study.  In  an  earlier 
work  we  have  shown  that  the  difference  in  the 
results  obtained  using  the  geometric  combining 
rule  above  and  well-know  Lorentz-Berthelot  rule 
is  negligible  [5]. 

MO^TE  CARLO  SIMULATIOEVS 

MC  simulations  were  carried  out  in  the  isother¬ 
mal  and  isobaric  ensemble  at  298.15  K  and  internal 
pressure  1  atm,  employing  Metropolis  importance 
sampling  [24]  and  cubic  box  boundary  conditions. 


All  the  systems  investigated  have  consisted  of  256 
monomers  in  a  cubic  cell.  In  our  earlier  work  we 
have  demonstrated  that  the  number  of  molecules 
in  the  box,  does  not  imply  significant  changes  in 
the  results  obtained  [5].  In  the  calculation  of  con¬ 
figurational  energy  using  Eq.  (1),  the  Coulombic 
term  was  calculated  with  Ewald  sum  [23].  Starting 
from  the  initial  distribution  of  monomers,  a  new 
configuration  was  generated  by  randomly  translat¬ 
ing  and  rotating  a  randomly  chosen  molecule  along 
Cartesian  coordinates.  Ranges  for  translations  and 
rotations  of  the  monomers  and  volume  changes 
were  adjusted  to  yield  an  acceptance/trial  ratio  of 
about  0.45  for  new  configurations.  After  a  volume 
movement  the  coordinates  of  the  center  of  mass  of 
all  molecules  in  the  reference  box  were  scaled  in 
an  usual  way  [23].  Each  calculation  started  with  an 
equilibration  phase  of  1.2  X  10^  configurations  fol¬ 
lowing  by  an  average  segment  consisting  of  8  X  10^ 
configurations.  Statistical  uncertainties  were  calcu¬ 
lated  from  separate  averages  over  blocks  of  2  X  10^ 
configurations.  The  calculations  were  carried  out 
on  Silicon  Graphics  Power  4D  and  IMB  RISC  6000 
workstations. 


Results  and  Discussion 

THERMODYNAMIC 

In  Table  III,  I  show  the  thermodynamic  results 
obtained  for  the  three  amides.  The  results  agree 
very  well  with  those  reported  by  JS  for  formamide 
and  DMF  [13]  and  a  good  agreement  is  noted 
between  theoretical  and  experimental  values.  As 
already  discussed  above,  the  potential  parameters 


TABLE  III 


Values  of  thermodynamics  properties  of  the  pure  iiquids.^ 


Liquid 

<1 

-£/ 

V 

P 

This  work 

Formamide 

14.79  ±  0.02 

14.20  ±  0.02 

40.93 

1.10  +  0.003 

MF 

13.77  ±  0.02 

13.18  ±  0.02 

59.32 

1.00  +  0.003 

DMF 

1 1 .40  ±  0.03 

10.81  ±  0.03 

78.54 

0.93  ±  0.003 

Experimental  values^ 

Formamide 

14.70 

14.11  ±  0.02 

39.89 

1.13 

MF 

13.77 

13.18  ±  0.02 

57.56 

1.02 

DMF 

11.10 

10.51  ±  0.03 

77.66 

0.94 

^  Heat  of  vaporization  {AHj  and  intermolecular  energy  (£,)  In  kcal  /  mol.  The  heat  of  vaporization  was  estimated  by  =  -£/  +  RT. 
Mole  volume  (i7)  in  cm® /mol  (the  experimental  one  Is  calculated  with  experimental,  density  and  molecular  weight).  Density  (p)  in 
g/cm®. 

and  p  obtained  from  Ref.  [25]. 
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were  optimized  for  formamide  and  DMF  and  used 
in  methylformamide.  As  one  can  notice,  these  pa¬ 
rameters  reproduce  well  the  experimental  thermo¬ 
dynamic  properties  of  the  liquids. 

RADIAL  DISTRIBUTION  FUNCTIONS  AND 

HYDROGEN  BOND 

Formamide 

The  radial  distribution  functions  (RDF)  com¬ 
puted  for  formamide  are  shown  in  Figure  1.  Quali¬ 
tatively,  our  results  are  similar  to  those  reported 
by  JS  [13],  The  small  differences  in  the  height  of 
some  peaks  can  be  attributed  to  differences  in  the 
charges  located  on  the  sites.  The  closest  inter- 
molecular  correlation  verified  in  the  RDF  is  be¬ 
tween  the  oxygen  and  the  hydrogen  atoms  bonded 

o 

to  nitrogen  located  at  1.9  A.  This  correlation  has 
been  verified  by  different  experimental  techniques 
and  reported  earlier  by  Kalman  and  co-workers 
[26].  It  is  also  very  close  to  the  smaller  distance 
reported  by  Bertolasi  and  co-workers  to  the  H 
bond  in  the  molecules  investigated  by  them  [11]. 
In  the  same  way,  the  N — O  correlation  at  2.9  A 
also  agrees  with  the  distance  determined  experi¬ 
mentally  by  Kalman  and  co-workers  [26],  and  it  is 
in  the  range  of  distances  reported  by  Bertolasi  and 
co-workers  to  the  N — H  •••  0  =  C  FI  bond  [11].  Ac¬ 
cording  to  these  authors,  these  distances  are  char¬ 
acteristic  of  a  strong  H  bond  between  the  molecules 
and  are  verified  preferentially  in  molecules  that 
present  resonance.  It  is  clear  from  the  height  and 
the  peak  positions  of  the  two  amine  hydrogen  that 
there  is  no  strong  preference  between  them  in  the 
hydrogen  bond  formation. 


Figure  2  shows  the  only  two  dimers  found  with 
the  minimization  procedures  that  agree  with  the 
g(r)  plots.  The  legend  show  the  most  important 
geometric  parameters  to  our  discussion.  As  one 
can  observe,  the  gir)  plots  are  a  combination  of 
these  dimers.  This  is  an  indication  of  the  existence 
of  both  of  them  in  the  liquid.  This  coexistence  is 
stressed  by  the  0/H(C)  RDF  which  presents  two 
significant  peaks  at  2.9  and  4.9  A,  The  form  of  the 
N/N  g(r)  is  also  due  to  the  coexistence  of  both 
dimers  in  the  liquid.  The  N — H  O  angle  in  dimer 
a  is  roughly  157°,  which  agrees  with  the  values 
reported  by  Bertolasi  and  co-workers  for  the  N  •  •  •  O 
distance  verified  in  this  correlation  [11].  Ohtaki 
and  co-workers  have  optimized  formamide  dimers 
with  ab  initio  LCGO-MO-SCF  with  a  minimum 
base  set  (2s/lp)  [27].  They  have  obtained  that  the 
cyclic  dimer  is  the  most  stable,  which  is  also  the 
case  in  our  study,  but  the  authors  have  concluded 
that  the  trans  dimer  is  more  probably  in  the  liquid 
because  this  dimer  favors  the  formation  of  open 
chains,  which  stabilize  the  liquid.  Those  authors 
have  also  optimized  the  dimer  forming  H  bonds 
between  oxygen  and  aldehydic  hydrogen  and  have 
founded  that  this  dimer  presents  a  small  stability 

o 

with  the  equilibrium  distance  O — H  of  3.2  A, 
which  is  very  close  to  the  first  maximum  of  the 
0/H(C)  RDF  observed  in  Figure  1.  The  C**-0 
lengths  and  C — H  •  •  •  O  angles  reported  here  also 
agree  with  the  values  presented  by  Desiraju  [6]  as 
characteristic  of  C — H-**0  H  bonds.  Then  the 
results  are  consistent  with  the  existence  of  the 
C — H  ■••OH  bond  in  the  liquid  formamide.  This 
correlation  must  contribute  to  increases  the  stabil¬ 
ity  of  the  trans  dimer  (dimer  a).  Then,  our  six-site 


FIGURE  1.  Plots  of  radial  distribution  function  for  formamide.  H^.  and  H,  are  the  aminic  hydrogens  c/s  and  trans  to 
the  oxygen,  respectively. 
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FIGURE  2.  The  two  formamide  dimers  optimized  at  MM2  level  that  agree  with  g{r).  Geometric  parameters  of  dimer  a: 
r(O/Ht)  =  2.0  A;  r(0/Hc)  =  3.6  A;  r(O/N)=3.0  A;  r(0/C)  =  3.5  A;  r(0/ H(C))  =  3.0;  COM  angle  =152°;  NHO 
angle  =  148.6°;  CHO  angle  =  102.6°.  Geometric  parameters  of  dimer  b:  r(0/  Ht)  =  3.9  A;  r(0/  He)  =  2.0  A  (for  the  two 
H-bond);  r(0/N)  =  3.0  A;  r(0/C)  =  3.8  A;  COH  angle  =  120°  (for  the  two  H-bond);  NHO  angle  =  178°.  H^  and  H,  are 
the  aminic  hydrogens  c/s  and  trans  to  the  oxygen,  respectively. 


model  can  identify  the  two  types  of  correlation 
discussed  in  the  introduction  present  in  liquid 
formamide.  Based  in  this  work  and  in  the  early 
ones  [13,  27],  we  consider  that  the  structure  of  the 
liquid  formamide  consists  of  the  two  dimers 
showed  in  the  Figure  2.  Given  the  discussion 
above,  I  consider  the  position  that  the  formamide 
is  formed  of  flexible  chains  of  dimer  a  with  cyclic 
dimers  ''immersed"  in  it  [27]  seems  to  be  reliable. 

/V,iV-Dimethylforiiiamide 

Unlike  the  formamide,  no  distinct  peak  has  been 
verified  in  X-ray  study  of  DMF  [28].  As  a  conse¬ 
quence,  there  is  more  interest  in  simulating  this 
liquid  since,  of  course,  it  forms  a  condensed  phase. 
In  Figure  3  is  shown  the  RDF  computed  for  the 
liquid.  Qualitatively,  the  g(r)  graphs  obtained  here 
are  equivalent  to  those  obtained  by  JS  [13].  As  in 
the  case  of  formamide,  there  are  little  differences 


in  the  height  of  the  peaks,  which  can  be  attributed 
to  differences  in  the  charges  located  on  the  sites. 
The  present  g(r)  results  show  a  greater  definition 
for  the  shoulders  in  the  oxygen-carbon  correlation 
then  in  the  JS  study  [13],  as  consequence  of  the  fact 
that  I  recognize  C  and  H  as  different  sites  instead 
of  a  single  site.  The  g(r)  plots  show  that  the 
shortest  intermolecular  distance  in  liquid  phase  is 
not  between  oxygen  and  methyl  groups,  as  was 
reported  in  the  JS  study  [13],  but  between  oxygen 
and  the  aldehydic  hydrogen.  Figure  4  presents  the 
two  dimers  found  for  which  the  atom-atom  dis¬ 
tances  better  reproduce  the  g(r)  plots  obtained. 
From  these  dimers  it  is  clear  that  there  is  a  signifi¬ 
cant  correlation  between  oxygen  and  the  hydrogen 
bonded  to  the  carbonyl  group.  This  correlation 
must  contribute  for  the  correlation  between  oxy¬ 
gen  and  trans  methyl  to  be  more  intense  than 
oxygen  and  cis  methyl.  The  first  peak  in  the  corre¬ 
lation  oxygen-carbon  occurs  in  the  dimer  involv- 


FIGURE  3.  Plots  of  radial  distribution  function  for  N,  W-dimethylformamide.  Me{c)  and  and  Meft)  are  the  aminic 
methyls  c/s  and  trans  to  the  oxygen,  respectively. 
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FIGURE  4.  The  two  DMF  dinners  optimized  at  MM2  level  that  agree  with  the  g(r)  plots.  Geometric  parameters  of  the 
dimer  a:  r(0  /  Me(c))  =  3.5  A;  r (0  /  C)  =  5.4  A;  r(0  /  H)  =  6.5  A;  /-(N  /  C)  =  5.0  A;^  r (N  /  N)  =  5.7  A-  r (O  /  N)  =  4.7  A- 
C(CH3)0  angle  =  158°.  Geometric  parameters  of  the  dimer  b:  r(0  /  Me(t))  =  3.5  A;  r(0  /  C)  =  3.8  A;  r(0  /  H)  =  2.8  A; 
r(N/C)  =  5.3  A;  f(N/N)  =  5.8  A;  r(0/N)  =4.0  A;  COM  angle  =  143°;  CHO  angle  =  144°.  Me(c)  and  Me(t)  are  the 
aminic  methyls  cis  and  trans  to  the  oxygen,  respectively. 


ing  the  oxygen-hydrogen  correlation  while  the 

o 

peak  at  5.5  A  is  due  to  the  dimer  involving  the  cis 
methyl.  The  second  peak  in  the  O/C  correlation, 
more  intense  than  the  first  peak,  is  a  clear  indica¬ 
tion  of  the  existence  of  two  different  dimers  in  the 
liquid  bulk.  Also  in  this  case,  the  C**-0  lengths 
and  C — H  •  •  •  O  angles  reported  for  dimer  b  agree 
with  the  values  presented  by  Desiraju  as  being 
characteristics  of  C — H---0  hydrogen  bonds  [6]. 
Ohtaki  and  co-workers  [28]  have  made  an  ab  initio 
MO-SCF  calculation  for  the  dimer  bonded  by 
C  =  0 — H(C)  and  found  a  stabilization  energy  of 
-4.3  kcal/mol  and  a  bond  distance  O — H  of  3.2 
A.  Based  in  the  Desiraju  study  [6]  and  in  the 
hydrogen  bond  energy  of  water  (7.2-9.6  kcal/mol), 
this  energy  seems  to  be  too  high.  Our  results  show 
that  the  distance  of  this  interaction  is  also  lower 
than  that  reported  by  those  authors  [28].  One 
should  keep  in  mind  that  we  are  comparing  results 


for  liquids  with  ab  initio  results  for  the  gas  phase. 
The  existence  of  these  two  dimers  in  the  liquid 
phase  agrees  with  the  structure  proposed  by  Ohtaki 
and  co-workers  [28,  29]  of  a  random  distribution  of 
molecular  dipole  orientation. 

/V-Methylformamide 

In  the  simulation  of  these  molecules  we  have 
two  possibilities  for  the  position  of  the  methyl  and 
hydrogen  sites  of  the  amine  group.  Each  one  can 
be  cis  or  trans  to  oxygen.  The  hydrogen  trans 
conformer  is  predicted  to  be  4.2-5  kj/mol  more 
stable  than  the  cis  one.  Experimentally  the  relative 
concentration  of  the  cis  conformer  is  found  to  be 
around  10%,  which  is  in  agreement  with  the  stabi¬ 
lization  energy  [30].  In  this  work,  I  have  modeled 
the  molecule  with  the  hydrogen  trans  to  the  oxy¬ 
gen.  Figure  5  shows  the  RDF  graphs  obtained  for 


FIGURE  5.  Plots  of  radial  distribution  function  for  methylformamide. 
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FIGURE  6.  Methylformamide  dimer  optimized  at  l\/IM2 
level.  Geometric  parameters  of  the  dimer:  r(0-H(N))  = 
2.0  A;  r(0-CH3)  =  3.6  A;  r(O-C)  =  3.4  A;  r(0-H(C))  = 
3.0  A;  r(N”0)  =2.9  A;  r(N-C)  =  3.9  A,  4.7  A;  N-H--0 
angle  =  1 56°;  C  -H  •  •  •  0  angle  =  1 04°. 


this  liquid,  and  Figure  6  shows  the  only  dimer  that 
we  succeeded  to  optimize  that  agrees  with  the 
RDF  graphs  presented. 

Thus  for  this  liquid,  the  intermolecular  interac¬ 
tion  between  oxygen-amine  hydrogen  with  hydro¬ 
gen  bond  formation  prevails.  The  0-**H(N)  and 
N*--0  distance  and  N — H--*0  angle  agree  with 
this  [8],  The  weak  correlation  between  oxygen  and 
hydrogen  bonded  to  the  carbonyl  around  3  A  must 
contribute  to  the  dimer  stabilization.  The  O  •  •  •  Fi(C) 
length  and  C — H-'-O  angle  agree  with  that  re¬ 
ported  by  Desiraju  for  the  H  bond  between  this 
atom  [6].  It  is  interesting  to  notice  that  the  substi¬ 
tution  of  a  hydrogen  by  a  methyl  group  from 
formamide  to  MF  make  the  aminic  hydrogen  in 
MF  more  acidic  than  in  the  formamide,  due  to  the 
stabilization  of  the  methyl  group  in  the  change  of 
nitrogen  from  sp^  to  sp^.  Then  the  0”*H(N)  hy¬ 
drogen  bond  must  be  stronger.  This  dimer  agrees 
with  the  structure  proposed  by  Ohtaki  and  co¬ 
workers  [31]  and  Neuefeind  and  co-workers  [32]  of 
linear  chains  with  preferred  parallel  orientation  of 
the  molecular  moment. 


Conclusions 

Clearly  the  properties  of  formamide  and  N- 
methylformamide  are  a  consequence  of  the  hydro¬ 
gen  bond  between  oxygen  and  aminic  hydrogen. 
DMF  does  not  present  this  correlation,  N-methyl- 
formamide  presents  an  O  •••  H(N)  H  bond  stronger 
than  formamide,  with  the  consequence  than  the 
aminic  hydrogen  is  more  acidic  in  the  first  one.  So, 
MF  must  be  more  structured  around  this  hydrogen 


bond  than  formamide.  Other  properties  to  be  no¬ 
ticed  are  the  melting  and  boiling  points  of  for¬ 
mamide,  MF,  and  DMF  which  are  2.55  and  111°C, 
-3.86  and  185°C,  and  -60.48  and  155°C,  respec¬ 
tively  [25].  Then,  clearly  MF  presents  an  anoma¬ 
lous  behavior,  a  consequence  of  a  greater  struc¬ 
turation  of  the  liquid  phase  in  which  the  O  •  •  •  H(N) 
plays  a  very  important  role.  On  other  hand,  for¬ 
mamide  and  DMF  are  completely  soluble  in  water, 
while  MF  is  not  [25].  The  more  intense  correlation 
in  DMF  is  between  the  oxygen  and  methyl  group 
located  in  the  trans  position  with  respect  to  oxy¬ 
gen,  but  the  O  •  •  *  H(C)  is  the  closest.  Ab  initio 
calculations  on  the  dimer  stabilization  energy  of 
oxygen-aldehydic  hydrogen  interaction  from  for¬ 
mamide  and  DMF  give  values  of  -2.0  and  -4.3 
kcal/mol,  respectively  [26,  27].  This  correlation  in 
DMF  is  more  stable  than  in  the  other  two  liquids 
and  must  have  a  greater  significance  in  the  liquid 
structure.  In  all  studied  liquids,  the  length,  angles, 
and  energetic  analysis  made  on  the  O  •  •  *  H(C)  per¬ 
mit  the  conclusion,  based  in  the  Desiraju  study  [6], 
that  there  is  hydrogen  bond  formation  in  the  liq¬ 
uids.  Liquid  formamide  is  made  of  flexible  chains 
in  which  are  immersed  cyclic  dimers.  Liquid  DMF 
is  constituted  of  two  dimers  roughly  dispersed  in 
the  liquid  phase.  MF  is  constituted  of  linear  chains 
of  one  dimer  only. 
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ABSTRACT:  The  molecular  mechanism  for  the  gas-phase  addition  of 
organomagnesium  reagents:  CH3MgCl,  2CH3MgCl,  (CH3)2Mg,  and  (CH3)2Mg  plus 
Cl  2  Mg,  to  2-hydroxypropanal  as  a  model  of  chiral  a-alkoxy  carbonyl  compounds  is 
investigated  at  the  6-31G*  basis-set  level  of  calculation.  An  extensive  exploration  of  the 
reactive  potential  energy  surface  was  carried  out  in  order  to  locate  and  characterize  the 
stationary  points.  The  geometry  of  stationary  points  and  the  harmonic  vibrational 
frequencies,  transition  vectors,  and  electronic  structure  of  the  transition  structures  were 
obtained.  The  theoretical  results  are  analyzed,  discussed,  and  compared  with  previous 
theoretical  and  available  experimental  data.  The  first  step  corresponds  to  the  exothermic 
formation  of  the  chelate  complexes  without  an  energy  barrier.  These  stationary  points 
correspond  to  puckered  five-membered  rings,  determining  the  stereochemistry  of  the 
global  process,  which  is  retained  throughout  the  reaction  pathway.  For  the  reactions  of 
one  equivalent  of  an  organomagnesium  compound  [CH3MgCl  or  (CH3)2Mg],  the  second 
step  for  the  intramolecular  mechanism  is  associated  to  the  C — C  bond  formation  via 
1,3-migration  of  the  nucleophilic  methyl  group  from  the  organomagnesium  compound  to 
the  carbonyl  carbon  and  the  corresponding  transition  structure  can  be  described  as  a 
four-membered  ring,  the  anti  attack  being  the  most  favorable  pathway.  CH3MgCl  is  a 
more  powerful  quelant  agent  than  is  the  (CH3)2Mg  system.  Therefore,  the  reaction 
pathway  associated  to  the  nucleophilic  attack  of  CH3MgCl  -h  2-hydroxypropanal  presents 
a  larger  barrier  height  than  that  of  (CH3)2Mg  -f  2-hydroxypropanal  addition.  The 
inclusion  of  a  second  equivalent  corresponding  to  the  2CH3MgCl  and  (CH3)2Mg  + 

Cl  2  Mg  systems  yields  an  intermolecular  mechanism,  the  barrier  height  decreases,  and 
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the  process  can  be  considered  as  an  assisted  intermolecular  mechanism:  The  first 
equivalent  forms  the  chelate  structure  and  the  second  one  carries  out  the  nucleophilic 
addition  to  the  carbonyl  group.  The  most  favorable  pathway  corresponds  to  an 
intermolecular  mechanism  via  an  anti  attack  for  the  addition  of  2CH3MgCL  ©  1997  John 
Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  719-728,  1997 

Key  words:  chelate  complex;  organomagnesium  reagents;  chiral  a-alkoxy  carbonyl 
compounds;  enantioselective  additions;  intra-  and  intermolecular  mechanisms 


Introduction 

During  the  last  several  years,  there  has  been  a 
notable  increase  in  research  on  organomag¬ 
nesium  alkylation  reactions  [1-3].  Detailed  knowl¬ 
edge  of  such  reactions  is  relevant  not  only  for  the 
understanding  of  the  addition  mechanism  but  also 
for  the  development  of  enantioselective  synthesis 
via  the  presence  of  chiral  centers,  since  the  latter  is 
necessary  to  avoid  formation  of  undesirable  di- 
astereoisomers.  Face-selective  nucleophilic  addi¬ 
tion  of  Grignard  reagents  to  a  trigonal  carbon  such 
a  carbonyl  group  may  create  a  new  chiral  center  of 
high  diastereoisomeric  or  enantiomeric  purity  with 
practical  significance  in  synthetic  organic  chem¬ 
istry  [3-7].  However,  there  is  still  no  general 
agreement  concerning  the  nature  of  the  molecular 
mechanism  for  this  type  of  chemical  reaction 
[1-3,8-10]. 

Explaining  experimental  behavior  has  been  a 
challenging  problem  in  theoretical  physical  chem¬ 
istry.  The  quantum  mechanical  techniques  are  used 
to  characterize  the  potential  energy  surface  (PES) 
representing  a  chemical  reaction  [11, 12].  The  anal¬ 
ysis  of  this  PES  provides  molecular  geometries, 
vibrational  frequencies  for  reactants,  products,  in¬ 
termediates,  and  transition  structures  (TSs).  From 
these  data,  the  barrier  heights  and  rate  constants 
for  the  reaction  pathways  connecting  the  reactants 
with  the  products  via  the  corresponding  TSs  can 
be  calculated.  An  understanding  of  the  nature  of 
the  TS  for  a  reaction  can  be  valuable  for  practical 
as  well  as  theoretical  reasons.  We  are  engaged  in  a 
research  program  to  explore  new  possibilities  in 
asymmetric  synthesis  from  both  experimental  and 
theoretical  viewpoints.  We  describe  herein  our  the¬ 
oretical  work  on  the  analysis  of  nucleophilic  addi¬ 
tion  of  organomagnesium  reagents  [CH3MgCl, 
2CH3MgCl,(CH3)2Mg  and  (CH3)2Mg  plus  Cl2Mg] 
to  2-hydroxypropanal  in  the  gas  phase  as  a  model 


of  chiral  a-alkoxy  carbonyl  compounds.  The  main 
objective  is  the  characterization  of  the  reactive 
potential  energy  surface,  to  obtain  the  nature  of 
the  molecular  mechanism  for  these  chemical  reac¬ 
tions  and  to  rationalize  experimental  observations. 


Methods  and  Models 

The  calculations  were  performed  with  the 
GAUSSIAN94  program  [13].  The  ab  initio  calcula¬ 
tions  were  made  at  the  HF/6-31G*  [14-18]  level. 
This  basis-set  representation  yields  reasonable  re¬ 
sults  in  related  theoretical  studies  [4, 19-22]. 

The  exact  characterization  of  the  TSs  was 
achieved  by  using  a  simple  algorithm  [23-26],  in 
which  the  coordinates  describing  the  system  are 
separated  into  two  sets:  the  control  space,  which  is 
responsible  for  the  unique  negative  eigenvalue  in 
the  respective  force  constant  matrix  associated  with 
the  transition  vector  (TV)  [27],  and  the  comple¬ 
mentary  space,  which  includes  the  remaining  coor¬ 
dinates.  The  geometry  optimizations  were  carried 
out  alternatively  on  each  subspace,  one  at  a  time, 
until  a  stationary  structure  was  obtained.  Finally,  a 
complete  analytical  optimization  was  achieved  for 
the  whole  space  of  all  variables.  The  Berny  analyti¬ 
cal  gradient  optimization  routines  [28,29]  were 
used.  The  nature  of  each  stationary  point  was 
checked  by  diagonalizing  the  mass-weighted  Hes¬ 
sian  matrix  to  determine  the  number  of  imaginary 
frequencies  (zero  for  local  minima  and  one  for  the 
TS).  The  TSs  were  verified  to  be  the  right  ones  by 
tracing  the  intrinsic  reaction  coordinate  (IRC)  [30] 
path  from  each  TS  to  the  two  lower-energy  struc¬ 
tures  it  connects  by  using  the  second-order 
Gonzalez-Schlegel  integration  method  [31,32].  In 
the  present  work,  the  CH3MgCl  (I),  2CH3MgCl 
(II),  (CH3)2Mg  (III),  and  (CH3)2Mg  +  Cl2Mg  (IV) 
organomagnesium  reagents  were  selected  while 
the  2-hydroxypropanal  compound  was  employed 
to  represent  the  chiral  a-alkoxy  carbonyl  com¬ 
pounds. 
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Results  and  Discussion 

To  discriminate  between  alternative  reaction 
channels,  a  characterization  of  the  PESs  including 
the  location  of  stationary  points  is  essential.  An 
extensive  exploration  of  the  different  PESs  has 
rendered  two  minima  (chelate  complexes  and 
products)  and  one  TS.  The  addition  of  CH3MgCl 
to  the  carbonyl  group  of  2-hydroxypropanal  has 
rendered  two  competitive  reaction  channels:  the 
anti  and  syn  attacks.  The  corresponding  chelate 
complexes,  ICha  and  IChs,  TSs  associated  with  the 
1,3-migration  of  the  methyl  group,  ITSa  and  ITSs, 
and  the  corresponding  products,  IPa  and  IPs,  re¬ 
spectively,  are  localized  on  the  PES.  A  schematic 
representation  of  the  structures  of  these  stationary 
points  is  shown  in  Figure  1. 


The  stationary  points  for  the  addition  of  two 
equivalents  of  organomagnesium  reagents,  2CH3- 
MgCl,  to  2-hydroxypropanal  are  depicted  in  Fig¬ 
ure  2.  The  addition  processes  take  place  along  an 
intermolecular  mechanism  via  attack  on  anti  and 
metal-chelated  anti  conformations,  IlChaa,  IITSaa, 
and  IlPaa;  attack  on  anti  and  metal-chelated  syn 
conformation,  IlChas,  IITSas,  and  IlPas;  attack  on 
syn  and  metal  chelated  anti  conformation,  IlChsa, 
IITSsa,  and  IlPsa;  and  attack  on  syn  and  metal- 
chelated  syn  conformation,  IlChss,  IITSss,  and 
IIPss,  respectively. 

The  pathways  corresponding  with  the  addition 
process  of  (CH3)2Mg  to  2-hydroxypropanal  render 
six  stationary  points  along  anti  and  syn  chelation 
processes,  IllCha,  IllChs,  IIITSa,  IIITSs,  IllPa, 
and  IIIPs,  as  shown  in  Figure  3.  Finally,  an  inter¬ 
molecular  process  corresponding  to  the  addition  of 


IChs 

ITSs 

IPs 

-26.4 

-4.2/22.1 

-54.7 

FIGURE  1 .  Reactants,  chelate  complexes,  TSs,  and  product  structures  for  the  anti  (upper  path)  and  syn  (lower  path) 
addition  of  CHgMgCI  to  2-hydroxypropanal.  Relative  energies,  including  zero-point  energies  and  thermal  corrections,  in 
kcal/  mol,  to  reactants  (total  energy  =  -965.439246  au)  and  barrier  heights,  In  kcal/mol  (after  the  slash).  The  arrows 
on  the  TSs  represent  the  reaction  coordinate  vectors.  The  atom  labels  are  included. 
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FIGURE  2.  (a)  Reactants,  chelate  complex,  TSs,  and  products  for  the  anti  and  syn  addition  of  two  equivalents  of 
CHgMgCI  to  2-hydroxypropanal.  Relative  energies,  including  zero-point  energies  and  thermal  corrections,  in  kcal/mol, 
to  reactants  (total  energy  =  -1664.166056  au)  and  barrier  heights,  in  kcal/mol,  (after  the  slash). 


(CH3)2Mg  +  Cl2Mg  to  2-hydroxypropanal  was 
considered.  In  this  case,  the  molecule  of  Cl2Mg 
forms  the  chelate  structure  and  the  molecule  of 
(CH3)2Mg  carried  out  the  nucleophilic  attack  on 
the  carbonyl  group.  The  corresponding  stationary 
points  are  presented  in  Figure  4:  IVCha,  IVChs, 
IVTSa,  IVTSs,  IVPa,  and  IVPs. 

A  schematic  representation  of  the  structures  of 
the  stationary  points  is  shown  in  Figures  1-4.  The 
completely  optimized  geometries  are  available 
from  the  authors  on  request.  An  analysis  of  the 
results  reveals  slight  differences  depending  on  the 
model  system.  The  chelate  structures  are  puckered 
five-membered  rings;  the  metal  is  coordinated  to 
the  lone  pair  of  the  carbonyl  oxygen  and  the 
methoxy  oxygen  is  not  pyramidalized.  Similar 
structures  have  been  found  for  related  chelates 
using  X-ray  methods  [33]  and  related  theoretical 
calculations  [4,34,35].  There  is  a  correlation  be¬ 


tween  reactivity  and  stereoselectivity  that  demon¬ 
strates  the  mechanistic  importance  of  chelation,  the 
chelate  structure  being  maintained  throughout  the 
reaction  path. 

The  intramolecular  conversion  of  the  chelate 
complex  to  the  product  takes  place  via  the  corre¬ 
sponding  TS  with  retention  of  the  chelate  configu¬ 
ration,  For  CH3MgCl  and  (€113)2 Mg  systems,  the 
nucleophilic  methyl  group  of  the  organomagne- 
sium  compound  is  located  over  or  above  the 
chelate  ring  in  the  TS  structure.  The  steric  interac¬ 
tions  with  the  a-substituted  group,  CH2R3,  are 
avoided  in  the  case  of  anti  attack,  which  is  consis¬ 
tently  favored. 

The  inclusion  of  a  second  CH3MgCl  molecule 
yields  an  assisted  intermolecular  mechanism, 
where  the  first  molecule  forms  the  chelate  struc¬ 
ture  and  the  second  CH3MgCl  molecule  carries 
out  the  nucleophilic  addition  to  the  carbonyl  group. 
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IITSsa  IITSss 


FIGURE  2.  (b)  A  schematic  representation  of  the  TSs  for  the  anti  and  syn  addition  of  two  equivalents  of  CHgMgCI  to 
2-hydroxypropanal.  The  arrows  on  the  TSs  represent  the  reaction  coordinate  vectors.  The  atom  labels  are  included. 


This  fact  is  in  agreement  with  experimental  data 
reported  by  Reetz  et  al.  [36]. 

The  addition  of  CH3MgCl  to  2-hydroxypro¬ 
panal  presents  a  barrier  height  of  21-22  kcal/mol, 
the  products  being  the  most  stable  species  (28-30 
kcal/mol  below  the  chelate  complexes).  In  this 
case,  both  methods  render  that  the  anti  attack  is 
the  most  favorable  pathway.  The  ratio  between  the 
product  obtained  by  means  of  an  anti  attack  with 
respect  to  that  obtained  by  a  syn  attack  was  calcu¬ 
lated  in  our  early  work  [37]  assuming  a  Boltz¬ 
mann's  distribution  of  the  TS  leading  to  the  two 
diastereomers.  However,  the  presence  of  a  transi¬ 


tion  structure  controlling  the  interconversion  be¬ 
tween  both  chelate  complexes,  TSc,  on  the  PES 
opens  up  the  possibility  of  an  equilibrium  between 
both  chelate  complexes,  and  according  to  the 
Curtin-Hammett  principle  [38-40],  the  product 
ratio  is  therefore  not  determined  by  the  relative 
energies  of  the  TSs  with  respect  to  their  corre¬ 
sponding  chelate  complexes  but,  instead,  depends 
primarily  on  the  relative  energy  of  the  two  TSs 
with  respect  to  each  other.  In  this  case,  a  complete 
kinetic  analysis  gives  [22,41]  the  following  prod¬ 
uct  ratio  equation: 


[IPa] 

[IPs] 


K 

K 


(1) 
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FIGURE  3.  Reactants,  chelate  complexes,  TSs,  and  products  for  the  anti  (upper  path)  and  syn  (lower  path)  addition 
of  (CH3)2Mg  to  2-hydroxypropanal.  Relative  energies,  including  zero-point  energies  and  thermal  corrections,  In 
kcal  /  mol,  to  reactants  (total  energy  =  -545.435254  au)  and  barrier  heights,  In  kcal  /  mol  (after  the  slash).  The  arrows 
on  the  TSs  represent  the  reaction  coordinate  vectors. 


To  do  such  calculations,  the  rate  constants 

and  k^^  (Fig.  1)  must  be  found,  this  can  be 
achieved  by  means  of  the  TS  theory  (TST)  [42-44], 
assuming  that  the  transmission  coefficient  is  equal 
to  1.  A  pressure  and  temperature  of  1  atm  and 
195.15  K  was  taken  in  the  S  and  k  calculations, 
respectively.  The  product  ratio  for  the  diastere- 
omers  obtained  by  means  of  the  anti  attack  of  the 
organomagnesium  reagent  methyl  group  with  re¬ 
spect  to  those  obtained  by  means  of  a  syn  attack  is 
89:11.  Using  MP2/6-31G*,  the  activation  barriers 
associated  to  ITSa  and  ITSs  are  significantly  re¬ 
duced  from  the  HF/6-31G*  results  (ca.  5-6 
kcal /mol),  having  very  similar  values.  As  a  conse¬ 
quence,  a  decrease  in  stereoselectivity  is  sensed 
(68:32).  This  is  corrected  by  using  the  correlation 
energy  obtained  at  the  MP3  level  (87:13),  This  fact 
can  be  related  to  the  known  tendency  of  MP2 
calculations  to  overcorrect  the  too  high  HF  results 
and  shows  that  MP2  calculations  overestimate  the 
effect  of  the  electron  correlation  energy  on  the  TSs 


with  respect  to  that  of  the  reactants,  in  agreement 
with  previous  studies  of  related  Diels-Alder  reac¬ 
tions  [45-51]. 

The  TV  renders  very  concisely  the  essentials  of 
the  chemical  process  under  study.  The  correspond¬ 
ing  components  of  TV  for  the  variables  defining 
the  reaction  coordinate  vectors  are  depicted  in 
Figures  1-4  and  are  available  from  the  authors 
upon  request.  The  TSs  are  well  associated  with 
forming/breaking  bonds,  as  can  be  seen  from  the 
main  components  of  the  TVs.  The  results  provide 
good  hints  for  a  qualitative  and  semiquantitative 
analysis. 

An  analysis  of  the  TV  shows  that  the  major 
components  are  Cl — C(Mg),  C(Mg) — Mg,  Cl-Ol, 
and  Mg — 01  distances,  the  Cl — C(Mg)  distance 
being  the  most  important.  The  minimal  set  of 
coordinates  capable  of  producing  the  TS  are  these 
variables.  This  result  is  obtained  after  reducing  the 
size  of  the  control  space  followed  by  the  diagonal- 
ization  of  the  corresponding  Hessian  matrix.  It  is 


724 


VOL.  65,  NO.  5 


ADDITION  OF  ORGANOMAGNESIUM  REAGENTS 


IVChs  IVTSs  IVPs 


-46.32  -38.8/7.5  -109.2 

FIGURE  4.  Reactants,  chelate  complexes,  TSs,  and  products  for  the  anti  (upper  path)  and  syn  (lower  path)  addition 
of  (CH3)2Mg  plus  Cl2Mg  to  2-hydroxypropanal.  Relative  energies,  including  zero-point  energies  and  thermal  corrections, 
in  kcal/mol,  to  reactants  (total  energy  =  -1664.157711  au)  and  barrier  heights,  In  kcal/mol  (after  the  slash).  The 
arrows  on  the  TSs  represent  the  reaction  coordinate  vectors. 


important  to  note  that  while  the  atoms  participat¬ 
ing  in  the  control  space  are  always  the  same,  those 
in  the  complementary  space  are  not.  If  these  latter 
do  not  modulate  in  first  order  the  force  constants 
in  the  control  space,  the  TV  is  invariant.  The  in¬ 
variance  is  to  be  understood  in  terms  of  preserva¬ 
tion  of  the  reactive  fluctuation  patterns  associated 
with  the  chemical  interconversion  processes.  One 
of  the  interesting  results  obtained  is  a  sort  of 
geometric  invariance  of  the  fragments  participat¬ 
ing  in  a  TS  geometry.  This  fact  has  been  found  by 
us  [25,26,52]  and  other  workers  [53].  Structural 
invariance  of  the  TS  structure  fragments  is  an 
important  result  from  the  technical  viewpoint.  It 
helps  setting  up  the  search  of  TSs. 

In  the  case  of  the  addition  of  two  equivalents  of 
CH3MgCl  to  the  carbonyl  group  of  the  2-hydroxy¬ 
propanal,  the  barrier  heights  are  predicted  to  be 
lower  than  in  the  1:1  stoichiometry  case  (3-9 
kcal/mol),  the  anti  attack  on  the  anti  chelate  being 
the  most  favorable  pathway.  The  ratio  between  the 


products  obtained  by  means  of  anti  attacks  with 
respect  to  those  obtained  by  syn  attacks  can  be 
calculated  assuming  a  Boltzmann  distribution  of 
the  TSs  leading  to  the  two  diastereoisomers,  by 
using  the  following  equation  [54]: 

anti/syn  =  Yj  jJL  (2) 

where  anti  and  syn  refer  to  the  TS  conformation 
and  AE^  and  AE^  are  the  relative  energies  (with 
the  zero-point  vibrational  energies  and  tempera¬ 
ture  corrections)  of  the  TSs  for  the  anti  and  syn 
attacks,  respectively,  with  respect  to  the  corre¬ 
sponding  chelate  complexes.  The  product  ratio  ob¬ 
tained  (>  99:1,  anti:  syn)  agrees  very  well  with  the 
experimental  ratio,  obtained  by  Carda  et  al.  [55]. 
The  corresponding  TVs  for  the  variables  defining 
the  control  space  are  shown  in  Figure  2(b).  These 
TVs  are  well  associated  with  forming/breaking 
bonds.  Cl — C(Mg),  C(Mg) — Mg,  Cl — Ol,  and  Mg 
— Ol,  as  can  be  seen  from  their  main  components. 
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The  barrier  heights  for  the  addition  of  (CH3)2Mg 
to  2-hydroxypropanal  are  lower  than  in  the  case  of 
the  addition  of  1  equiv  of  CH3MgCl  but  higher 
than  in  the  intermolecular  processes  (2  equiv  of 
CH3MgCl).  The  barrier  heights  are  14-16 
kcal/mol,  the  products  being  the  most  stable 
species  (36-38  kcal/mol  below  the  chelate  com¬ 
plexes).  In  this  case,  we  find  that  the  anti  attack  is 
the  most  favorable  pathway,  with  a  lower  energy 
barrier  and  a  larger  stability  for  the  products  than 
for  the  syn  attack.  The  product  ratio  is  calculated 
by  Eq.  (2),  yielding  89:11  anti /syn.  The  corre¬ 
sponding  TVs  for  the  variables  defining  the  control 
space  are  shown  in  Figure  3.  These  TVs  are  well 
associated  with  forming/breaking  bonds, 
Cl^Cl(Mg),  Cl(Mg)— Mg,  Cl  — Ol,  and 
Mg — Ol,  as  can  be  seen  from  the  main  compo¬ 
nents. 

Since  Grignard  reagents  are  an  equilibrium  mix¬ 
ture  of  CH3MgCl,  (CH3)2Mg,  and  Cl2Mg,  all  of 
which  can  react  with  chiral  a-alkoxy  carbonyl 
compounds,  we  have  considered  the  addition  with 


(CH3)2Mg  4-  Cl  2  Mg.  In  this  case,  the  latter  system 
forms  the  chelate  intermediate  and  the  former  one 
carries  out  the  nucleophilic  attack.  The  barrier 
heights  for  the  addition  of  (CH3)2Mg  +  Cl  2  Mg  to 
2-hydroxypropanal  are  lower  than  in  the  case  of 
the  addition  of  1:1  CH3MgCl  and  (CH3)2Mg,  but 
higher  than  in  the  intermolecular  processes  (2:1  of 
2CH3MgCl).  The  barrier  height  is  7-16  kcal/mol, 
the  products  being  the  most  stable  species  (62-63 
kcal/mol  below  the  chelate  complexes).  In  this 
case,  the  syn  attack  is  the  most  favorable  path 
way  with  a  lower  energy  barrier  and  greater  stabil¬ 
ity  for  the  products  than  for  the  anti  attack.  The 
product  ratio  is  calculated  by  Eq.  (2)  and  is  >  99:1 
syn /anti.  The  corresponding  TVs  for  the  variables 
defining  the  control  space  are  shown  in  Figure  4. 
The  TVs  are  well  associated  with  the  forming/ 
breaking  bonds.  Cl — Cl(Mg),  Cl(Mg) — Mg, 
Cl — 01,  and  Mg — Ol,  as  can  be  seen  from  the 
main  components. 

In  Figure  5,  the  stationary  points  belonging  to 
the  different  reaction  pathways  of  the  addition 


FIGURE  5.  A  schematic  view  of  the  reaction  pathways  studied  in  this  work.  R1  corresponds  with  the  addition  of 
CHgMgCI  to  2-hydroxypropanal  while  R2  Is  associated  with  the  reaction  of  (CHglgMg  plus  Cl2Mg  and  2-hydroxypropanal. 
The  names  of  the  different  stationary  points  appearing  along  the  reaction  pathways  are  defined  in  the  text.  IllCh 
represents  IllCha  and  IllChs,  which  have  the  same  energy  and  roughly  the  same  structure. 
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processes  are  schematically  depicted.  In  this  repre¬ 
sentation,  a  global  perspective  of  the  reactive  PES 
is  summarized  in  order  to  compare  the  different 
molecular  mechanisms  found  in  this  work. 


Conclusions 

Quantum  chemical  characterization  of  TSs  may 
be  rationalized  to  discuss  molecular  mechanisms 
for  organic  reactions.  The  theory  of  nucleophilic 
addition  of  organometallic  reagents  to  carbonyl 
compounds  is  still  a  much  debated  subject,  partic¬ 
ularly  with  respect  to  the  origin  of  stereoselectiv¬ 
ity.  The  following  factors  are  found  to  be  dominant 
in  the  molecular  mechanism  for  the  carbonyl  addi¬ 
tion  of  organomagnesium  reagents  to  chiral  a-al- 
koxy  carbonyl  compounds: 

(i)  The  formation  of  syn  and  anti  chelate 
complexes  is  an  exothermic  process  in  the 
gas  phase,  corresponding  to  the  first  step 
in  the  addition  process,  and  takes  place 
without  an  energy  barrier.  The  magne¬ 
sium  is  coordinated  to  the  lone  pair  of  the 
carbonyl  oxygen  and  to  the  methoxy  oxy¬ 
gen.  The  chelate  complexes  can  be  de¬ 
scribed  as  puckered  five-membered  rings. 

(ii)  syn  and  anti  chelate  intermediates  deter¬ 
mine  the  stereochemistry  of  the  addition 
process,  maintaining  the  conformation 
throughout  the  reaction  pathway. 

(iii)  The  C — C  bond-forming  stage,  corre¬ 
sponding  to  the  1,3-intramolecular  migra¬ 
tion  from  chelate  complex  to  products,  is 
the  second  and  rate-limiting  step  for  the 
addition  process.  The  corresponding  tran¬ 
sition  structures  are  four-membered  rings 
and  the  most  favorable  pathway  is  the  anti 
attack. 

(iv)  The  inclusion  of  a  second  molecule  of  an 
organomagnesium  reagent  decreases  the 
barrier  height  and  the  process  can  be  de¬ 
scribed  as  an  assisted  intermolecular 
mechanism:  The  first  molecule  forms  the 
chelate  structure  and  the  second  molecule 
carries  out  the  nucleophilic  addition  to  the 
carbonyl  group. 

(v)  The  reaction  pathway  for  the  nucleophilic 
attack  of  CH3MgCl  +  2-hydroxypropanal 
has  a  barrier  height  larger  than  the 
(CH3)2Mg  +  2-hydroxypropanal  addition. 


CH3MgCl  is  a  more  powerful  quelant 
agent  than  is  (CH3)2Mg. 

(vi)  The  most  favorable  pathway  corresponds 
to  an  anti  attack  via  an  intermolecular-as- 
sisted  mechanism  for  the  addition  of 
2CH3MgCl.  These  theoretical  results  are 
in  agreement  with  the  experimental  data 
of  Reetz  et  al.  [36]  and  with  the  di- 
astereoisomeric  excess  obtained  by  Carda 
et  al.  [55]. 

(vii)  The  inclusion  of  the  solvent  effects  into 
the  calculation  can  lead  to  a  better  (quali¬ 
tative)  description  of  the  product  ratio. 
However,  studies  with  similar  systems  [4] 
show  that  polarity  effects  on  the  structures 
and  energies  of  the  reactions  are  rather 
small. 
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ABSTRACT:  The  molecular  mechanisms  for  the  ring  openings  of  cyclopropanone, 

2.2- dimethylcyclopropanone,  frflns-2,3-di-fcrf-butylcyclopropanone,  and 
spiro(bicyclo[2.2.1]heptane-2.T-cyclopropan)-2'-one  systems  were  studied  at  the  PM3 
semiempirical  level  in  the  gas  phase  and  including  solvent  effects.  The  behavior  of  the 
solvent  polarity  was  considered  by  using  the  SCRF  polarizable  continuum  method.  Six 
solvents  were  selected:  hexane,  ether,  tetrahydrofuran,  pyridine,  acetone,  and  acetonitrile. 
An  extensive  exploration  of  the  potential  energy  surface  using  analytical  gradient 
techniques  allows  the  characterization  of  stationary  points  associated  to  the 
stereomutation  conversion.  Along  a  disrotatory  ring-opening  mechanism,  cyclopropanone, 

2.2- dimethylcyclopropanone  and  frans-2,3-di-f^rf-butylcyclopropanone  are  intraconverted 
via  an  oxyallyl  intermediate.  The  epimeric  forms  of  the  spiro(bicyclo[2.2.1]heptane-2.r- 
cyclopropan)-2'-one  intraconvert  along  two  competitive  pathways  correspond  to  two-step 
processes  by  a  disrotatory  ring-opening  mechanism.  Two  oxyallyl  intermediates  and  four 
transition  structures  were  obtained  and  the  corresponding  transition  vectors  are  associated 
to  the  carbon-carbon  bond-breaking  process  and  the  dihedral  angle  measuring  the 
conrotatory  movement  of  the  plane  defined  by  the  three  carbon  atoms  of  the 
cyclopropanone  ring.  The  oxyallyl  intermediates  and  the  transition  structures  for  the  four 
model  systems  present  similar  structures  and  energies  and  they  are  located  on  a  rather 
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flat  region.  The  analysis  of  the  theoretical  results  shows  that  the  solvent  reaction  field 
decreases  the  energy  barriers  for  the  ring-opening  processes  and  a  stabilization  of  the 
oxyallyl  intermediates  takes  place.  The  calculated  relative  barrier  heights  are  in  good 
agreement  with  the  experimental  data  available,  and  the  trends  in  the  kinetics  can  be 
explained  primarily  by  steric  interactions.  Nevertheless,  for  the 

spiro(bicyclo[2.2.1]heptane-2.r-cyclopropan)-2'-one  system,  it  is  necessary  to  include  a 
specific  interaction  of  a  discrete  molecule  of  the  nucleophilic  solvent  on  the  quantum 
mechanical  representation  to  explain  the  experimental  behavior.  ©  1997  John  Wiley  & 
Sons,  Inc.  Int  J  Quant  Chem  65:  729-738,  1997 


Introduction 

Stereomutation  rearrangements  involving  cyclo- 
propanone  systems  has  been  the  subject  of  contin¬ 
uous  interest  from  experimental  and  theoretical 
viewpoints  [1-7].  The  molecular  mechanism  corre¬ 
sponding  to  the  ring-opening  process  via  oxyallyl 
intermediates  is  a  common  reaction  with  impor¬ 
tant  applications  in  organic  synthesis  [8-10].  Nu¬ 
merous  theoretical  studies  have  been  reported  on 
this  type  of  stereomutation  [5,  11-16],  many  of 
them  for  the  purpose  of  elucidating  the  nature  of 
the  ring-opening  process  [1,  2,  4,  5,  7,  11  16].  In 
particular,  the  cyclopropanone  ring  can  be  postu¬ 
lated  as  a  reactive  intermediate  in  the  Favorskii 
rearrangement  [10,  17-21].  Recently,  we  reported 
quantum  chemical  calculations  on  the  mechanism 
of  the  Favorskii  reaction  [22].  In  addition,  some 
efforts  have  been  devoted  in  our  laboratory  to  the 
theoretical  treatment  of  solvent  effects  in  chemical 
reactivity  studies  [23-25].  The  results  indicate  that 
the  activation  energies  and  relative  rate  constants 
can  be  estimated  with  good  accuracy  at  a  high 
level  of  theory,  but  more  important  for  organic 
chemists,  even  simple  calculations  at  moderate 
levels  of  theory,  such  as  semiempirical  procedures, 
give  useful  information  of  the  solvent  effects  on 
the  molecular  mechanism  for  different  chemical 
rearrangements.  Calculations  of  the  properties  of 
stationary  points  located  on  the  reactive  potential 
energy  surface  (PES)  of  great  complexity  can  be 
characterized  by  using  semiempirical  molecular 
orbital  theory.  We  tried  to  determine  the  reaction 
mechanisms  and  how  the  solute-solvent  interac¬ 
tions  can  affect  the  different  barrier  heights  and 
the  relative  energies  of  reactants,  possible  interme¬ 
diates,  transition  structures  (TSs)  and  products  on 
PES. 

Solvent  effects  or,  more  generally,  environmen¬ 
tal  effects  can  be  accounted  for  by  two  different 


approaches:  adding  discrete  solvent  molecules 
around  the  subsystem  of  interest,  the  super¬ 
molecule  approach,  or  placing  them  in  a  cavity 
surrounded  by  a  continuum  approach  [26-28].  In 
the  first  approach,  specific  solvent  effects  are  suc¬ 
cessfully  represented,  while  in  the  second,  nonspe¬ 
cific  and  long-range  interactions  are  appropriately 
included.  In  the  continuum  models  [26],  the  solute 
is  embedded  in  a  cavity,  while  the  solvent,  treated 
as  a  continuous  medium,  characterized  by  some 
macroscopic  properties,  e.g.,  the  same  dielectric 
constant  as  that  of  the  bulk  liquid,  is  incorporated 
into  the  solute  Hamiltonian  as  a  perturbation.  In 
this  reaction  field  approach,  the  bulk  medium  is 
polarized  by  the  solute  molecules.  The  electronic 
distribution  of  the  solute  polarizes  the  continuum 
which  generates  an  electric  field  inside  the  cavity, 
which,  in  turn,  affects  the  solute's  geometry  and 
electronic  structure.  This  is  a  convenient  model  for 
describing  electrostatic  interactions  due  to  the  sol¬ 
vent  polarity  while  short-range  interactions  are  not 
correctly  represented  by  this  method.  The  imple¬ 
mentation  of  this  interaction  scheme  is  achieved 
through  the  self-consistent  reaction  field  method 
developed  initially  by  Tapia  et  al.  [29,  30]. 

Kinetic  data  for  stereomutation  on  cyclo¬ 
propanone  systems  derive  from  experimental  re¬ 
sults  in  the  solution  phase  [1-4,  6,  7],  which  is 
often  obtained  with  a  great  deal  of  effort.  Substitu¬ 
tion  on  the  cyclopropane  ring  has  subtle  influences 
on  the  kinetics  for  ring  cleavage  and  the  origin  of 
these  kinetic  effects  is  not  clear.  We  describe  herein 
a  complementary  theoretical  analysis  carried  out 
with  a  PM3  semiempirical  procedure  to  study  the 
ring  opening  of  various  substituted  cyclopropan¬ 
one  systems,  which  have  generated  considerable 
interest  among  organic  and  physical  chemists  due 
to  important  mechanistic  roles  that  have  been  dis¬ 
cerned  but  not  entirely  elucidated  [1-7,  12-17].  It 
is  of  considerable  interest  to  know  the  energy 
profiles  of  the  corresponding  reaction  pathways  in 
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the  gas  phase  and  in  the  presence  of  different 
solvents.  This  computational  investigation  con¬ 
tributes  to  a  better  understanding  of  the  stereomu¬ 
tation  via  a  ring-opening  process,  being  the  main 
objective  the  characterization  of  the  reactive  poten¬ 
tial  energy  surface  in  the  gas  phase  including  the 
environmental  effects  and  thus  to  obtain  the  na¬ 
ture  of  the  molecular  mechanism  and  to  rationalize 
the  experimental  observations. 


Computational  Methods  and  Models 

PM3  semiempirical  calculations  [31]  were  car¬ 
ried  out  with  the  GAUSSIAN94  package  of  pro¬ 
grams  [32].  The  PESs  were  calculated  in  detail  to 
ensure  that  all  relevant  stationary  points  were  lo¬ 
cated  and  properly  characterized.  The  exact  loca¬ 
tion  of  the  TSs  was  achieved  by  using  an  algorithm 
[33-35]  in  which  the  coordinates  describing  the 
system  are  separated  into  two  sets:  the  control 
space,  which  is  responsible  for  the  unique  negative 
eigenvalue  in  the  respective  force  constants  matrix, 
and  the  remaining  coordinates  set,  which  is  called 
complementary  space.  The  geometry  optimizations 
were  carried  out  alternatively  on  each  subspace, 
one  at  a  time,  until  a  stationary  structure  was 
obtained.  Finally,  a  complete  analytical  optimiza¬ 
tion  of  the  TS  was  achieved  with  an  "eigenvalue 
following"  optimization  method  [36,  37]  for  all 
variables. 

The  intrinsic  reaction  coordinate  (IRC)  [38]  path¬ 
ways,  from  the  TSs  down  to  the  two  lower-energy 
structures,  were  traced  using  the  second-order 
Gonzalez-Schlegel  integration  method  [39,  40]  in 
order  to  verify  that  each  saddle  point  links  the  two 
putative  minima.  The  requested  convergence  on 
the  density  matrix  was  10“^  atomic  units  and  the 
threshold  value  of  maximum  displacement  was 
0.0018  A  and  that  of  maximum  force  was  0.00045 
Hartree/bohr  using  the  Berny  analytical  gradient 
optimization  routine  [41,  42].  The  nature  of  each 
stationary  point  was  checked  by  diagonalizing  the 
Hessian  matrix  to  determine  the  number  of  imagi¬ 
nary  frequencies  (zero  for  the  local  minima  and 
one  for  the  TSs).  The  unique  imaginary  frequency 
associated  with  the  transition  vector  (TV)  [43]  of 
the  different  TSs  was  analyzed. 

The  energies  of  the  stationary  structures  on  the 
PES  including  solvent  effects  was  calculated  at  the 
PM3  semiempirical  level  using  the  Conductor-like 
Screening  Model  (COSMO)  option  [44]  included  in 


the  MOPAC93  package  program  [45].  This  COSMO 
model,  proposed  by  Klamt  and  Schuurmann  [44], 
calculates  the  electrostatic  solvation  energy  by  rep¬ 
resenting  the  solute  charge  distribution  as  a  set  of 
point  charges  and  dipoles  in  the  neglect  differen¬ 
tial  diatomic  overlap  formalism.  The  cavity  chosen 
for  the  calculations  was  the  solvent-excluded  sur¬ 
face  obtained  by  overlapping  spheres.  The  liquid  is 
assimilated  to  a  continuum  characterized  by  its 
dielectric  constant.  Six  solvents  were  selected:  hex¬ 
ane,  ether,  tetrahydrofuran,  pyridine,  acetone,  and 
acetonitrile. 

Four  systems  were  studied:  cyclopropanone  (C), 
2,2-dimethylcyclopropanone  (D),  trans-2,3-di-tert- 
butylcyclopropanone  (T),  and  spiro(bicyclo[2.2.1] 
heptane-2.r-cyclopropan)-2'-one  (S)  in  the  gas 
phase  and  in  six  different  solvents:  hexane,  ether, 
tetrahydrofuran,  pyridine,  acetone,  and  acetoni¬ 
trile. 


Results  and  Discussion 

An  extensive  exploration  of  the  PES  by  means 
of  the  PM3  method  was  carried  out  in  order  to 
asses  the  different  stationary  points  related  to  the 
ring-opening  process.  The  geometries  of  cyclo¬ 
propanone  (C),  2,2-dimethylcyclopropanone  (D), 
and  trans-2,3-di-terf-butylcyclopropanone  (T),  as 
well  as  the  corresponding  oxyallyl  structures  and 
transition  structures  are  illustrated  in  Figures  1-3, 
respectively.  For  the  spiro(bicyclo[2.2.1]heptane- 
2.T-cyclopropan)-2'-one  system  (S),  two  reaction 
pathways  depicted  corresponding  to  the  two  pos¬ 
sible  disrotatory  ring-opening  processes  for  this 
steromutation  were  characterized;  for  this  system, 
two  oxyallyl  intermediates  and  four  TSs  were  lo¬ 
cated.  The  structures  associated  to  the  stationary 
points  of  both  channels  are  illustrated  in  Figure  4. 
The  labels  of  the  structures  corresponding  to  TSs 
and  oxyallyl  intermediates  are  designed  by  the 
letters  TS  and  I,  respectively.  The  stationary  points 
are  distinguished  from  each  other  by  appending 
the  label  C,  D,  T,  or  S.  For  the  S  system,  the  two 
reactive  channels  are  designed  by  the  letters  A  and 
B,  and  the  numbers  1  and  2,  for  both  steps  of  the 
two  stepwise  mechanisms.  Thus,  the  localized  sta¬ 
tionary  points  are:  S,  TS-SAl,  TS-SBl,  I-SA,  I-SB, 
TS-SA2,  TS-SB2,  and  S'. 

The  most  relevant  geometrical  parameters  of  the 
stationary  points  obtained  for  the  different  systems 
are  depicted  in  Figures  1-4.  Optimized  geometries 
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<C2C1C3  =  62.48 
<C1C2C3  =  58.75 
|i  =  0.87D 
E  =  -0.005724  (0.00) 


C2-C3=  2.353 
<04-Cl-C2-H5=  0.0 
<H7-C3-C2.H5=  0.0 
H=1.97D 
E  =  0.085156  (57.03) 


C2-C3=  2.354 
<O4-Cl-C2-H5  =  0.49 
<H7-C3-C2-H5  =  0.0 
^1=1.97  0 
E  =  0.085155  (57.03) 


FIGURE  1.  Representation  of  the  optimized  geometries  for  the  stationary  points:  cyclopropanone,  TS  for  disrotatory 
ring  opening,  and  oxyally!  intermediate.  Distances  in  angstroms  and  bond  angles  in  degrees.  Values  in  italics  (in 
parentheses)  are  net  atomic  charges. 


of  all  structures  are  available  from  the  authors  on 
request.  For  C,  the  optimized  structure  agrees  with 
the  experimental  data  obtained  by  Nelson  and 
Pierce  [46]  from  a  microwave  analysis  and  the 
computed  geometries  obtained  by  Lim  et  al.  [5]  at 
the  (4/4)CASSCF/6-31G*  calculation  level.  The 
TSs  and  the  oxyallyl  intermediates  are  located  on  a 
very  flat  region  of  the  PES.  On  going  from  cyclo¬ 


propanone  to  the  TSs  or  oxyallyl  intermediate,  the 
electronic  charge  redistribution  shows  that  an  ex¬ 
cess  of  negative  charge  is  developed  on  the  car¬ 
bonyl  carbon  atom  and  to  a  lesser  extent  on  the 
peripheral  carbon  atoms  (see  Figs.  1-4).  Some  in¬ 
teresting  observations  emerge  from  the  Mulliken 
population  analysis  [47].  There  is  no  change  in  the 
Mulliken  charges  between  the  TSs  and  oxyallyl 


<C2-C1-C3  =  62.68 
<04-Cl-C2-H5  = 
<H5-C2-C1-H6  ^ 

\i  =  0.88  D 
£  =  -0.029321  (0.00) 


C2-C3  =  2.351 
<04-Cl-C2-H5=  10.62 
<H5-C2-C1-H6=  181.17 
^  =  2.07  D 

E  =  0.042899  (45.32) 


C2-C3  =  2.355 
<O4-Cl-C2-H5  =  0.35 
<H5-C2-C1-H6=  180.22 
H  =  2.15D 

E  =  0.042741  (45.22) 


FIGURE  2.  Representation  of  the  optimized  geometries  for  the  stationary  points:  2,2-dimethylcyclopropanone,  TS  for 
disrotatory  ring  opening,  and  oxyallyl  intermediate.  Distances  in  angstroms  and  bond  angles  in  degrees.  Values  in 
italics  (in  parentheses)  are  net  atomic  charges. 
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<C3-C1-C2  =  62.70 
<04-C]-C2-H5  =  285.08 
<H5-C2-CI-C6  =  213.00 
III  0.90  D 
E= -0.078467  (0.00) 


C2-C3  =  2.387 
<04-Cl-C2-H5=  180.01 
<H5-C2-C1-C6  =  179.97 
11=  1.98  D 
E  =  0.001436  (50.14) 


C2-C3  =  2.420 
<04-CI-C2-H5  =  179.82 
<H5-C2-C1-C6=  180.06 
H=1.97D 
£  =  0.001126(49.95) 


FIGURE  3.  Representation  of  the  optimizecj  geometries  for  the  stationary  points:  frans-2, 3-di-fert-butylcyclopropanone, 
TS  for  disrotatory  ring  opening,  and  oxyallyl  intermediate.  Distances  in  angstroms  and  bond  angles  in  degrees.  Values 
in  italics  (in  parentheses)  are  net  atomic  charges. 


intermediates.  The  imaginary  frequency  for  the 
TSs  TS-C,  TS-D,  and  TS-T  are  131.8i,  65.4i,  and 
70,3i  cm"\  respectively.  For  moclel  S,  the  TSs 
TS-SAl,  TS-SBl,  TS-SA2,  and  TS-SB2  present  the 
following  imaginary  frequency  values:  64.4i,  54. 9i, 
78.0i,  and  49.0i  cm“\  respectively.  The  corre¬ 
sponding  TVs  are  associated  to  a  disrotatory  ring 
opening,  as  expected.  These  four  TSs  yield  similar 
optimized  geometrical  parameters  and  energies. 
The  predicted  energy  barrier  between  the  cyclo- 
propanone  systems  and  TSs  are  in  the  range  of 
45.3-57.0  kcal/mol.  For  the  simple  model  C,  calcu¬ 
lations  at  high  level  of  theory  [5]  show  a  barrier 
height  of  32.4  kcal/mol.  As  expected,  the  PM3 
results  tends  to  overestimate  this  barrier.  Fiow- 
ever,  our  goal  was  to  rationalize  the  experimental 
behavior  of  this  stereomutation  process  for  more 
complex  systems  in  various  solvents. 

The  TSs  are  energetically  and  structurally  very 
similar  to  oxyallyl  intermediates.  The  principal  dif¬ 
ferences  are  the  rotation  of  a-carbon  atoms  of  the 
cyclopropanone  rings,  in  the  range  0.1-20.4°,  out 
of  plane  in  the  TS,  and  the  C — O  distance  of  the 
carbonyl  group  that  increases  in  the  range  of 
0.052-0.060  A,  on  going  from  cyclopropanone  sys¬ 
tems  to  oxyallyl  intermediates.  The  energy  differ¬ 
ences  between  the  TSs  and  oxyallyls  intermediates 
are  less  than  0,2  kcal/mol.  These  structures  can  be 
described  as  diradicals  with  a  strong  carbon-oxy¬ 
gen  double  bond  and  an  electron  mainly  localized 
on  the  terminal  carbon  atoms.  An  analysis  of  the 


barrier  energies  shows  that  the  rate  of  the  intercom 
version  processes  follows  the  order  S  >  D  >  T  > 
C.  This  order  can  be  explained  by  two  additive 
effects — (i)  Flyperconjugative  effects:  Alkyl  substi¬ 
tution  strongly  stabilizes  the  TS  or  the  oxyallyl 
intermediate  for  the  ring-opening  process.  This  hy- 
perconjugative  stabilization  is  more  effective  by 
the  norbornane,  S  system,  than  by  the  tert-bntyl 
groups,  methyl  groups,  and  hydrogen  atoms  in  the 
T,  D,  and  C  systems,  respectively,  (ii)  The  size  of 
the  steric  hindrances  of  this  reaction  on  going  from 
the  cyclopropanones  to  the  TSs  corresponding  to 
the  ring-opening  process:  In  a  disrotatory  mecha¬ 
nism,  ring  opening  of  the  T  system  forces  a  tertiary 
alkyl  group  into  a  1,3-c/s  allylic  interaction  with  a 
hydrogen  atom  in  the  ring-opened  oxyallyl  inter¬ 
mediate.  For  the  D  system,  this  repulsive  interac¬ 
tion  is  less  hindered  and  takes  place  between  two 
hydrogen  atoms.  In  contrast,  due  to  the  presence 
of  the  rigid  norbornane  ring  in  the  S  system,  the 
rotation  movement  around  C2-C3  and  C3-C4 
bonds  is  smaller  than  are  those  of  freely  rotating 
analogs  and  the  1,3-cfs-allylic  interaction  involves 
the  ring  hydrogen  and  the  oxyallyl  hydrogen.  Con¬ 
sequently,  a  substantial  promotion  of  the  stereo¬ 
mutation  process  can  be  expected. 

The  next  step  in  our  investigation  is  the  study 
of  the  solvent  effects  on  the  basic  features  of  the 
stereomutation  reaction  and  we  are  now  in  a  posi¬ 
tion  to  compare  our  results  with  those  of  Greene 
et  al.  [1,  2]  for  the  T  system,  Cordes  and  Berson  [4, 
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<C1-C2-C3  =  59.02 
<C2-C1-C3  =  62.43 
<04-Cl-C2-H5  =  287.5^ 
<H6-C2-C1-H5  =  144.41 
11  =  0.89  D 

E  = -0.012548  (-0.01) 


(a) 


TS-SAl 


Cl-C2=  1.411 
C2-C3  =  2.324 
<C2-C1-C3  =  109.32 
<04-Cl-C2-H5=  194.01 
<H6-C2-C1-H5  =  179.99 
H  =  2.16D 
£  =  0.062908(47.29) 


TS-SBl 


(b) 


C1-C2  =  1.410 
C2-C3  =  2.344 
<C2-C1-C3  =  110.74 
<04-Cl-C2-H5=  190.12 
<H6-C2-C1-H5  =  180.57 
H  =  2.16D 
£  =  0.064073  (48.16) 


C2-C3  =  2.360 
<04-Cl-C2-H5  =  174.18 
<H6-C2-Cl-H5  =  178.51 
11  =  2.23  D 
£  =  0.064309(48.17) 


FIGURE  4.  Representation  of  the  optimized  geometries  for  the  stationary  points: 

spiro(bicyclo[2.2.1]heptane-2.f-cyclopropan)-2'-one,  TS  for  disrotatory  ring  opening,  and  oxyallyl  intermediate. 
Distances  in  angstroms  and  bond  angles  in  degrees.  Values  in  italics  (in  parentheses)  are  net  atomic  charges. 
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7]  for  the  S  system,  and  the  theoretical  work  of 
Lim  et  al.  [5]  on  the  C  and  D  systems.  The  relative 
barrier  heights  vs.  the  solvatochromic  parameter 
Ej(30)  (Dimroth-Reichardt  parameter)  [48-51]  for 
the  six  solvents  are  depicted  in  Figure  5.  As  could 
be  expected  from  the  electrostatic  data,  TSs  and 
oxyallyl  intermediates  present  identical  solvation 
effects;  however,  preferential  solvation  of  these 
stationary  points  would  be  expected  due  to  that 
they  are  more  polar  than  are  the  cyclopropanone 
ground-state  systems.  It  should  be  noted  that  the 
dipole  moments  are  increased  in  the  range  of 
1.07-1.34  D  upon  the  ring  opening  of  the  corre¬ 
sponding  cyclopropanones  for  the  four  model  sys¬ 
tems.  This  is  an  indication  that  solvation  influ¬ 
ences  the  stability  of  these  structures  to  a  different 
extent.  An  analysis  of  the  results  shows  that  the 
negative  of  the  relative  barrier  height  increases  on 
going  from  the  nonpolar  hexane  to  the  polar  ace¬ 
tonitrile  solvent.  A  change  to  a  more  polar  solvent 
will  increase  the  reaction  rate  due  to  that  TSs  are 
more  polar  than  are  the  initial  cyclopropanone 
systems.  The  low  values  of  these  relative  energies 
for  the  steromutations  in  solution  reflect  the  dirad¬ 
ical  rather  than  the  zwitterionic  nature  of  the 
oxyallyls,  while  the  presence  of  a  zwitterion  is 
accompanied  by  much  larger  solvent  effects  [49]. 


FIGURE  5.  Relative  barrier  heights  (kcal/mol)  for  the 
ring-opening  process  vs.  solvent  parameter  Ej(30) 
(Dimroth-Reichardtterm).  The  ordinate  shows  the 
negative  of  the  barrier  height  relative  to  that  in  hexane 

taken  as  zero.  ( - )  Cyclopropanone;  ( - ) 

2,2-dimethylcyclopropanone;  (•“♦ ) 
frans-2,3-di-fert-butylcyclopropanone;  (— • ) 
spiro(bicyclo[2.2.1]heptane-2.1'-cyclopropan)-2'-one. 


Additionally,  our  results  are  in  reasonable  agree¬ 
ment  with  the  Monte  Carlo  simulations  reported 
by  Lim  et  al.  [5]  for  the  C  and  D  models.  This  fact 
supports  the  use  of  the  PM3  semiempirical  proce¬ 
dure  to  study  the  steromutation  processes  in  solu¬ 
tion. 

In  Figure  6,  a  comparison  between  our  results 
and  experimental  data  for  the  S  [4,  7]  system  is 
given.  The  values  of  the  relative  barrier  heights  vs. 
the  Ej  (30)  for  the  spirocyclopropanone  S  are  not 
correlated  with  the  experimental  data  for  the  T 
system  or  with  the  values  theoretically  predicted 
for  the  more  simple  D  and  C  models.  In  particular, 
Cordes  and  Benson  [7]  found  that  the  rate  of  stere¬ 
omutation  in  the  S  system  is  slower  in  ether  and  in 
acetone  than  in  hexane  and  in  methylene  chloride. 
This  deceleration  was  explained  by  Cordes  and 
Benson  [7]  in  terms  of  a  nucleophilic  association  of 
these  solvents  with  the  carbonyl  group  in  the  cy¬ 
clopropanone  ring,  leading  to  a  ground-state  stabi¬ 
lization.  Reaction  field  representation  of  solvent 
effects  is  a  convenient  model  for  describing  elec¬ 
trostatic  interactions  due  to  the  solvent  polarity, 
liowever,  short-range  interactions  are  not  correctly 
represented  by  this  method.  Therefore,  to  under¬ 
stand  the  role  played  by  specific  and  nonspecific 


FIGURE  6.  Relative  barrier  heights  (kcal/mol)  for  the 
stereomutation  process  vs.  solvent  parameter  £^(30) 
(Dimroth  -Reichardt  term)  for  the 
spiro(bicyclo[2.2.1]heptane-2.t-cyclopropan)-2'-one 

system.  ( - )  Experimental  values  reported  by  Cordes 

and  Berson  (these  authors  used  the  free  energy  of 

activation)  [7].  ( - )  Theoretical  data  including  specific 

interaction  in  the  nucleophilic  solvent. 
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solute-solvent  interactions  in  the  stereomutation 
process  of  the  S  system  in  solution  and  under¬ 
standing  how  nucleophilic  solvents  alter  the  bar¬ 
rier  height  for  the  ring  opening,  we  studied  the 
reaction  including  a  single  discrete  molecule  of  the 
nucleophilic  solvents. 

The  apparent  contradiction  between  experiment 
and  theory  could  be  overcome  taking  into  account 
a  single  molecule  of  nucleophilic  solvents  in  the 
quantum  mechanical  treatment.  Thus,  to  explore 
this  aspect,  specific  interactions  between  the  ni¬ 
trogen  or  oxygen  lone  pairs  of  the  nucleophilic 
solvents  (ether  and  acetone)  and  the  vulnerable 
carbonyl  carbon  of  the  cyclopropanone  ring  were 
considered.  Following  the  extensive  crystal  struc¬ 


ture  studies  of  Biirgi  et  al.  [52-54]  and  previous 
analysis  by  Cordes  et  al.  [7,  55],  a  model  in  which 
the  distance  between  the  oxygen  atom  and  the 
carbon  atom  of  the  cyclopropanone  ring,  O*** 
C  =  0,  is  fixed  to  a  value  of  2.6  A  (near  van  der 
Waals  contact)  and  0***C  =  0  angles  of  98'"  (a 
reasonable  Biirgi-Dunitz  angle)  was  selected.  The 
minima  and  the  TSs  were  also  located  in  the  pres¬ 
ence  of  this  additional  solvent  molecule  with  an 
arrangement  depicted  in  Figure  7.  The  analysis  of 
the  results,  presented  in  Figure  6,  shows  that  this 
extra  stabilization  (2.8  and  4.0  kcal/mol  for  ether 
and  acetone  solvent,  respectively)  is  more  favor¬ 
able  in  the  cyclopropanone  ring  systems  than  in 
the  TSs  or  oxyallyl  intermediates,  because  the  con- 


FIGURE  7.  A  schematic  view  of  the  optimized  geometries  for  the  spiro(bicyclo[2.2.1]heptane-2.1'-cyclopropan)-2'-one 
system  interacting  with  a  single  molecule  of  the  nucleophilic  solvent:  (a)  ether;  (b)  acetone.  The  distance  between  the 
oxygen  atom  of  the  solvent  and  the  carbonyl  carbon  of  the  cyclopropanone  ring  was  fixed  at  2.6  A  and  the  angle 
(<  oxygen  atom -carbonyl  carbon -carbonyl  oxygen)  was  fixed  at  98°.  Distances  in  angstroms  and  bond  angles  in 
degrees. 
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centrated  charge  in  the  ground  state  becomes  dis¬ 
persed  in  the  TSs.  This  result  is  in  good  agreement 
with  the  experimental  data.  In  contrast,  the  racem- 
ization  of  the  T  system  has  not  an  anomalous 
behavior  in  nucleophilic  solvents  due  to  the  pres¬ 
ence  of  tert-buiyl  groups,  preventing  the  formation 
of  this  interaction. 


Conclusions 

In  this  work,  we  carried  out  a  PM3  theoretical 
study  of  the  solvent  effects  along  the  reaction 
profiles  on  the  corresponding  PES  for  the  ring¬ 
opening  process  of  the  cyclopropanone,  2,2-di- 
methylcyclopropanone,  frflns-2,3-di-fcrt-butylcyc- 
lopropanone,  and  spiro(bicyclo[2.2.1]heptane-2.1'- 
cyclopropan)-2'-one  systems.  The  polarizable  con¬ 
tinuum  model  and  the  inclusion  of  a  discrete 
molecule  of  nucleophilic  solvents  are  used  to  intro¬ 
duce  solvent  effects.  The  continuum  and  discrete 
solvent  models  offer  a  complementary  view  of  the 
nature  of  solute -solvent  interactions.  The  specific 
details  of  the  process  may  change  at  higher  levels 
of  theory  (e.g.,  ab  initio,  inclusion  of  correlation 
energy),  but  despite  the  approximate  procedure  of 
the  calculations  employed  here,  some  important 
features  were  clarified.  The  following  conclusions 
can  be  drawn  from  the  results  reported  in  this 
study: 

(i)  The  stereomutation  processes  take  place  via 
an  oxyallyl  intermediate  along  a  disrota- 
tory  ring-opening  mechanism.  The  corre¬ 
sponding  TSs  and  oxyallyl  intermediates 
are  structurally  and  energetically  related. 

(ii)  The  oxyallyl  intermediates  and  the  TSs  for 
the  four  model  systems  are  shallow  min¬ 
ima  on  a  rather  flat  region,  indicating  the 
proximity  of  the  oxyallyls  to  the  transition 
structures  for  the  ring-opening  processes. 

(iii)  The  solvent  reaction  field  decreases  the  en¬ 
ergy  barriers  for  the  stereomutation  pro¬ 
cesses  and  a  stabilization  of  the  oxyallyl 
intermediates  takes  place.  A  good  agree¬ 
ment  between  the  theoretical  results  and 
experimental  data  is  found. 

(iv)  For  the  stereomutation  of  the  spiro(bicyclo 
[2.2.1]heptane-2.T-cyclopropan)-2'-one  sys¬ 
tem,  the  inclusion  of  one  discrete  molecule 
of  the  nucleophilic  solvents  forming  an 
specific  interaction  between  the  lone  pairs 


of  ether,  acetone,  acetonitrile,  pyridine,  or 
tetrahydrofuran  molecules  and  the  car¬ 
bonyl  group  of  the  cyclopropanone  ring  is 
able  to  explain  the  experimental  data.  This 
discrete  molecule  plays  an  active  role  in 
the  stereomutation  process,  showing  that 
this  interaction  is  more  favorable  with  the 
carbonyl  group  of  the  cyclopropanone  sys¬ 
tem  than  is  the  TS  for  the  ring  opening  or 
the  oxyallyl  intermediate. 
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ABSTRACT:  As  a  model  of  olefin  polymerization  for  a  heterogeneous  Ziegler-Natta 
catalyst,  the  mechanisms  of  the  insertion  of  ethylene  and  propylene  into  R-Ti- 
(CI2  A1H2)^(R  =  CH3,  C3H7)  are  studied  by  ab  initio  self-consistent  field  (SCF)  methods 
and  many-body  perturbation  theory.  The  structures  of  the  reactant,  the  intermediate,  the 
transition  state,  and  the  product  are  optimized  by  the  Hartree-Fock  molecular  orbital 
calculation  level.  The  reactions  are  classified  into  two  steps:  the  first  forms  of  olefin-Ti  tt 
complex,  and  the  second  step  is  the  carbon-carbon  bond  formation  through  the  pull-push 
mechanisms.  The  cocatalysis  play  a  role  in  facilitating  the  pull-push  mechanism  with  the 
Al-Cl  bonds  alternation.  For  the  insertion  reaction  of  propylene,  the  four-type 
conformations  for  attacking  of  propylene  (two  primary  forms:  methyl-substituted  carbon 
attacks  to  the  carbon  side  of  Ti-C  bond;  two  secondary  forms:  methyl-substituted  carbon 
attack  to  Ti  atom)  are  studied.  For  the  complex  formation,  the  secondary  forms  are  more 
stable  in  energy  than  the  primary  forms.  For  the  transition  state,  the  primary  forms  are 
more  stable  in  energy  than  the  secondary  forms.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant 
Chem  65:  739-747,  1997 


Introduction 

It  is  widely  known  that  Ziegler-Natta  catalysis 
polymerization  is  one  of  the  most  important 
industrial  polymerization  reaction,  and  many  ex¬ 
perimental  studies  have  been  reported  [1-10],  The 
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manufacture  for  Ziegler-Natta  type  polymeriza¬ 
tion  is  based  on  heterogeneous  catalyst  which  have 
become  highly  selective  and  efficient.  Classical 
Ziegler  catalysts  are  heterogeneous  with  the  poly¬ 
merization  taking  place  on  dislocations  and  edges 
of  TiCl4  crystals.  Heterogeneous  titanium-based 
Ziegler  catalysts  are  more  versatile;  they  are  able 
to  polymerize  propene  with  a  very  high  degree  of 
stereoselectivity  and  catalyze  the  copolymerization 
of  ethylene  with  higher  alk-l-enes  such  as  hex-1- 
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ene,  important  for  the  production  of  flexible, 
nonbrittle  polymers  for  use  as  films  and  in  packag¬ 
ing.  Although  there  are  successful  processes  which 
give  rise  to  a  wide  range  of  polymer  products,  the 
diversity  of  active  sites  in  heterogeneous  catalysts 
leads  to  an  uneven  degree  of  comonomer  incorpo- 
ration,  with  a  high  incorporation  rate  in  short 
chains,  and  little  incorporation  in  the  high-molecu- 
lar-weight  fraction,  a  drawback  where  metal¬ 
locenes  offer  particular  promise.  In  1959  Bres- 
low  and  Newburg  [11]  suggested  for  their 
[TiCl2Cp2]AlClEt2  system  that  the  transition  metal 
is  first  alkylated  and  then  forms  a  halide-bridged 
binuclear  complex  which  is  capable  of  reaction 
with  ethylene.  In  spectroscopic  study  of  the  forma¬ 
tion  of  ion  pair  in  Cp2TiCl2  and  MeAlCl2  system, 
both  groups  of  Long  [12]  and  Eisch  [9]  did  not 
observe  the  ions  (Cp2TiCH3  and  AlClJ ).  The  pos¬ 
sibility  that  Cp2TiCl2:Cl2AlCH3  might  be  the  ac¬ 
tive  catalysts  is  ruled  out  by  the  ultraviolet  (UV) 
studied  of  Long  [12]  and  the  kinetic  studies  of 
Fink  [13].  Eisch  and  co-workers  [14]  by  H,  C, 
and  A1  nuclear  magnetic  resonance  (NMR)  spec¬ 
troscopy,  also  concluded  that  the  ion  pair 
[(Cp2TiCH2SiMe3)+— AICI4:  included  Ti™Cl-Al 
bridged  bond]  in  Cp2Ti(Cl)CH2SiMe3  and  AICI3 
system  are  effective  catalysts  for  the  polymeriza¬ 
tion  of  ethylene. 

For  the  generic  mechanism  of  olefin  insertion  of 
the  Cossee  mechanism  [10]  has  been  accepted 
widely  as  the  most  plausible.  The  first  step  is 
olefin  coordination  to  a  vacant  site  of  Ti  atom.  In 
the  second  step  olefin  is  inserted  into  the  Ti-C 
bond  through  a  four-membered  cyclic  transition 
state.  Many  theoretical  studies  of  Ziegler-Zatta 
catalysis  polymerization  have  been  reported.  Most 
of  these  studies  discuss  the  reaction  mechanism  in 
Ziegler-Natta  polymerization  based  on  Cossee's 
model,  Cossee's  model  was  treated  with  semiem- 
pirical  self-consistent  field  (SCF)  molecular  orbital 
(MO)  methods  by  Armstrong  and  co-workers  [15] 
and  by  Novaro  and  co-workers  [16].  Their  results 
suggested  that  the  Ziegler-Natta  mechanism  might 
be  explained  by  a  concerted  motion  of  the  olefin 
and  alkyl  group  that  would  bring  about  a  transi¬ 
tion  from  octahedral  coordination  to  a  trigonal-bi- 
pyramidal  coordination  for  the  Ti  atom.  This  view 
was  also  confirmed  with  the  ab  initio  SCF  MO 
method  by  dementi  and  co-workers  [17,  18].  Bal- 
azs  and  Johnson  [19]  demonstrated  the  reaction 
CH3TiCl4  +  C2H4  by  using  a  SCF-X,-SW  MO 
method.  The  interaction  between  C2H4  and 
CH3TiCl  J  was  examined  by  ab  initio  MO  methods 


[20,  21]  and  by  density  functional  methods  [22].  In 
the  more  realistic  systems  of  homogeneous  cataly¬ 
sis  (including  Cp),  hydrozirconation  of  ethylene 
and  acetylene  by  Cp2Zr(H)Cl  and  (or)  Cp2TiCFl3 
was  studied  by  ab  initio  MO  methods  [23,  24]. 
The  insertions  of  ethylene  into  Cp2ZrCHj  and 
(SiH2Cp2)ZrCHj  were  also  treated  by  ab  initio 
MO  method  [25]  and  by  density  functional  meth¬ 
ods  [26-28].  The  stereotacticity  of  zirconium  met¬ 
allocene  Ziegler-Natta  propylene  polymerization 
catalysis  was  studied  using  a  combination  of  ab 
initio  MO  method  and  empirical  force  field  molec¬ 
ular  mechanics  techniques  [29-34],  However, 
above  theoretical  studies  did  not  reported  for  the 
mechanism  of  cocatalysis  of  Ziegler-Natta  poly¬ 
merization.  The  present  author  has  proposed  the 
fundamental  carbon-carbon  bond  formation 
mechanisms  with  a  simple  model  of  Cossee  type 
in  a  previous  study  [35].  The  proposed  mecha¬ 
nisms  were  on  the  basis  of  electronic  pull-push 
process:  the  electrons  in  the  Ti-R  bond  move  to 
the  region  of  new  carbon-carbon  bond  formation, 
and  77  electrons  of  ethylene  move  to  the  region  of 
new  Ti-carbon  bond  formation.  These  mechanisms 
were  the  same  as  those  of  alumination  reaction  of 
olefin  as  shown  in  our  previous  studies  [36,  37].  In 
the  study  [35],  the  role  of  cocatalysis  for  the  Zieg¬ 
ler-Natta  type  reaction  was  also  reported  on  the 
basis  of  the  classical  model  (CH3TiCl4AlH2  + 
H2C  =  CH2).  For  this  model,  it  was  clarified  that 
the  cocatalysis  play  a  role  in  facilitating  the  pull- 
push  mechanism  through  the  Al-Cl  bond  alterna¬ 
tion.  Although  we  clarified  entirely  the  role  of 
cocatalysis  for  the  model,  we  could  not  explain  the 
stereotacticity  for  polymerization  on  the  basis  of 
the  model.  Consequently,  for  the  explanation  of 
the  role  of  cocatalysis  and  the  stereoreguration  for 
the  heterogeneous  Ziegler-Natta  polymerization, 
the  present  author  proposes  a  new  model  as  the 
following: 
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This  model  is  more  realistic  compared  with  the 
previous  one.  For  our  model  of  olefin  polymeriza¬ 
tion,  a  heterogeneous  as  well  as  homogeneous 
catalysis  has  been  adopted. 

In  this  study,  two  subjects  are  reported.  The 
first  is  the  role  of  cocatalysis  for  olefin  insertion 
mechanism.  The  second  is  the  stereoregulation  of 
propylene  polymerization  on  the  basis  of  our 
model. 


Method  of  Calculation 

The  basis  sets  used  for  Ti  atom  through  the 
study  were  3-21G  of  Dobbs  and  Hehre  [38].  For  C, 
H,  Cl,  and  A1  atoms  the  standard  3-21G  basis 
functions  [39,  40]  were  used.  The  set  is  denoted  at 
the  following  sections  as  BS-L  For  better  energies 
the  6-31G(d)  basis  functions  [41,  42]  for  carbon 
atom  and  the  (8s/4p/3d)  contracted  functions  [43] 
for  Ti  atom  were  used,  denoted  as  BS-II.  All  equi¬ 
librium-  and  transition-state  geometries  were  de¬ 
termined  by  the  use  of  analytically  calculated 
energy  gradients  with  Hartree-Fock  (HF)  wave 
functions.  The  stationary  points  were  identified  as 
the  equilibrium  or  the  saddle  point  by  examining 
the  calculated  normal  vibrational  frequencies.  The 
force  constant  matrix  and  thereby  the  vibrational 
frequencies  were  calculated  by  analytical  second- 
derivative  procedures  [44,  45],  and  zero-point  vi¬ 
brational  energy  corrections  were  obtained  at  this 
level.  Additional  calculations  were  performed  to 
obtain  improved  energy  comparisons — the  calcu¬ 
lations  at  the  HF-optimized  structures  with  elec¬ 
tron  correlation  incorporated  through  the  second- 
and  third-order  Moller-Plesset  (MP2  and  MP3) 
perturbation  theory  [46~50]. 

Ab  initio  molecular  orbital  calculations  were 
carried  out  by  using  the  GAUSSIAN94  program 
[51]. 


Results  and  Discussion 

INSERTION  OF  ETHYLENE  INTO 
H3CTi(Cl2AlH2)J  AND  ROLE 
OF  COCATALYSIS 

The  stationary  point  geometries  of  reactant, 
complex,  transition  state,  and  product  for 
(Cl2AlH2)2TiCH3  +  H2C  =  CH2  reaction  are  illus¬ 
trated  in  Figure  1.  For  all  stationary  points,  Ti,  CP, 
Cl^,  and  three  carbon  atoms  are  almost  in  the  same 


plane.  The  stationary  point  geometry  of  reactant 
has  a  Cg  symmetry.  The  Cg  symmetry  plane  bisects 
each  angle  of  CP-Ti-CP  and  CP-Ti-CP,  and  is  in 
the  plane  of  Ti-C-H^.  For  the  reactant  conforma¬ 
tion,  there  are  two  possible  spaces  for  approaching 
of  olefin:  CP~Ti-C-CP  and  CP-Ti-C-CP  spaces. 
The  space  of  CP-Ti-C-CP  is  more  wide  than  the 
space  of  CP-Ti“C-CP  from  the  comparison  of  the 
angels  of  CPTiC(H3)  and  CPTiC(H3).  The  CP-Ti 
(and  CP-Ti)  bond  length  is  a  little  shorter  than  the 
CP-Ti  (and  CP-Ti)  bond  length.  The  difference 
between  CP-Ti  and  CP-Ti  bond  lengths,  how¬ 
ever,  is  only  0.019  A  in  length.  Consequently,  the 
space  of  CP-Ti-C-CP  is  favorable  for  the  ap¬ 
proaching  of  olefin. 

For  the  tt  complex,  C  ^  carbon  in  ethylene  inter¬ 
acts  stronger  with  the  Ti  atom  than  the  C“  carbon, 
because  of  the  repulsion  of  C“  and  CP  atom  (or 
Ti-CP  bond).  In  the  catalysis  part,  the  bond  be¬ 
tween  Ti  atom  and  CP  becomes  longer  by  0.321  A 
in  length  than  that  in  the  reactant.  Because  the  CP 
atom  moves  to  the  reverse  side  of  methyl-Ti  bond 
for  Ti  atom  during  the  complex  formation.  The 
longer  Ti-CP  leads  to  be  the  shorter  CP-Al  bond, 
the  longer  CP-Al  bond,  and  the  shorter  Ti-CP 
bond  in  comparison  with  those  of  the  reactant. 
Accordingly,  this  part  (Ti,  CP,  Al,  and  CP)  is  a 
complex  between  CP-AIH2  and  Ti-CP.  On  the 
other  hand,  the  variation  of  Ti-CP  bond  for  the  tt 
complex  formation  is  small  (only  0.03  A  in  length). 
As  a  result,  the  one  side  of  cocatalysis  is  active  for 
the  77-complex  formation  and  the  other  is  not  ac¬ 
tive. 

For  the  transition  state,  the  Ti-CP  bond  be¬ 
comes  shorter  in  length  than  that  of  the  77  com¬ 
plex.  This  behavior  facilitates  the  bond  breaking  of 
Ti-C(H3).  Because  the  insertion  mechanism  is  the 
pull-push  as  shown  in  our  previous  work  [35,  36], 
the  electrons  of  Ti-C(H3)  bond  move  to  the  region 
of  new  C-C  bond  formation  and  original  77  elec¬ 
trons  of  ethylene  part  move  to  the  region  of  the 
new  C-Ti  bond  formation.  Accordingly,  the  ap¬ 
proach  of  CP  to  Ti  atom  facilitates  the  moving  of 
the  electrons  of  Ti-C(H3)  bond  region.  On  the 
other  side,  the  Ti-CP  bond  length  becomes  longer 
than  that  of  the  77  complex.  This  behavior  facili¬ 
tates  moving  the  77  electrons  of  the  ethylene  part 
easily  to  the  new  C-Ti  bond  formation  region  (the 
reverse  side  of  Ti-CP  bond  for  Ti  atom). 

For  the  product,  polymer  grows  up  for  the  more 
wide  space  of  Cl-Ti-Cl.  The  mechanism  of  co¬ 
catalysis  is  shown  in  Figure  2.  From  the  figure,  the 
bonds  alternation  for  Cl-Al  and  Ti-Cl  bond  occurs 
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FIGURE  1,  HF/BS-!  optimized  geometries  of  reactant,  tt  complex,  transition  state,  and  product  for  reaction 
H3CTi(Cl2AlH2)2+  H2C  =  CH2.  Distances  and  angles  are  in  angstrom  and  degrees,  respectively. 


during  the  one  process  of  polymerization.  It  is  also 
considered  that  this  is  the  reaction  of  olefin  inser- 
tion  into  H3C-TiClJ  plus  cocatalysis  (C1A1H2)2- 
Many  theoretical  works  [22,  30]  treated  the  inser¬ 
tion  of  ethylene  into  H3C-TiCl2  as  a  model  of  the 
homogeneous  catalysis  of  [Cp2TiCH3]^.  The  po¬ 
tential  energy  surface  of  H3CTiCl2  ethylene  is 
different  from  that  of  our  system.  For  Cl2TiCH3 -i- 
ethylene,  the  complexation  energy  at  the  tt  com¬ 
plex  is  over  40  kcal/mol.  On  the  other  hand,  the 
complexation  energy  of  our  system  is  about  10 
kcal/mol.  TheTarge  difference  of  the  complexation 
energies  comes  from  the  distribution  of  the  posi¬ 
tive  charge  by  the  cocatalysis.  The  influence  of  the 


complexation  energy  on  the  charge  distribution 
can  be  seen  for  the  other  homogeneous  systems. 
The  TT  complexation  energy  for  Cp2TiCH3  +  ethyl¬ 
ene  was  proposed  by  Wess  and  co-workers  [24]: 

—  5.3  kcal/mol  for  the  HF  calculation  and  -23.7 
kcal/mol  for  the  MP2  calculation  level.  Yoshida 
and  co-workers  [25]  proposed  the  tt  complexation 
energy  for  H2SiCp2TiCFlJ+  ethylene  by  various 
methods:  —13.0  kcal/mol  for  the  HF  calculation, 

—  28.6  kcal/mol  for  the  MP2  calculation,  —20.9 
kcal/mol  for  the  MP3  calculation,  and  -21.9 
kcal/mol  for  QCISD  calculation  levels.  For  these 
homogeneous  systems,  the  tt  complexation  ener¬ 
gies  by  the  HF  calculation  are  almost  the  same  to 
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FIGURE  1.  Continued. 


the  value  for  our  system.  Yoshida  and  co-workers 
pointed  out  that  the  calculation  values  for  their 
system  are  far  from  convergence  in  the  perturba¬ 
tion  series.  Accordingly  the  difference  between  the 
calculated  77  complexation  energies  by  the  HF  and 
by  the  MP2  levels  is  quite  large  (15.6  kcal/mol)  for 
the  homogeneous  system.  For  our  system,  the  dif¬ 
ference  of  the  77  complexation  energies  by  the  HF 
and  by  the  MP2  calculations  is  only  4.2  kcal/mol. 
The  ethylene  binding  energy  for  H2SiCp2TiCHj  is 
21.9  kcal/mol  (best  value),  and  is  the  middle  value 
of  those  for  C^TiCHj  and  for  our  system. 

For  the  activation  energy  at  the  transition  state 
— the  energy  is  given  relative  to  the  77  complex  of 
[Ti]-CH3  and  ethylene — 13.8  kcal/mol  for  the  HF 
calculation  and  4.3  kcal/mol  for  the  MP2  calcula¬ 
tion  level  for  C^TiCHj  system  were  proposed 
[30].  For  the  H2SiCp2TiCH3  system  [25],  14.3 
kcal/mol  for  the  HF  calculation,  1.3  kcal/mol  for 
the  MP2  calculation,  10.9  kcal/mol  for  the  MP3 
calculation,  and  7.1  kcal/mol  for  the  RQCISD  cal¬ 
culation  levels  were  proposed.  The  activation  ener¬ 
gies  from  the  77  complex  for  our  system  is  9.3 
kcal/mol  for  the  HF  calculation,  0  kcal/mol  for 
the  MP2  calculation,  and  3  kcal/mol  for  the  MP3 
calculation  levels.  The  activation  energy  barrier 
height  for  our  system  is  about  7-2  kcal/mol  lower 
than  those  for  the  other  homogeneous  systems.  It 
is  considered  that  this  energy  difference  comes 
from  the  energy  reduced  by  the  cocatalysis.  This 
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FIGURE  2.  Mechanism  of  olefin  insertion  into  Ti-CH3 
bond  of  H3CTi(Cl2AIH2)^ 


low-energy  barrier  corresponds  to  the  fact  that  the 
reaction  of  heterogeneous  catalysis  is  more  active 
than  that  of  homogeneous  catalysis. 

ISOTACTIC  STEREOREGULATION  IN 

PROPYLENE  POLYMERIZATION 

For  evaluation  of  regio-  and  stereoregulatory 
capabilities,  we  have  to  compare  the  four  possible 
conformations  for  77  complex  and  transition  state 
shown  in  Chart  1  the  differences  in  the  geometry 
and  the  total  energy.  The  four  77  complexes  and 
four  transition-state  geometries,  primary  A,  pri¬ 
mary  B,  secondary  A,  and  secondary  B  shown 
in  Chart  1,  for  propylene  insertion  into 
H3C — Ti(Cl2AlH2)2  have  been  optimized  by  the 
HF/BS-I  method.  The  some  geometry  parameters 
and  the  relative  energies,  relative  to  primary  A  for 
all  the  case,  are  summarized  in  Table  II. 

For  77  complex,  the  difference  of  the  distances  of 
Ti-C  (C“  and  C  bonds  are  drastic  for  the  confor¬ 
mations  of  the  primary  and  secondary.  This  comes 
from  the  77-electron  distribution  of  propylene. 
Namely  the  unsubstituted  carbon  in  propylene  has 
more  77-electron  density  than  methyl-substituted 
carbon.  It  is  widely  known  that  the  77  complex  is 
formed  with  the  interaction  between  the  empty 
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TABLE  I 


Relative  energies  from  the  reactant  for  ethylene  polymerization  and  propylene  polymerization  (kcal  /  mol). 


Method 

HF/BS-I 

MP2/BS-I 

HF/BS-!I 

MP2/BS-II 

MP3/BS-II 

Ethylene  +H3CTi(Cl2AIH2)2 

Reactant 

0.0 

0.0 

0.0 

0.0 

0.0 

TT  Complex 

-13.1 

-14.4 

-8.7 

-12.9 

-11.3 

Transition  state 

-2.2 

-12.1 

0.6 

-13.7 

-8.3 

Product 

-31.6 

-31.6 

-30.4 

-34.5 

-34.3 

Propylene  +H3CTi(Cl2AIH2)2 

Reactant 

0.0 

0.0 

0.0 

0.0 

77  Complex 

Primary  A 

-18.5 

-20.2 

-13.6 

-19.3 

Primary  B 

-18.9 

-19.9 

-14.0 

-19.0 

Secondary  A 

-20.9 

-20.4 

-16.3 

-19.5 

Secondary  B 

-20.6 

-20.5 

-15.7 

-19.5 

Transition  state 

Primary  A 

-5.9 

-16.0 

-1.9 

-18.0 

Primary  B 

-6.3 

-16.6 

-2.0 

-18.1 

Secondary  A 

-3.0 

-14.0 

1.2 

-14.1 

Secondary  B 

-2.8 

-14.0 

1.9 

-13.9 

orbital  of  transition  metal  and  tt  orbital  of  olefin. 
Consequently,  the  unsubstituted  carbon  interacts 
stronger  with  Ti  atom  than  the  methyl-substituted 
carbon.  The  differences  between  the  total  energies 
of  the  primary  and  secondary  conformations  for 
the  HF  and  MP2  calculation  levels  are  2  and  0.2-0.3 
kcal/mol,  respectively.  The  conformations  of  the 
secondary  are  a  little  favorable  in  energy  to  com¬ 
pare  with  those  of  the  primary.  The  energy  differ¬ 
ence,  however,  is  very  small,  and  the  stereo-regu¬ 
lation  of  the  product  is  not  decided  probably  by 
the  formation  of  the  tt  complex.  The  conformation 


of  the  secondary  is  similar  to  that  of  the  ethylene 
insertion.  The  energy  difference  between  A  and  B 
for  both  of  the  primary  and  secondary  is  little 
(under  1  kcal/mol).  The  tt  complexation  energy 
for  propylene  is  about  7-8  kcal/mol  more  stable 
than  that  for  the  ethylene.  The  difference  of  these 
complexation  energies  for  both  systems  comes  from 
the  TT-electron  distribution  with  methyl-substitu¬ 
tion  group.  As  the  results,  the  tt  complexation 
energy  have  more  influence  for  the  electronic  part 
of  the  methyl-substitution  group  than  that  for  the 
steric  part. 


TABLE  II _ 

Ti  -  C  bond  distance  and  the  relative  energy  from  primary  A  geometry  for  propylene  polymerization. 


Bond  distance  (A)  Relative  energy  (kcal  /  mol) 


Ti  -C“ 

Ti-C^ 

Ti-CCHg) 

C“-C^ 

C'^-C(H3) 

HF/BS-I 

MP2/BS-I 

HF/BS-II 

MP2/BS~II 

Complex 

Primary  A 

2.355 

3.001 

2.034 

1.361 

3.055 

0.0 

0.0 

0.0 

0.0 

Primary  B 

2.348 

3.029 

2.029 

1.361 

3.213 

-0.38 

0.27 

-0.40 

0.30 

Secondary  A 

3.008 

2.325 

2.026 

1.369 

3.047 

-2.47 

-0.19 

-2.71 

-0.28 

Secondary  B 

3.003 

2.327 

2.025 

1.367 

3.053 

-2.10 

-0.27 

-2.17 

-0.20 

Transition  state 

Primary  A 

2.092 

2.540 

2.139 

1.432 

2.108 

0.0 

0.0 

0.0 

0.0 

Primary  B 

2.092 

2.537 

2.141 

1.431 

2.105 

-0.37 

-0.61 

-0.12 

-0.11 

Secondary  A 

2.195 

2.512 

2.121 

1.406 

2.172 

2.89 

1.92 

3.08 

3.96 

Secondary  B 

2.199 

2.511 

2.116 

1.406 

2.174 

3.14 

2.00 

3.79 

4.14 
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For  the  transition  state,  the  Ti-C“  bond  lengths 
of  the  primary  conformations  are  about  0.1  A 
shorter  than  those  of  the  secondary.  From  the 
comparison  of  the  geometry  parameters  between 
the  primary  and  secondary  conformations,  the  pri¬ 
mary  conformations  are  earlier  transition  states 
than  the  secondary  conformations.  The  conforma¬ 
tions  of  the  primary  are  about  3-4  kcal /mol  more 
stable  in  energy  than  those  of  the  secondary  at 
both  calculation  levels  of  the  HF  and  MP2.  The 
difference  of  activation  energy  barrier  heights  be¬ 
tween  the  primary  and  the  secondary  conforma¬ 
tions  comes  from  the  rr  electrons  distribution  with 
methyl-substituted  group.  As  described  above,  the 
unsubstituted  carbon  in  propylene  has  much  tt- 
electron  density.  Consequently,  the  reaction  path 
of  the  secondary  conformation  is  unfavorable  for 
the  pull-push  mechanism;  the  electrons  in  the 
Ti-R  bond  move  to  the  region  of  new  carbon-car¬ 
bon  bond  formation  and  the  electrons  of  tt  bond 
move  to  the  region  of  new  Ti-C  bond  formation. 
Namely  the  electron  distribution  of  propylene  for 
the  secondary  conformations  has  reverse  effects  for 
the  electron  movement  along  the  reaction  path. 
The  energy  differences  between  A  and  B  for  both 
primary  and  secondary  conformations  are  also  un¬ 
der  1  kcal/mol.  The  energy  difference  between 
primary  A  and  primary  B  should  determine  isotac¬ 
tic  stereoregulation  for  the  product.  From  the  small 
energy  difference,  it  is  concluded  that  the  isotactic- 
ity  is  not  decided  for  the  initiation  step  of  poly¬ 
merization.  This  conclusion  corresponds  to  the  re¬ 
sults  [34]  for  propylene  insertion  to  transition  metal 
(Ti,  Zr,  and  Hf)-C  bond  for  homogeneous  catalysis 
systems  by  ab  initio  MO-MM  (molecular  mechan¬ 
ics)  method.  They  concluded  the  stereoselectivity 
is  decided  after  the  first  propylene  insertion  from 
the  results  of  the  MM  calculations.  The  stereoregu¬ 
lation  via  direct  ligand -olefin  interaction  for  their 
catalyst  is  not  significant.  Accordingly  the  stereos¬ 
electivity  is  decided  by  steric  energies  between 
monomer  and  polymer.  However,  their  calculation 
is  the  MM  and  included  only  steric  effects  (not 
electronic  effects). 

In  order  to  study  the  steric  and  electronic  effects 
of  polymer  by  ab  initio  molecular  orbital  calcula¬ 
tion  level,  the  transition  states  for  the  second  step, 
as  shown  in  Figure  3,  were  calculated.  In  Figure  3, 
conformation  I  indicates  the  polymer  part  C^HM2, 
spreads  to  the  outside  of  the  space  of  Cl^TiCl’^C^. 
Conformation  II  indicates  the  polymer  part  spreads 
to  the  inside  of  the  space  of  Cl*^TiCl^CT  For  these 


Cl« - AlHo 


ci»- 


-AIH, 


Conformation  (I)  Conformation  (II) 

FIGURE  3.  Conformations  of  polymer  part  at  the 
transition  state  for  reaction  of  (CH3)2CHCH2Ti(Cl2AIH2)2 
and  propylene. 


two  conformations  of  polymer  part,  the  transition 
states  of  eight  types  (primary  A-I,  A-II,  B-I,  and 
B-II,  and  secondary  A-I,  A-II,  B-I,  and  B-II)  were 
calculated.  These  eight  conformations  are  real  tran¬ 
sition  states  and  have  one  negative  eigenvalue  for 
their  force  constant  matrix.  The  relative  energies 
for  these  eight  transition  state  are  listed  in  Table 
III.  The  energy  difference  between  the  primary  A-I 
and  A-II  is  only  0.7  kcal/mol  at  the  MP2  calcula¬ 
tion.  The  energy  of  the  primary  B-II  is  almost  the 
same  to  that  of  the  primary  A-I.  However,  the 
primary  B-I  is  about  5  kcal/mol  higher  in  energy 
than  the  primary  A-I.  The  value  of  5  kcal/mol 
indicates  the  static  repulsion  energy  between  the 
methyl  groups  of  propylene  and  polymer  part.  The 
primary  B-II  is  about  5  kcal/mol  more  stable  in 
energy  than  the  primary  B-I.  If  the  space  of 
CI^TiCl^  is  crowded  with  other  catalysis  (maybe 


TABLE  III _ 

Relative  energy  from  primary  A-I  conformation 
for  the  second  step  of  propylene  polymerization. 


Relative  energy  (kcal  /  mol) 
HF/BS-I  MP2/BS-I 


Transition  state 


Primary  A-I 

0.0 

0.0 

Primary  A-II 

0.88 

0.69 

Primary  B-I 

5.27 

4.61 

Primary  B-II 

0.51 

-0.23 

Secondary  A-I 

1.63 

3.75 

Secondary  A-II 

2.32 

4.26 

Secondary  B-I 

2.28 

4.01 

Secondary  B-I! 

1.97 

3.49 
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this  assumption  indicates  a  realistic  heterogeneous 
catalysis),  the  conformation  II  types  are  unstable  in 
the  energy.  All  secondary  types  are  about  4 
kcal/mol  more  unstable  in  energy  than  the  pri¬ 
mary  A-L  This  reason  comes  from  the  electronic 
distribution  of  propylene  as  mentioned  above.  The 
insertion  is  the  pull-push  mechanism,  and  tt  elec¬ 
trons  of  monomer  have  to  move  new  Ti-C  bond 
region  through  the  transition  state.  Consequently, 
the  regioselectivity  of  reaction  is  controlled  by  the 
static  repulsion  between  the  methyl  groups  of 
monomer  and  of  the  first  unit  in  polymer  for  the 
primary  types,  and  is  controlled  by  the  electronic 
effects  (the  tt  electrons  distribution  of  monomer) 
for  the  secondary  types.  As  a  result,  the  reaction 
path  of  the  primary  A-I  type  leads  to  the  isotactic 
sequence. 


Conclusions 

The  mechanisms  of  the  insertion  of  ethylene 
and  propylene  into  R-Ti(Cl2  A1H2)2  (R  =  CH3  and 
C3H7)  were  studied  by  ab  initio  MO  methods.  Our 
treated  catalytic  system  is  one  kind  of  a  edge 
model  of  TiCl4  crystals  for  heterogeneous 
Ziegler-Natta  catalysts.  The  reactions  are  classi¬ 
fied  into  two  steps:  the  formation  of  olefin-Ti  tt 
complex  and  olefin  insertion  through  the 
pull-push  mechanism.  For  the  7r-complex  forma¬ 
tion  step,  one  side  of  cocatalysis  works  actively  to 
weak  the  Ti-CH3  bond.  For  the  olefin  insertion 
step,  the  cocatalysis  works  to  break  the  old  Ti-CH3 
bond,  and  the  other  side  of  cocatalysis  works  to 
form  a  new  Ti-C  bond.  As  the  results,  the  cocatal¬ 
ysis  play  a  role  in  facilitating  the  pull-push  mech¬ 
anism  through  the  Al-Cl  bonds  alternation. 

For  the  initial  step  of  the  propylene  polymeriza¬ 
tion,  four  types  of  conformation  were  studied  for 
each  TT  complex  and  for  each  transition  state:  pri¬ 
mary  A,  primary  B,  secondary  A,  and  secondary  B. 
For  the  7r-complex  formation,  the  secondary  con¬ 
formations  are  a  little  favorable  in  energy  to  com¬ 
pare  with  those  of  the  primary.  The  energy  differ¬ 
ence,  however,  is  only  0.2-0.3  kcal/mol  by  the 
MP2  calculation  and  about  2  kcal/mol  by  the  HF 
calculation  level.  Accordingly,  the  77  complex  does 
not  decide  probably  the  stereoregulation  of  the 
product.  For  the  transition  state,  the  primary  con¬ 
formations  are  favorable  by  about  4  kcal/mol  in 
energy  to  compare  with  those  of  the  secondary. 
The  energy  difference  between  the  primary  confor¬ 


mations  A  and  B  is  only  0.3  kcal/mol  by  the  MP2 
level.  Therefore,  the  isotacticity  is  not  decided  at 
the  initiation  step  of  polymerization. 

The  transition  states  for  the  second  step  of  poly¬ 
merization  were  calculated  by  the  HF/BS-I  calcu¬ 
lation  level.  All  secondary  types  of  the  transition 
states  are  about  4  kcal/mol  higher  in  energy  than 
the  primary  A-I.  Conformation  II  for  the  primary 
types  has  almost  the  same  energy  to  the  primary 
A-I.  However,  if  the  space  of  Cl^TiCl^C'^  is 
crowded  with  other  catalysis,  all  conformations  of 
type  II  are  unstable  in  energy.  Accordingly,  the 
transition  state  of  the  primary  B  is  about  5 
kcal/mol  higher  than  that  of  the  primary  A.  The 
most  stable  structure,  the  primary  A-I,  leads  to  the 
isotactic  type  product. 

As  mentioned,  the  role  of  the  cocatalysis  for  the 
pull-push  mechanism  and  the  isotacticity  for 
propylene  polymerization  could  be  shown  on  the 
basis  of  our  model  of  the  Ziegler-Natta  type  catal¬ 
ysis.  It  is  considered  that  the  role  of  cocatlysis  and 
the  mechanisms  of  the  isotacticity  polymerization 
can  be  extended  to  the  case  of  the  homogeneous 
catalysis  system. 
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ABSTRACT:  A  density  functional  theory  study  of  Au,  Au2,  AU3,  and  AU4  is 
performed  focusing  on  the  study  of  gold  tips  or  contacts  of  interest  in  investigations  of 
molecular-scale  electronics.  The  ground  state  for  the  four  systems  corresponds  to  the  one 
with  the  lowest  multiplicity,  and  the  ordering  of  energies  follows  the  multiplicity  in  all 
cases.  It  is  found  that  the  tetrahedral  geometry  of  AU4  is  not  the  preferred  one  for  the 
lowest  spin  states  with  neutral  charge,  but  is  the  preferred  one  for  charged  systems.  The 
ionization  potential  and  electron  affinities  for  the  four  systems  were  also  obtained.  A 
qualitative  interpretation  of  the  Kohn-Sham  highest  occupied  molecular  orbital  (HOMO) 
and  lowest  unoccupied  molecular  orbital  (LUMO)  orbitals  is  given  based  on  the  present 
calculations.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  749“758,  1997 


Introduction 

The  development  of  scanning  tunneling  mi¬ 
croscopy  (STM)  techniques  has  enormously  trig¬ 
gered  the  possibilities  of  using  organic  chemistry 
as  the  fundamental  complement  for  semiconduc¬ 
tors  [1].  This  may  allow  the  fabrication  of  logical 
circuits  at  the  level  of  single  molecules  [2].  These 
studies  require  the  precise  knowledge  of  the  effect 
of  the  STM  tips  or  contacts  on  these  molecules. 

One  of  the  major  metals  used  as  tips  or  contacts 
for  the  development  of  molecular-level  electronic 
circuits  is  gold.  In  some  cases,  small  gold  clusters 
are  the  preferred  material  for  measurements  of 
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current- voltage  characteristics  [3].  Other  interest 
centers  on  the  measurements  of  electrical  resis¬ 
tance  of  wires  composed  of  single  atoms,  and 
previous  studies  have  been  performed  with  xenon 
[4],  Therefore,  in  this  study  we  have  decided  to 
focus  on  the  characteristics  of  small  gold  clusters 
that  could  represent  the  tips  or  contacts  in  experi¬ 
ments  of  molecular  conductance.  These  contacts 
could  be  small  enough  to  form  one-atom-point 
contacts.  In  addition,  the  interactions  of  gold  clus¬ 
ters  with  single  molecules  must  be  studied  in 
order  to  understand  and  design  systems  for 
single-molecule  electronics.  There  are  several  ex¬ 
perimental  techniques  such  as  molecular  beams 
that  allow  the  synthesis  and  characterization  of 
clusters  of  a  defined  size.  Several  studies  have 
been  able  to  determine,  theoretically  and  exper- 
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imentally,  dissociation  energies,  ionization  po¬ 
tentials,  and  chemical  reactivity  for  some  gold 
clusters  [5-17].  The  study  of  gold  clusters  is  inter¬ 
esting  because  clusters  combine  both  the  proper¬ 
ties  of  metallic  surfaces  and  metallic  compounds. 
The  goal  of  this  contribution  is  to  study  and  deter¬ 
mine  the  basic  properties  of  the  lowest  states  of 
gold  monomer,  dimer,  trimer,  and  tetramer  using 
the  best  theoretical  tools  available  at  present,  and 
at  the  same  time,  to  determine  a  proper  tool  to 
study  much  larger  clusters  and  gold  complexes  of 
interest  for  molecular-scale  electronics. 


Theory 

We  have  used  density  functional  theory  (DFT) 
techniques  [18-24]  combined  with  effective  core 
potentials  [25].  Presently,  this  seems  to  be  the  most 
powerful  combination  of  tools  to  deal  with  rela¬ 
tively  large  numbers  of  big  atoms.  The  basis  set 
used  for  the  gold  atom  is  the  Los  Alamos  National 
Laboratory's  (LANL)  set  for  effective  core  poten¬ 
tials  (ECP)  of  double-^  type  [26-28].  The  use  of 
DFT  is  fully  justified  due  to  the  fact  that  is  an  ab 
initio  tool  able  to  deal  with  a  broad  variety  of 
systems.  Several  successful  applications  of  DFT 
have  been  reported  using  the  so-called  hybrid 
functionals,  where  a  portion  of  the  exchange  is 
calculated  as  a  fully  nonlocal  functional  of  the 
wave  function  of  an  auxiliary  noninteracting  sys¬ 
tem  of  electrons.  Since  this  resembles  the  exchange 
in  the  Hartree-Fock  (HF)  procedure  (actually  in 
any  wave  function  procedure),  it  is  common  to 


refer  to  this  functional,  or  procedure,  as  a  DFT-HF 
hybrid.  However,  we  have  to  consider  that  the 
so-called  exchange  is  being  calculated  using  a  non¬ 
interactive  wave  function  whose  density,  but  not 
its  wave  function,  corresponds  to  the  real  system. 
A  detailed  analysis  and  their  theoretical  rigor  was 
recently  reviewed  [29].  The  functional  that  we  used 
is  the  B3PW91  [30-33].  All  calculations  were  per¬ 
formed  using  the  Gaussian-94  program  [34].  All 
geometry  optimizations  were  performed  via  the 
Berny  algorithm  in  redundant  internal  coordinates 
[35].  The  thresholds  for  convergency  were  0.00045 
atomic  units  (a.u.)  and  0.0003  a.u.  for  the  maxi¬ 
mum  force  and  root  mean  square  (RMS)  force, 
respectively.  The  self-consistency  of  the  noninter¬ 
active  wave  function  was  performed  with  a  re¬ 
quested  convergence  on  the  density  matrix  of  10 
and  10"^  for  the  RMS  and  maximum  density  ma¬ 
trix  error  between  iterations,  respectively.  These 
settings  provide  correct  energies  of,  at  least,  five 
decimal  figures  and  geometries  of  approximately 
three  decimal  figures.  On  the  other  hand,  the  use 
of  the  pseudopotentials  with  relativistic  correc¬ 
tions  have  been  widely  demonstrated  to  be  a  good 
compromise  with  the  alternative  use  of  full-elec¬ 
tron  procedures.  This  reduces  the  required  compu¬ 
tational  effort  without  loss  of  accuracy  [36]. 


Results  and  Discussions 

Table  I  shows  the  results  for  the  lowest  state  of 
each  spin  symmetry  of  the  gold  atom.  The  pre¬ 
dicted  ground  state  is  a  doublet,  and  this  result  is 


TABLE  I 


Results  for  the  gold  monomer  (1  Ha  =  27.21 14eV).® 

Multiplicity 

Energy  (Hg) 

Energy  (eV) 

Experimental  (eV) 

Doublet  (Au) 

-135,48424  (0.00) 

Quartet  (Au) 

-135.22896  (6.95) 

Sextet  (Au) 

-134.84610  (17.36) 

Octet  (Au) 

-134.37500  (30.18) 

Singlet  (Au“) 

-135.56147 

Triplet  (Au~) 

-135.45921 

Singlet  (Au"^) 

-135.14100 

Triplet  (Au"^) 

IP  of  Au 

-135.03201 

9.34'" 

9.22 

EA  of  Au 

2.10*" 

2.31 

KS-HOMO  Au 

-6.77 

KS-LUMO  Au 

-4.60 

^Relative  energies  In  parentheses  (eV). 

Relativistic  MRSDCI  /  (13s11p5D4f)  yiel(de(d  IP  =  9.04  eV  and  EA  = 

2.11  eV  [9]. 
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in  agreement  with  the  experimental  observation 
[37].  The  ordering  of  the  higher  spin  states  follows 
the  predicted  trend.  In  order  to  calculate  the  ion¬ 
ization  potential  (IP)  and  electron  affinity  (EA)  of 
the  gold  atom,  we  also  studied  the  positive  and 
negative  ions  of  gold.  These  results  are  also  in 
excellent  agreement  with  the  experimental  values, 
and  they  are  slightly  better  than  expensive  rela¬ 
tivistic  multireference  single-double  configuration 
interaction  (MRSDCI)  calculations  using  larger  ba¬ 
sis  sets  [9].  Table  I  also  shows  the  Kohn-Sham 
(KS)  highest  occupied  molecular  orbital  (HOMO) 
and  lowest  unoccupied  molecular  orbital  (LUMO) 
energies  of  the  Au  atom.  They  are  plotted  in  Fig¬ 
ure  1  where  the  A  values  are  shown  from  the 
negative  of  the  HOMO  level  to  the  IP,  and  from 
the  negative  of  the  LUMO  level  to  the  EA.  These  A 
values  are  2.57  and  2.50  eV  for  the  HOMO  and 
LUMO,  respectively.  These  values  correspond  to 
the  electron  contribution  to  the  energy  in  the  adia¬ 
batic  integral  [38,  39]  of  the  Kohn-Sham  [40]  pro¬ 
cedure: 


d\, 

•^0 

resulting  from  the  reorganization  of  the  electron 
density  after  the  electron  ejection  or  capture. 

Table  II  shows  the  results  for  the  gold  dimer. 
The  ground  state  is  a  singlet  with  a  bond  length  of 


T  Au+ 
A=2.57 


Au  KS-HOMO 


1P=9.34 


6.77 


EA=2.10 


Au- 


Au  KS-LUMO 


A=2.50 


Au 


4.60 


FIGURE  1 .  Energy  levels  for  the  Au  atom  (in  eV) 


TABLE  II _ 

Results  for  the  gold  dimer  (D^  is  caicuiated  with  respect  to  two  doublet  monomers).^ 


Multiplicity 

Energy  (Hg) 

Bond  Length  (A) 

Energy  (eV) 

De  (eV) 

Singlet  AUg 

-271.03845 

2.547 

0.00 

1.90 

(2.472) 

(2.30) 

[2.549] 

[2.22]DHKBLYP 

[2.556] 

[2.1 9]  Pauli  BP 

[2.517] 

[2.26]ZORA-BP 

[2.515] 

[2.29]  DKH  BP 

[2.513] 

[2.54]  DKH  PW 

[2.54] 

[1 .86]ARPR-CEPA 

[2.54] 

[2.13]RPP-QCI(T) 

Triplet  Au2 

-270.97384 

2.961 

1.76 

0.15 

Quintet  Au2 

-270.76759 

2.638 

7.37 

-5.47 

Septet  Au2 

-270.49636 

2.502 

14.75 

-12.85 

Doublet  AUg  ^2^ 

-271.11149 

2.695 

Doublet  AU3 

EA  of  Au2 

-270.69432 

2.675 

1.99 

(1.94) 

IP  of  AU2 

9.36 

®  Experimental  values  are  in  parentheses.  Other  calculations  [17]  are  shown  in  square  brackets. 
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2.547  A  and  an  atomization  energy  of  1.90  eV  with 
respect  to  two  gold  ground-state  atoms.  This  is 
followed  by  a  triplet  state  at  1.76  eV  of  separation 
and  with  a  weak  bond  of  only  0.15  eV.  The  lowest 
quintet  and  lowest  septet  were  also  calculated. 
They  show  stable  structures  with  a  bond  length  of 
2.695  and  2.675  A,  respectively.  In  addition,  we 
have  also  analyzed  the  positive  and  negative  ions 
in  order  to  obtain  IP  and  EA  of  the  ground-state 
gold  dimer.  These  calculations  yielded  9.36  and 
1.99  eV,  respectively,  practically  the  same  values 
as  for  the  monomer.  The  EA  is  in  excellent  agree¬ 
ment  with  the  experimental  value  of  1.94  eV 
[12-14].  We  were  not  able  to  find  an  experimental 
value  for  the  IP  of  Au2.  The  bond  dissociation 
energy  (D^)  of  1.90  eV  is  in  fairly  good  agreement 
with  the  reported  experimental  values  of  2.30  eV 
[41]  and  2.33  eV  [10,  11].  The^bond  length  is 
slightly  overestimated  by  0.077  A.  However,  our 
results  are  in  closer  agreement  with  a  direct  per¬ 
turbation  relativistic  DPT  procedure  [16]  which 
reported  a  of  2.04  eV  and  a  bond  length  of  2.58 
A.  Several  calculations  on  Au2  have  recently  been 
reported  [17]  and  they  are  listed  in  Table  II. 

Table  III  shows  the  results  for  the  trimer.  The 
lowest  doublet,  quartet,  and  sextet  were  calcu¬ 
lated.  As  with  the  monomer  and  dimer,  the  order¬ 
ing  of  energies  follows  the  multiplicity.  As  per  our 
goal,  resembling  the  STM  tip  implies  the  search  for 
local  minima  with  a  triangular  structure.  There¬ 
fore,  all  our  geometry  optimizations  started  with  a 
triangular  structure.  In  addition,  the  search  was 
extended  to  the  positive  and  negative  singlets  and 
triplets.  Except  for  the  negative  singlet  of  AU3  that 
yielded  a  linear  structure,  all  the  other  geometries 
were  triangular.  For  the  charged  ions,  the  singlets 
were  lower  in  energy  than  the  triplets.  The  quartet 
was  the  only  neutral  structure  to  have  sym¬ 
metry  and  correspond  to  the  one  with  the  longest 
bond  length.  The  doublet  and  triplet  are  stable 
with  respect  to  dissociation  into  three  ground-state 
Au  atoms  by  2,79  and  0.48  eV,  respectively.  The 
sextet  is  unstable  by  7.41  eV.  Only  the  singlet  is 
stable  with  respect  to  a  monomer  plus  dimer  sys¬ 
tem  by  0.88  eV. 

Table  IV  shows  the  results  for  the  tetramer.  The 
search  for  local  minima  was  started  with  an  almost 
tetrahedral  structure  with  no  constraints  on  its 
symmetry  (C^  symmetry).  The  ordering  of  ener¬ 
gies  was  again  the  same  as  their  multiplicities; 
however,  no  tetrahedral  geometry  was  found  for 
the  lowest  singlet  and  triplet.  The  lowest  quintet 
corresponds  to  a  tetrahedral  structure  at  3.16  eV 


above  the  singlet.  The  singlet  ended  up  planar  in  a 
triangular  arrangement  connecting  the  fourth  gold 
atom  in  one  of  the  vertices.  The  triplet  afforded  an 
almost  perfect  square  with  a  puckered  angle  of 
12.9°  and  0.81  eV  above  the  singlet,  and  an  addi¬ 
tional  search  constrained  to  D4/,  symmetry  yielded 
practically  the  same  energy  as  the  puckered  one. 
The  quintet  was  a  tetrahedron.  Interestingly,  the 
three  structures  are  stable  with  respect  to  four 
gold  atoms  in  their  ground  state  by  4.96,  4.15,  and 
1.80  eV,  respectively.  The  singlet  and  triplet  are 
stable  with  respect  to  two  dimers  by  1.15  and  0.34 
eV,  respectively.  The  quintet  is  unstable  by  2.01 
eV.  The  singlet  and  triplet  are  also  stable  with 
respect  to  a  ground-state  trimer  and  a  monomer 
while  the  quintet  is  not  stable.  We  can  also  observe 
from  Tables  II-IV  that  the  bond  length  of  the 
doublet  negative  tetramer  is  in  very  close  agree¬ 
ment  with  the  experimental  Au-Au  gold  distance 
in  the  Au(lOO)  (1  X  1)  surface.  [15]. 


Structural  and  Electronic  Analysis 

The  point  contacts  of  gold  clusters  are  of 
paramount  importance  for  single-molecule  elec¬ 
tronic  studies.  To  our  knowledge,  there  are  no  ab 
initio  studies  dealing  with  the  use  of  clusters  as 
the  contacts  for  single  molecules.  Most  previous 
studies  dealt  with  surfaces.  Shown  in  Figure  1  are 
the  electronic  levels  for  the  gold  atom  in  its  ground 
state.  The  ground  state  of  gold  is  used  as  the 
reference  to  compare  with  the  energy  of  its  posi¬ 
tive  and  negative  ions.  The  negative  of  the  KS 
HOMO  and  LUMO  of  the  ground  state  of  the  gold 
atom  are  also  shown.  The  KS  orbitals  correspond 
to  a  system  of  fictitious  noninteracting  electrons 
with  a  density  identical  to  the  real  system.  There¬ 
fore,  the  negative  of  the  HOMO  energy  corre¬ 
sponds  to  the  energy  increase  of  the  system  when 
this  fictitious  orbital  stops  being  part  of  the  nonin¬ 
teracting  system  due  to  the  hypothetically  instan¬ 
taneous  ejection  of  one  electron  in  the  real  system. 
The  loss  of  one  electron  implies  an  electron  relax¬ 
ation  in  the  real  and  noninteractive  systems  that 
has,  as  a  consequence,  an  increase  in  the  total 
energy  of  the  positive  ion.  This  excess  (A)  is  domi¬ 
nated  by  the  electron- electron  repulsion,  which 
grows  more  than  the  kinetic  and  nuclear-electron 
energies  combined.  Similarly,  the  insertion  of  an 
electron  in  the  real  system  causes  the  LUMO  of  the 
noninteractive  system  of  the  neutral  atom  to  be 
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TABLE  III _ 

Results  for  the  gold  trimer.^ 


System  Energy,  Hg  (eV) 


Geometry  (A  and  °) 


AUg^SjC^  -406.55521  (0.00)  2.642  2.642  3.007 

69.4  55.3  55.3 


AUg^AiDg^  -406.47020  (2.31)  2.966 


Au3®6iC2v  -406.28273  (7.41)  2.695  2.695  2.972 

66.9  56.5  56.5 


AUg-^AiDgft  -406.64609  (-2.47)  2.787 


AUg-  -406.69050  (-3.68)  2.623 


Au^^/4,D3h  -406.29524  (7.07)  2.674 


Au^^AiCav  -406.19806(9.72)  3.090  3.090  2.631 

50.4  64.8  64.8 


®When  electronic  state  and  symmetry  of  the  optimized  system  do  not  correspond  with  the  initial  state  the  latter  is  indicated. 
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TABLE  IV _ 

Results  for  the  gold  tetramer.^ 


System 

Energy, 

Energy  (eV) 

Geometry  (A  and  °) 

Structure 

-542.11914 

0.00 

1-3,  2-3  =  2.761 

1-2  =  2.608 

3-4  =  2.560 

AUi 

\ 

AUo — AU4 

/ 

AU2 

D4ft  AU4 

-542.08941 

0.81 

2.684^ 

Au — Au 

I  I 

Au — Au 

Au, 

Si  D2CI  Au^ 


-542.00314 


3.16 


1-2,  3-4  =  2.979 
1-3,1-4  2-3  2-4  =  2.733 


AU2 - \--- AUg 


Au. 


Doublet  AU4  -542.17039 


Doublet  Au|  -541.81813 


Quartet  AU4  -542.17848 


-1.39 


8.19"^ 


-1.61 


1-3  =  2.827 

1- 4  =  2.828 

2- 4  =  2.885 


1- 3  =  2.955 

2- 3  =  2.922 
1-2  =  2.626 


1- 3  =  2.848 

2- 3  =  2.851 
1-2  =  2.852 


3-4  2.282 

1- 2  =  2.877 

2- 3  =  2.893 


1- 4  =  2.943 

2- 4  =  2.912 

3- 4  =  2.626 


1- 4  =  2.851 

2- 4  =  2.852 

3- 4  =  2.848 


Quartet  AU4  No  convergence 

^All  initial  geometries  were  almost  perfect  tetrahedrons. 

similar  geometrical  and  isoenergetic  conformation  with  a  puckered  angle  of  12.9  was  also  found. 

^A  lower  energy  structure  with  C2V  planar  symmetry  (rombic)  was  found  with  energy  of  -541.8205  a.u. 


occupied  and,  therefore,  instantaneously  reducing 
the  energy  by  an  amount  equivalent  to  the  nega¬ 
tive  of  the  LUMO  energy.  Since  the  electrons  be¬ 
come  denser  due  to  the  presence  of  the  excess 
electron,  the  relaxation  of  the  electrons  will  tend  to 
spread  out  the  electronic  charge.  As  in  the  case  of 


the  positive  ion,  an  increase  in  the  total  energy 
results.  As  expected  from  a  physical  interpretation 
of  the  KS  orbitals,  the  subtraction  of  the  HOMO 
energy  from  the  total  energy  of  the  neutral  system 
will  yield  an  underestimation  of  the  IP.  Further¬ 
more,  the  addition  of  the  LUMO  energy  to  the 
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neutral  atom  will  yield  an  underestimation  of  the 
energy  of  the  negative  ion.  Relaxation  effects  will 
produce  an  increase  of  the  energies  in  both  cases. 

Therefore,  DFT  presents  a  better  qualitative  and 
quantitative  picture  than  HF  theory  where,  in  most 
cases,  the  LUMO  energy  is  always  positive  [42] 
and  thus  contrary  to  a  real  physical  situation.  Since 
the  virtual  KS  orbitals  describe  electrons  moving 
in  the  field  of  N  -  1  electrons,  they  therefore  lack 
the  artificial  upward  shift  of  the  Flartree-Fock 
virtual  orbitals  [43].  In  addition,  using  an  exact 
functional,  the  HOMO  energy  is  the  exact  ioniza¬ 
tion  energy  since  this  energy  depends  only  upon 
the  asymptotic  decay  of  the  density  [44].  For  fur¬ 
ther  details  on  the  Kohn-Sham  orbitals  see,  for 
instance.  Refs.  [21,  23]  and  the  works  cited  within. 
It  is  important  to  mention  that  relaxation  energy 
defined  here  has  contributions  from  the  potential 
and  kinetic  energies.  The  potential  energy  arises 
from  the  nuclear-electron  and  electron-electron 
interactions  and  are  incorporated  into  A,  which  is 
formed  by  the  contributions  of  the  T,  and 
energies.  Therefore,  when  an  electron  leaves  the 
neutral  system,  at  that  instant,  the  1/^  and  T  lower 
the  total  energy  and  the  increases  the  total 
energy.  Then,  in  the  relaxation  process,  the  has 
more  effect  than  the  +  T  energy.  Conversely, 
when  an  electron  enters  into  the  neutral  system  to 
form  the  negative  ion,  at  that  instant  the  +  T 
energy  increases,  therefore  increasing  the  total  en¬ 
ergy.  Accordingly,  the  increases  in  its  absolute 
value,  thus  lowering  the  total  energy.  In  this  situa¬ 
tion,  the  Vge  +  T  term  has  a  dominant  effect.  These 
ideas  are  in  disagreement  with  another  study  us¬ 
ing  the  local  density  approximation  (LDA),  which 
claims  that  the  so-called  relaxation  and  correlation 
effects  cancel,  thus  Koopman's  theorem  can  be 
used  for  the  outer  orbitals  [45]. 

Table  V  summarizes  the  energetic  results  for  the 
four  gold  systems.  The  HOMO  and  LUMO  ener¬ 
gies  follow  the  trend  and  qualitative  behavior  of 
the  monomer.  For  the  dimer  and  trimer,  the  effects 
of  geometry  relaxation  have  to  be  considered,  and 
basically  the  IP  and  EA  tend  to  be  in  the  same 
range  of  values  as  the  cluster  increases  in  size.  This 
has  strong  implications  in  the  estimation  of  Fermi 
levels  for  gold  obtained  from  small  clusters  calcu¬ 
lations.  We  also  noticed  that  a  tetrahedral  geome¬ 
try  is  not  allowed  with  low  spin  and  neutral  charge. 
In  addition,  results  in  Table  V  yield  a  perfect 
qualitative  interpretation  of  the  Kohn-Sham  HO¬ 
MO  and  LUMO,  allowing  us  to  express  the  energy 
of  the  cation  and  anion  with  respect  to  the  energy 


TABLE  V _ 

Ionization  potentials  (IP),  electron  affinities  (EA), 
Kohn  -  Sham  (KS)  HOMO  and  LUMO  energies  of 
Au,  Au2,  and  AUg  in  eV.® 


Aui 

AUg 

Aus 

IP 

9.34 

9.36 

7.07 

6.77 

7.19 

5.40 

"^HOMO 

2.57 

2.17 

1.57 

EA 

2.10 

1.99 

2.47 

“^LUMO 

4.60 

3.79 

3.81 

^LUMO 

2.50 

1.80 

1.34 

^  We  have  not  used  the  AU4  because  no  ground-state  sys¬ 
tem  with  tetrahedral  symmetry  was  found. 


of  the  neutral,  which  can  be  written  in  KS-DFT  as 
(see  for  instance  Ref.  [46]) 

E  =  T,  +  Ke  +  r<^AlKel'I'A>  dX, 

•'l 

and  using  the  KS  orbital  energies,  as 

E=ie^  + 

1  •'0 

Therefore,  the  energy  of  the  cation  is 

=  I  H  “  ^HOMO  “F  ^TOMoj 

+  {/F^AlKel^A>‘iA- A“o}. 

and  the  energy  of  the  anion 

^  I  +  ^LUMO  “  ^UJMoj 

+  |/\,AAlVeel’I'A>rfA  + 

Thus 

_  Anon  int  _  Aadiaba 

HOMO  “  -^HOMO  ^HOMO 

and 

_  _  Anon  int  i  Aadiaba 

LUMO  “  ^LUMO  ^LUMO* 

Since  A^omo  A^umo  always  positive,  the 
noninteracting  changes  predominate  over  the  adia¬ 
batic  changes  when  the  system  looses  one  electron, 
and  the  opposite  happens  when  an  electron  is 
gained  by  the  system. 
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An  analysis  of  the  HOMO  and  LUMO  orbitals 
also  helps  in  explaining  the  geometrical  structure 
of  the  gold  tetramers.  The  dimer  and  trimer  struc¬ 
tures  are  straightforwardly  understood.  Figure  2 
shows  the  HOMO  and  LUMO  orbitals  for  the 
singlet,  triplet,  and  quintet  lowest  states  of  AU4. 
We  see  that  in  the  planar  structure  of  the  singlet, 
the  HOMO  is  localized  on  the  gold  atom  outside 
of  the  triangle  and  the  LUMO  is  localized  on  the 
base  of  the  triangle.  Therefore,  electron  conduction 
will  be  preferred  when  one  electron  enters  from 
the  base  and  one  electron  ejects  from  the  tip.  The 
reverse  would  be  less  preferred  and  probably  only 


possible  through  a  tunnel  effect  as  in  the  STM. 
This  logical  way  to  interpret  conductivity  in 
molecules  needs  to  be  analyzed  further  since  it 
would  be  instrumental  for  the  design  of  molecular 
circuits.  The  HOMO  and  LUMO  corresponding  to 
the  ground  state  of  the  planar  tetramer  are  also 
show  in  Figure  2.  In  addition  we  have  plotted  the 
HOMO  and  LUMO  of  the  nonoptimized  tetramer 
with  a  symmetry.  Clearly,  the  HOMO  of  the 
nonoptimized  tetramer  indicates  that  this  corre¬ 
sponds  to  an  antibonding  orbital  and  therefore  no 
stable  structure  is  found.  The  point  group 

used  here  can  end  up  on  a  or  a  structure  on 


C2V  AU4  singlet 


HOMO 


LUMO 


D4h  AU4  triplet 


FIGURE  2.  HOMO  and  LUMO  orbitals  for  the  optinnized  lowest  singlet,  triplet,  and  quintet  states  of  AU4  as  well  as  for 
the  nonoptinnized  triplet  of  synnnnetry. 
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Tj  AU4  triplet 

HOMO 


Djd  AU4  Quintet 

HOMO 


LUMO 


LUMO 


FIGURE  2.  Continued 


the  other  end.  The  preferred  symmetry  is  the  D4;,. 
The  HOMO  and  LUMO  of  the  lowest  quintet  state 
is  also  shown  in  the  figure.  Although  this  structure 
has  reached  a  local  extremum,  it  cannot  be  real¬ 
istically  considered  because  it  corresponds  to  an 
artifact  of  the  procedure;  since  several  of  the  al¬ 
pha-occupied  orbitals  are  above  several  of  the 
beta-unoccupied  orbitals.  The  orbital  analysis  al¬ 
lows  us  to  conclude  that  there  is  no  possibility  of 
having  a  tetrahedral  geometry  for  the  gold  te- 
tramer  due  to  the  instability  associated  with  the 
asymetric  electronic  occupation  of  degenerate  or¬ 
bitals  in  the  structure  of  AU4  because  there  is  a 
distortion  pathway  that  will  lower  the  electronic 
energy  of  the  molecule  yielding  a  removal  of  the 
orbital  degeneracy.  In  this  case,  the  distortion  can 
reduce  the  symmetry  to  €2^,  D4;,,  or  02^  point 
groups.  Although  strictly  associated  initially  with 
the  instability  of  a  degenerate  electronic  state  [47], 
the  above  distortion  is  also  known  as  a  Jahn-Teller 
effect  [48].  All  orbitals  presented  in  Figure  2  corre¬ 
sponds  to  contours  of  +0.07/bohr^/^,  i.e.,  con¬ 


tours  of  probability  per  cubic  Bohr  or  in  classi¬ 
cal  terms,  contours  corresponding  to  a  electronic 
density  of  0.005  electrons  per  cubic  Bohr. 
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ABSTRACT:  The  results  of  ab  initio  post-Hartree-Fock  study  of  the  molecular 
structures,  relative  stabilities,  and  Hartree-Fock  level  calculations  of  IR  frequencies  and 
intensities  for  mono-  and  dihydrated  oxo-amino-tautomers  of  guanine  are  reported.  The 
geometries  of  the  local  minima  were  optimized  without  symmetry  restrictions  at  the 
second-order  Moller-Plesset  perturbation  theory.  The  standard  6-31  G(d)  basis  set  was 
used.  Local  minima  were  verified  by  calculations  of  the  matrix  of  energy  second 
derivatives  [Hessian  at  the  HF/6-31  G(d)  level].  The  single-point  calculations  were  also 
performed  at  the  MP4(SDQ)/6-31G(d)//MP2/6-31G(d)  and  MP2/6-31  +  -H 
G(d,  p)//MP2/6-31G(if)  levels  of  theory.  The  total  energies  were  corrected  for  the 
zero-point  energy  contributions  scaled  by  a  factor  of  0.9.  The  structural  parameters  of 
mono-  and  dihydrated  complexes  were  analyzed  and  related  to  the  characteristics  of 
hydrogen  bonds.  We  found  that  the  interaction  of  guanine  tautomers  with  two  water 
molecules  changes  the  order  of  the  gas-phase  stability:  7GUA  =  9GUA  into  the  order 
which  corresponds  to  the  stability  of  the  guanine  tautomers  in  the  polar  solvent: 
9GUA  »  7GUA.  The  predicted  IR  HF/6-31G(rf)  level  frequencies  and  intensities  are 
presented.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  759-765,  1997 
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Introduction 

The  structural  characteristics  of  the  isolated 
DNA  and  RNA  bases  and  the  prototypic 
molecules  have  been  the  subject  of  extensive  spec¬ 
troscopic  studies  in  recent  years.  Most  of  the  struc¬ 
tural  characteristics  have  been  obtained  using 
infrared  spectroscopy  and  high-resolution  mi¬ 
crowave  studies.  These  investigations  are  always 
more  relevant  if  they  are  performed  in  parallel 
with  ab  initio  quantum  chemical  calculations, 
which,  at  the  correlated  level  with  sufficiently  large 
basis  sets,  are  able  to  reproduce  such  structural 
characteristics  as  geometry  and  IR  frequencies  with 
the  same  accuracy  as  those  of  experiments  (see 
e.g,,  [1]).  Examples  of  such  spectroscopic  investiga¬ 
tions  coupled  with  ab  initio  quantum  chemical 
calculations  can  be  found  in  [2]. 

Currently,  the  focus  of  these  combined  investi¬ 
gations  is  strongly  shifted  toward  the  study  of 
isolated  intermolecular  complexes  with  water 
molecules.  It  is  so  natural  because  the  hydration  of 
the  DNA  and  RNA  bases  plays  a  special  role  in 
determining  the  three-dimensional  structure  of 
these  kinds  of  biopolymers.  The  available  litera¬ 
ture  on  this  subject  consists  of  investigations  of 
only  the  pyrimidine  type  of  DNA  bases  and  their 
prototypic  molecules  [3],  To  fill  the  gap  formed  by 
a  lack  of  investigations  of  hydrated  puring  bases, 
we  initiated  ab  initio  calculations  of  the  two  tau¬ 
tomers  of  guanine:  the  oxo-amino  form  of  the 
N(9)H  tautomer  (9GUA)  and  the  oxo-amino  form 
of  the  N(7)H  tautomer  (7GUA),  which  are  the 
lowest  minima  on  the  potential  surface  of  isolated 
guanine  tautomers  [4]. 

Before  discussing  the  objective  of  this  article, 
justification  for  considering  the  mono-  and  dihy¬ 
dra  ted  species  of  guanine  tautomers  is  in  order. 
We  are  interested  only  in  the  cyclic  forms  of  these 
complexes  because  of  the  existence  of  both  experi¬ 
mental  and  theoretical  data  which  support  the  idea 
that  similar  structures  are  the  most  important  min¬ 
ima  on  the  potential  surface  of  the  water-DNA 
base  complexes: 

(i)  According  to  the  X-ray  data  of  guanidine 
monohydrate  [5],  the  water  molecule  is 
located  in  the  vicinity  of  the  carbonyl 
oxygen; 

(ii)  according  to  the  calculations  of  formamide 
and  acetamide-water  complexes  [6, 7],  the 


>  C=0  •  •  H2O  •  •  HN  <  bifurcated  struc¬ 
ture  is  really  the  global  minimum;  and 

(iii)  recently,  it  was  concluded  from  the  studies 
of  rotationally  resolved  fluorescence  excita¬ 
tion  spectrum  [3a,  b]  that  the  mono-  and 
dihydrated  complexes  of  pyridone  have  the 
same  geometrical  structure  as  that  as¬ 
sumed  for  the  7(9)GUA  •  H2O  and  the 
7(9)GUA  •2H2O  species  in  this  study. 

The  purpose  of  this  article  was  to  provide  accu¬ 
rate  theoretical  data  for  the  studied  species.  The 
following  properties  of  the  mono-  and  dihydrated 
guanine  species  are  predicted:  (1)  the  geometry 
and  rotational  constants  of  the  mono-  and  dihy¬ 
drated  7(9)GUA  species;  (2)  the  stability  of  these 
complexes  and  a  comparison  with  the  stability  of 
isolated  species;  and  (3)  IR  frequencies  of  mono- 
and  dihydrated  7(9)GUA  species  and  a  compari¬ 
son  with  the  spectra  of  isolated  components. 


Computational  Methods 

The  ab  initio  LCAO-MO  method  [8]  was  used  to 
study  the  interaction  of  guanine  tautomers  with 
water  molecules.  The  calculations  were  carried  out 
with  the  Gaussian-92  program  [9].  The  standard 
6-31G(rf)  basis  set  was  used.  All  the  geometries 
were  optimized  without  symmetry  restrictions  (C^ 
symmetry  was  assumed)  by  the  gradient  proce¬ 
dure  [10]  initially  at  the  HE  level  and  subsequently 
at  the  second-order  Moller-Plesset  perturbation 
theory  [11].  Local  minima  were  verified  by  estab¬ 
lishing  that  the  matrix  of  energy  second  deriva¬ 
tives  (Hessian)  [at  the  HF/6-31G(d)  level]  has  only 
positive  eigenvalues.  The  single-point  calculations 
were  performed  at  the  MP4(SDQ)/6- 
31G(rf)//MP2/6-31G(d)  and  MP2/6-31  +  + 
G(d,  p) / /MP2 /6-31G(rf)  levels  of  theory.  The  total 
energies  were  corrected  for  the  HF/6-31G(d)  level 
with  a  zero-point  energy  contribution  scaled  by  a 
factor  of  0.9. 


Results 

The  calculated  geometrical  parameters  for 
mono-  and  dihydrated  complexes  of  7(9)GUA  are 
presented  in  Figure  1.  Tables  I  and  II  present  the 
computed  rotational  constants  along  with  selected 
intermolecular  parameters  for  7(9)GUA  •  H2O  and 
7(9)GUA  •  2H2O.  The  data  corresponding  to  the 
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A 

FIGURE  1 .  Structural  parameters  of  mono-  and  dihydrated  oxo-amino-guanine  complexes.  (A)  Up  indexes: 
monohydrated  oxO'-amlno-N(9)H  guanine;  down  indexes:  monohydrated  oxo-amino-N(7)H  guanine.  (B)  Up  indexes: 
dihydrated  oxo-amino-N(9)H  guanine;  down  indexes:  dihydrated  oxo-aminO“N(7)H  guanine. 


relative  stability  of  isolated  and  hydrated  guanine 
species  together  with  computed  dipole  moments 
are  collected  in  Table  III.  The  computed  IR  fre¬ 
quencies  are  given  in  Tables  IV  and  V. 


Discussion 

GEOMETRY  OF  7(9)GEA  •  H2O  AND 
7(9)GEA  •  2  H2O  SPECIES 

An  analysis  of  the  structural  parameters  in  hy¬ 
drogen-bonded  systems  A — where  A  and 

TABLE  I _ 

MP2/6-31G(d)  rotational  constant  (A,  B,  C,GHz) 

and  selected  intermolecular  parameters  (A)  for  the 
monohydrated  7(9)-guanine  complexes. 


QGUA-HgO 

7GUA- H2O 

A 

1 .47278 

1 .46761 

B 

0.74504 

0.74764 

C 

0.49581 

0.49647 

2.822 

2.811 

ROuOi7 

2.818 

2.820 

B  are  heavy  atoms  which  are  involved  in  hydrogen 
bonding,  is  crucial  for  an  understanding  of  the 
reasons  for  proton  transfer  in  DNA  bases.  Accord¬ 
ing  to  the  classification  of  nearly  linear  hydrogen 
bonds  [12]  (AHB  more  than  IbS'"),  the  very  low 
values  of  the  proton-transfer  barriers  take  place 
when  the  A — B  distance  is  less  than  2.5  A  (A  and 
B  are  oxygens  and/or  nitrogens  atoms).  In  this 
case,  the  two  heavy  atoms  are  so  close  to  each 

TABLE  II _ 

MP2/6-31G(c/)  rotational  constant  (A,  B,  C,  GHz) 

and  selected  intermolecular  parameters  (A)  for  the 
dihydrated  7(9)-guanine  complexes. 


9GUA*H20 

7GUA-H20 

A 

1.23260 

1 .22306 

B 

0.524809 

0.52490 

C 

0.37245 

0.37073 

RN10O17 

2.868 

2.845 

RO17O20 

2.719 

2.752 

RO-14O20 

2.794 

2.804 

The  O  —  O  distance  In  (H20)2  is  2.915  A  at  the  MP2/ 
6-31G(c/)  level  of  theory. 
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TABLE  III 


Relative  stability  (kJ  mol 
species. 

and  dipole  moments  (D)  of  guanine  tautomers  and  their 

mono-  and  dihydrated 

MP4(SDQ)/ 

MP2/ 

MP2/ 

6-31  G(c/)// 

6-31-l--l-G{cy)// 

G-31G(d)// 

MP2/ 

MP2/ 

MP2/ 

Species 

6-31  G(c/) 

6-31  G(d) 

6-31  G(d) 

9GUA 

0.0 

0.0 

0.0 

7.0 

7GUA 

1.6 

1.8 

-0.2 

1.6 

QGUA-HsO 

0.0 

0.0 

0.0 

5.7 

7GUA • H2O 

-4.1 

0.5 

-2.3 

1.6 

9GUA'2H20 

0.0 

0.0 

0.0 

4.9 

7GUA-2H20 

6.8 

2.7 

4.8 

2.8 

® Evaluated  from  MP2-optimized  geometry  using  HF  density. 


other  that  the  zero-point  energy  level  could  appear 
even  under  the  barrier.  In  the  case  of  the  A — B 
distance  between  2.8  and  3.0  A,  the  value  of  the 
proton-transfer  barrier  is  much  higher.  This  obser¬ 
vation  is  the  reference  point  for  our  analysis  of  the 
structural  data  (Fig.  1  and  Tables  I  and  II)  of 
mono-  and  dihydrated  tautomers  of  9(7)  guanine. 

According  to  the  data  of  previous  theoretical 
studies  and  experimental  observations  [3a,  b],  the 
influence  of  the  interaction  with  one  and/or  two 
water  molecules  on  the  bond  distances  and  bond 
angles  of  prototypic  molecules  manifests  itself 
mainly  in  the  region  of  intermolecular  hydrogen 
bonding.  Our  data  (see  Fig.  1  and  Tables  I  and  II) 
completely  confirm  this  point.  All  other  structural 
parameters  of  the  heterocyclic  rings  are  vitually 
the  same  [except  for  the  "'single"  bond  of 

9(7)GUA  tautomers  which  is  also  under  the  influ¬ 
ence  of  hydrogen  bonding]  and  depend  only  on 
the  chemical  structure  of  tautomers. 

The  addition  of  the  second  water  molecule 
makes  a  noticeable  change  in  the  geometrical 
structure  of  the  hydrogen-bonded  complexes.  All 
of  them  become  much  more  linear  compared  to 
the  monohydrated  complexes;  the  distance  be¬ 
tween  Oi4  and  O20  of  dihydrated  complexes  is 
much  shorter  than  the  corresponding  distance 
Oi4 — Oi7  of  the  monohydrated  complexes.  Also, 
the  distance  between  O^j  and  O20  is  much  shorter 
than  the  interatomic  distance  in  a  free  water  dimer 
(see  data  in  Tables  I  and  II).  All  the  observed 
trends  for  the  9(7)GUA  •  H2O  and  9(7)GUA  •  2H2O 
complexes  are  congruous  with  the  available  exper¬ 
imental  and  calculated  [3a,  b]  data  of  the  mono- 
and  dihydrated  complexes  of  pyridone.  Thus, 


based  on  geometrical  characteristics,  one  might 
expect  that  the  participation  of  two  water 
molecules  in  a  proton-transfer  reaction  between 
the  and  0^4  atoms  of  the  guanine  molecule 
could  be  more  favorable  than  is  the  participation 
of  only  one  water  molecule. 

RELATIVE  STABILITIES 

The  relative  stabilities  of  the  isolated  and  sol¬ 
vated  guanine  tautomers  have  been  predicted  by 
different  levels  of  theory  up  to  an  MP4(SDQ)/6- 
311  -F  -F  G(d,  p)//MP2/6-31G(d)  approximation 
[13].  In  Table  III,  the  calculated  stabilities  of  iso¬ 
lated  tautomers  are  in  complete  agreement  with 
the  data  in  [13]. 

Our  present  discussion  will  concentrate  on  an 
evaluation  of  the  relative  stability  of  the  hydrated 
complexes.  For  the  isolated  tautomers  of  guanine, 
the  following  stability  pattern  is  established: 

7GUA  ^  9GUA.  (1) 

This  relationship  (1)  corresponds  well  with  the 
available  experimental  data  for  the  species  in  the 
gas  phase  [4].  According  to  the  data  of  experimen¬ 
tal  IR  studies,  7GUA  is  probably  the  most  stable 
tautomer.  The  tendency  for  the  7GUA  form  of 
guanine  monohydrates  to  stabilize  exists.  Based  on 
the  data  presented  in  Table  III,  the  same  stability 
pattern  in  obtained  for  complexes  of  guanine  with 
one  water  molecule: 

7GUA  •  H2O  ^  9GUA  •  H2O.  (2) 

The  described  peculiarity  of  monohydrated  com¬ 
plexes  is  changed  for  the  dihydrated  isolated  com- 
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TABLE  IV _ ^ 

Calculated  IR  frequences  (FREQ,  in  cm  ~ 
and  absolute  intensities  (INT,  km  mol  ”^)  for 
9-guanine  species. 


9-Guanine 
FREQ  INT 

9GUA- 

FREQ 

H2O 

INT 

9GUA' 

FREQ 

2H2O 

INT 

152 

6 

50 

6 

34 

4 

180 

2 

110 

18 

57 

1 

218 

10 

153 

2 

59 

4 

339 

17 

167 

1 

147 

18 

355 

93 

186 

6 

157 

4 

366 

7 

218 

25 

164 

1 

392 

1 

247 

90 

192 

4 

522 

7 

351 

6 

199 

100 

573 

43 

370 

38 

220 

12 

576 

109 

381 

63 

237 

4 

608 

293 

387 

73 

315 

129 

654 

109 

398 

59 

356 

9 

679 

8 

526 

57 

367 

11 

721 

10 

562 

256 

385 

1 

729 

30 

576 

196 

391 

7 

790 

60 

581 

57 

430 

27 

823 

13 

676 

256 

496 

349 

871 

90 

690 

196 

533 

9 

908 

15 

719 

2 

550 

91 

969 

4 

735 

31 

575 

168 

1033 

11 

771 

2 

584 

6 

1147 

13 

815 

48 

680 

176 

1162 

12 

865 

6 

700 

142 

1192 

22 

894 

202 

719 

6 

1251 

21 

909 

40 

736 

54 

1282 

59 

970 

5 

777 

9 

1425 

5 

1032 

13 

819 

22 

1457 

54 

1158 

15 

830 

353 

1478 

62 

1162 

7 

870 

63 

1535 

22 

1206 

10 

905 

26 

1577 

8 

1262 

10 

922 

180 

1687 

26 

1288 

70 

969 

5 

1736 

112 

1428 

11 

1032 

15 

1781 

464 

1466 

11 

1158 

17 

1792 

111 

1513 

189 

1162 

6 

1842 

536 

1536 

16 

1219 

7 

1999 

901 

1578 

9 

1266 

5 

3452 

4 

1702 

45 

1295 

71 

3814 

69 

1743 

76 

1430 

17 

3849 

76 

1787 

305 

1465 

11 

3909 

112 

1797 

148 

1505 

151 

3927 

56 

1840 

638 

1536 

26 

1854 

111 

1572 

16 

1959 

1001 

1704 

44 

3451 

5 

1745 

59 

3732 

305 

1786 

304 

3820 

90 

1804 

144 

3908 

114 

1838 

796 

3937 

61 

1856 

89 

TABLE  IV  . 
(Continued) 


9-Guanlne 
FREQ  INT 

9GUA- 

FREQ 

H2O 

INT 

9GUA' 

FREQ 

2H2Q 

INT 

3968 

217 

1866 

99 

4144 

125 

1938 

1073 

3450 

5 

3761 

278 

3811 

168 

3887 

379 

3909 

125 

3930 

637 

3944 

71 

4132 

110 

4145 

135 

plexes  which  display  the  quite  usual  tendency  to 
approach  the  stability  of  fully  hydrated  species: 

9GUA  •  2H2O  >  7GUA  •  2H2O. 


IR  FREQUENCIES 

The  present  IR  frequencies  are  intended  to  guide 
experimental  investigators  in  the  study  of  the  title 
species.  As  can  be  expected  from  the  calculated 
data,  it  is  clear  that  the  major  differences  between 
the  spectra  of  isolated  species  and  complexes  are 
the  intermolecular  vibration  modes  (50-300  cm"^) 
and  the  valence  sym  and  antisym  OH  and  NH 
vibrations  (3400-4100  cm"^).  The  predicted  shifts 
of  vibrational  frequencies  upon  the  complexation 
can  be  applied  to  an  identification  of  these  species 
in  the  isolated-matrix  experiments.  Although  the 
HF/6-31G(d)  level  harmonic  frequencies  overesti¬ 
mate  the  experimentally  observed  unharmonic 
spectrum  by  approximately  10%,  the  differences 
between  predicted  and  observed  shifts  are  usually 
not  larger  than  10-20  cm“^ 


Conclusion 

In  this  article,  the  stability,  the  harmonic  IR 
frequencies,  and  the  structural  parameters  at  the 
post-Hartree-Fock  level  are  revealed.  The  princi¬ 
pal  conclusions  from  this  study  are  the  following: 

1.  The  structural  parameters  of  mono-  and  di- 
hydrated  oxo-amino-guanine  tautomers  cal¬ 
culated  at  the  MP2/6-31G*  level  permits  the 
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TABLE  V _ 

Calculated  IR  frequences  (FREQ,  in  cm  “^)  and 
absolute  intensities  (INT,  km  moP^)  for 
7-guanine  species. 


7-Guanine 
FREQ  INT 

7GUA- 

FREQ 

H2O 

INT 

7GUA 

FREQ 

2H2O 

INT 

160 

22 

49 

2 

33 

1 

169 

6 

111 

11 

57 

2 

213 

3 

158 

20 

59 

1 

324 

1 

174 

1 

147 

10 

344 

110 

192 

5 

160 

7 

366 

1 

251 

59 

167 

11 

406 

1 

287 

74 

180 

4 

509 

133 

333 

41 

202 

114 

523 

3 

352 

24 

219 

7 

586 

5 

387 

31 

235 

3 

644 

84 

394 

53 

317 

134 

669 

205 

422 

118 

345 

33 

685 

56 

451 

71 

365 

4 

702 

44 

544 

16 

385 

4 

733 

21 

583 

276 

399 

1 

814 

51 

608 

30 

433 

43 

824 

49 

665 

172 

484 

138 

869 

100 

680 

1 

508 

74 

907 

14 

694 

150 

526 

104 

1027 

4 

746 

25 

553 

294 

1041 

1 

778 

1 

594 

13 

1135 

19 

832 

21 

678 

197 

1203 

5 

848 

161 

691 

2 

1223 

40 

896 

87 

699 

104 

1259 

14 

908 

43 

741 

41 

1306 

85 

1027 

6 

798 

9 

1408 

25 

1050 

3 

811 

10 

1452 

19 

1143 

41 

836 

346 

1482 

83 

1194 

57 

869 

63 

1547 

233 

1229 

17 

905 

14 

1607 

64 

1259 

28 

925 

187 

1650 

3 

1336 

78 

1033 

3 

1710 

1 

1369 

65 

1038 

1 

1755 

304 

1461 

113 

1151 

20 

1804 

72 

1504 

186 

1211 

21 

1854 

602 

1551 

208 

1234 

16 

1970 

1003 

1591 

251 

1265 

13 

3450 

6 

1619 

84 

1319 

67 

3799 

56 

1673 

169 

1410 

45 

3851 

79 

1702 

60 

1469 

5 

3908 

49 

1787 

618 

1502 

192 

3913 

142 

1812 

128 

1547 

191 

1832 

156 

1608 

74 

1871 

508 

1661 

66 

3443 

7 

1720 

29 

3825 

51 

1756 

166 

3835 

608 

1811 

46 

3922 

135 

1849 

843 

3944 

46 

1855 

129 

TABLE  V  _ 
(Continued) 


7-Guanlne 
FREQ  INT 

7GUA- 

FREQ 

H2O 

INT 

7GUA' 

FREQ 

2H2Q 

INT 

3987 

311 

1864 

67 

4144 

156 

1917 

1132 

3447 

7 

3734 

381 

3811 

116 

3893 

348 

3917 

131 

3934 

711 

3937 

10 

4134 

111 

4146 

136 

classification  of  the  intermolecular  hydrogen 
bonds  as  rather  weak. 

2.  It  is  found  that  the  interaction  of  guanine 
tautomers  with  two  water  molecules  changes 
the  order  of  gas-phase  stability  (7GUA  9GUA) 
into  the  order  which  corresponds  to  the  sta¬ 
bility  of  the  guanine  tautomers  in  a  polar 
solvent  (9GUA  »  7GUA). 

3.  The  main  difference  in  the  calculated  IR  fre¬ 
quences  of  isolated,  monohydrated,  and  di- 
hydrated  species  are  found  in  the  region  of 
valence  sym  and  antisym  vibrations  of  OH 
and  NH  bonds  and  in  the  region  related  to 
the  intermolecular  vibrations, 
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ABSTRACT:  Molecular  orbital  calculations  were  carried  out  for  the  dimerization  step 
in  the  polymerization  process  of  vinyl  acetate  and  styrene  through  free  radicals  and  ionic 
mechanisms.  The  calculations  were  performed  for  monomers,  dimers,  their  positive  and 
negative  ions,  and  free  radicals.  The  minimum-energy  geometry  is  achieved  in  all  cases, 
the  geometrical  and  electronic  parameters  are  analyzed,  a  dimerization  reaction  pathway 
is  proposed,  and  the  heats  of  polymerization  obtained  are  in  excellent  agreement  with 
experimental  data.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  767-785,  1997 


Introduction 

In  polymers,  the  sequence  distribution,  tacticity, 
and,  in  general,  the  intramolecular  structure 
are  important,  because  they  may  suply  informa¬ 
tion  about  the  monomer  addition  process,  e.g., 
about  the  preference  of  monomers  to  add  in  the 
iso-  or  syndiotactic  configuration  [1,2].  Moreover, 
knowledge  about  the  intramolecular  structure  is  of 
paramount  importance  for  the  understanding  of 
the  relations  between  structure  and  polymer  prop¬ 
erties  [3]  and  there  is  significant  interest  in  charac¬ 
terizing  polymeric  materials,  especially  blends  at 
the  molecular  level.  This  interest  is  due,  in  part,  to 
the  tremendous  technological  importance  of  poly- 

International  Journal  of  Quantum  Chemistry,  Vol.  65,  767-785  (1997) 
©  1997  John  Wiley  &  Sons,  Inc. 


mer  blends.  The  bulk,  macroscopic  properties  of 
such  materials  are  determined  by  the  microscopic 
structure  that,  in  turn,  is  critically  dependent  upon 
the  degree  of  molecular  mixing  between  the  blend 
components.  The  peculiar  properties  of  polymer 
materials  arise  from  differences  between  the  ther¬ 
modynamic  interactions  in  systems  containing 
macromolecules  and  those  systems  of  only  small 
molecules. 

The  nature  of  intermolecular  interaction  deter¬ 
mines  the  microscopic  structure  in  any 
condensed-phase  system.  However,  when  the  sys¬ 
tem  contains  macromolecules,  there  is  additional 
complexity  due  the  intramolecular  considerations; 
the  number  of  configurations  available  to  a  poly¬ 
mer  chain,  although  restricted  by  covalent  bond 
geometries,  is  vast.  Thus,  the  morphology  of  a 

CCC  0020-7608  /  97  /  050767-1 9 
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chain  involves  the  interplay  between  the  interac¬ 
tions  of  the  polymer  with  its  environment  and  the 
possible  chain  conformations.  Since  the  interac¬ 
tions  between  the  chains  of  a  polymer  may  play  an 
important  role  in  defining  the  ultimate  properties 
of  these  polymers,  it  is  desirable  to  investigate 
quantitatively  the  structure  of  isolated  polymer 
chains.  The  geometry  of  the  dimers  may  be  impor¬ 
tant  in  defining  the  interactions  between  neighbor¬ 
ing  chains  of  the  polymer  and  in  the  monomer 
addition  mechanism.  To  understand  better  the  ge¬ 
ometry  of  these  dimers,  model  compounds  of 
poly(vinyl  acetate)  and  polystyrene  were  investi¬ 
gated. 

In  this  work,  we  used  vinyl  acetate  (VA)  and 
styrene  (ST)  dimer  bond  dissociation  energies  to 
model  the  reverse  reaction  of  a  chain-growth  poly¬ 
merization,  predicting  the  heat  of  polymerization. 
Also,  we  investigated  the  evolution  of  the  confor¬ 
mations  and  the  binding  process  when  a  monomer 
approaches  the  active  species,  an  ion  or  a  free 
radical.  In  other  words,  we  explored  a  reaction 
coordinate. 


We  present  results  of  semiempirical  and  ab  ini¬ 
tio  molecular  orbital  calculations  for  2,4-diacetate 
butane  as  a  model  compound  of  poly(VA)  and 
2,4-diphenylbutane  as  model  substance  of  PS.  All 
calculations  were  performed  with  the  system  of 
programs  Gaussian94  [4].  Two  levels  of  theory 
were  used:  semiempirical  calculations  with  the 
Austin  model  1  Hamiltonian  (AMI)  [5]  and  self- 
consistent-field  calculations  using  unrestricted 
open-shell  Hartree-Fock  wave  functions  (UHF)  [6] 
with  two  different  Gaussian  orbital  basis  sets:  3- 
21G^^  [7]  and  6-31G*  [8]. 

Using  geometrical  parameters  obtained  with  ab 
initio  methods  for  small  systems,  the  geometry 
optimization  of  bigger  molecules  could  be  carried 
out  using  semiempirical  methods  like  AMI  with¬ 
out  unduly  high  computer  and  memory  require¬ 
ments.  The  AMI  method  embodies  the  approxima¬ 
tions  that  are  inherent  to  the  neglect  of  diatomic 
differential  overlap  (NDDO)  formalism  [9]  and 
overcome  the  major  weakness  of  the  modified  ne¬ 
glect  of  differential  overlap  (MNDO)  method  [10] 
on  which  it  is  based.  It  is  known  that  AMI  pro- 


FIGURE  1.  Dimer  of  VA;  structure  optimized  with  the  HF/6-31G*  basis  set.  The  numbering  is  used  throughout 
the  text. 
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duces  fairly  accurate  equilibrium  geometries,  or¬ 
bital  orderings,  and  partial  charges  [11]. 


Methodology 

The  minimum-energy  conformation  for  the 
monomers  of  VA  and  ST  were  obtained  in  a  previ¬ 
ous  work  [12].  Taking  as  a  basis  those  geometries, 
model  compounds  to  represent  dimers  were 
formed;  in  each  case,  the  initial  geometry  corre¬ 
sponds  to  an  isotactic  all-trans  conformation. 

With  those  geometries,  an  optimization  process 
was  initiated.  In  the  first  step,  full-optimization 
calculations  were  made  for  each  dimer  to  find  the 
global  minimum.  With  the  resulting  geometries, 
rotational  barriers  were  calculated,  taking  as  rota¬ 
tion  axes  the  skeletal  bonds  between  heavy  atoms. 
The  torsion  angles  were  changed  with  increments 
of  20°  in  each  particular  case.  For  each  point,  all 
the  non-hydrogen  geometrical  parameters  were 
varied  simultaneously. 

To  form  the  free  radical  of  each  dimer,  one 
hydrogen  atom  was  removed  from  the  optimized 
VA  dimer  and  ST  dimer  models,  and  by  assigning 
a  positive  or  negative  charge,  the  corresponding 
cations  or  anion  were  formed.  These  processes 
were  made  with  the  optimization  of  the  geometri¬ 
cal  parameters  for  the  atoms  nearest  to  the  site 
where  the  hydrogen  atom  was  removed.  In  the 
next  step,  the  bond  between  monomeric  units  was 
broken;  it  was  done  by  changing  the  C2 — C3  (see 
Fig.  1)  bond  distance  for  the  VA  dimeric  species 
and  the  C4 — (see  Fig.  6)  bond  distance  for  the 
ST  dimeric  species.  This  bond  enlargement  was 
made  in  steps  of  0.2  A  from  the  equilibrium  dis¬ 
tances  to  4.5  A.  In  this  step,  two  types  of  calcula¬ 
tion  were  made:  In  the  first,  only  the  geometrical 
parameters  of  the  atoms  nearest  to  the  bond-break¬ 
ing  site  were  optimized,  and  in  the  second  one,  all 
the  geometrical  parameters  were  optimized.  A 
schematic  view  of  the  process  is 

(a)  A— A  A— A*  -h  H’ 

(b)  A— A*^A  +  A* 

(c)  A— A+->A  +  A+  (1) 

(d)  A— A"^  A  +  A", 

where  A — A  represents  a  dimer  molecule  formed 
by  two  monomers  of  A,  A —  A*  is  the  free  radical 


of  the  dimer,  A — A"^  is  a  cationic  dimer,  and 
A — A“  is  an  anionic  dimer. 

The  minimum-energy  conformation  and  total 
energy  for  the  different  monomeric  (A,  A ,  A^, 
A”)  and  dimeric  (A — A,  A —  A’,  A — A"^,  A — A~) 
species  were  found  by  allowing  all  atoms  to  move. 
The  bond  dissociation  energy  can  be  calculated  in 
two  different  ways:  as  the  difference  between  the 
sum  of  the  calculated  total  energies  for  the  right- 
side  species  in  its  minimum-energy  conformation 
and  the  total  energy  of  the  left-side  compound  also 
in  its  minimum  energy  conformation,  and  from  the 
potential  energy  curves  for  the  dissociation  of  VA 
and  ST  dimeric  species  [Eqs.  (Ib)-(Id)],  taking  the 
difference  between  the  energy  of  the  optimized 
dimeric  species  (initial  point  in  the  curve)  and  the 
energy  value  where  the  curve  becomes  asymptotic. 


TABLE  I _ 

Dimer  of  VA  z-matrix  input  for  the  Gaussian-94 
program;  the  numbers  on  the  atoms  correspond 
to  Figure  1. 

Dimer  of  VA 


H 


c 

1 

R2 

c 

2 

R3 

1 

A3 

H 

2 

R4 

1 

A4 

3 

D4 

C 

2 

R5 

1 

A5 

3 

D5 

H 

5 

R6 

2 

A6 

1 

D6 

0 

5 

R7 

2 

A7 

6 

D7 

H 

5 

R8 

2 

A8 

6 

D8 

H 

3 

R9 

2 

A9 

1 

D9 

C 

3 

RIO 

2 

A10 

9 

DIO 

0 

3 

R11 

2 

All 

9 

Dll 

C 

7 

R12 

5 

A12 

2 

D12 

H 

10 

R13 

3 

A13 

2 

D13 

H 

10 

R14 

3 

A14 

13 

D14 

H 

10 

R15 

3 

A15 

13 

D15 

C 

11 

R16 

3 

A16 

2 

D16 

0 

12 

R17 

7 

A17 

5 

D17 

C 

12 

R18 

7 

A18 

17 

D18 

0 

16 

R19 

11 

A19 

3 

D19 

C 

16 

R20 

11 

A20 

19 

D20 

H 

18 

R21 

12 

A21 

7 

D21 

H 

18 

R22 

12 

A22 

21 

D22 

H 

18 

R23 

12 

A23 

21 

D23 

H 

20 

R24 

16 

A24 

11 

D24 

H 

20 

R25 

16 

A25 

24 

D25 

H 

20 

R26 

16 

A26 

24 

D26 
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The  heat  of  dissociation  obtained  in  this  way  must 
be  the  negative  of  the  polymerization  heat  in  terms 
of  monomer  unit  added.  The  minimum-energy  ge¬ 
ometry  and  the  total  energy  for  each  radical,  cation, 
and  anion  and  monomeric  and  dimeric  species 
were  found  by  full  optimization. 


Results  and  Discussion 

VINYL  ACETATE 

In  previous  studies  of  the  VA  monomer  [12],  we 
reported  an  energetically  more  favorable  confor¬ 
mation.  Using  that  geometry,  the  dimer  of  VA  was 
formed.  This  model  compound  has  the  vinyl 
groups  in  a  parallel  position  perpendicular  to  the 
skeletal  sequence,  and  this  conformation  corre¬ 
sponds  to  an  isotactic  all-trans.  The  various  atoms 
have  been  numbered  (Fig.  1)  and  will  hereafter  be 
referred  to  by  these  numbers.  With  this  initial 
geometry,  a  calculation  changing  simultaneously 


all  the  geometric  parameters  was  made.  With  this 
new  geometry  and  taking  the  bond  C2 — C5  as  the 
axis  of  rotation,  the  rotational  barrier  was  calcu¬ 
lated  from  O'"  to  360°  with  angle  increments  of  20°; 
0°  corresponds  to  having  the  bond  C5 — C7  cis  to 
the  bond  C2 — C3.  In  each  point,  the  position  of  the 
acetate  groups  atoms  was  optimized.  From  this 
rotational  barrier,  the  minimum-energy  conforma¬ 
tion  was  taken  as  the  initial  one  for  the  next 
rotational  barrier  calculation,  this  time  through  the 
bond  C2 — C3.  Optimization  of  the  geometrical  pa¬ 
rameters  for  the  atoms  neighboring  this  bond  was 
made.  Again,  with  the  new  minimum-energy  con¬ 
formation,  a  full-optimization  calculation  was  car¬ 
ried  out.  Table  I  defines  the  notation  we  use  for 
the  internal  coordinates  of  the  VA  dimer  in  the 
format  of  the  z-matrix  input  for  the  program 
Gaussian  94.  The  UHF/6-31G*-optimized  values 
of  the  intermolecular  distances,  Rj,  the  bond  an¬ 
gles,  A;,  and  the  dihedral  angles,  D/,  are  given  in 
Tables  II,  III,  and  IV,  respectively.  The  first  two 


TABLE  II _ 

Internuclear  distances  for  the  different  dimeric  species  of  VA,  calculated  with  the  UHF/6-31G(c/)  method. 


Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free  radical 

R2 

1.08105 

1.084 

1 .0884 

1.0817 

1.0846 

R3 

1 .52795 

1 .5269 

1 .5366 

1.5431 

1 .5268 

R4 

1.0815 

1 .0835 

1.0898 

1 .0932 

1 .0885 

R5 

1.52584 

1.5211 

1 .4954 

1.473 

1 .4959 

R6 

1.0837 

1.0857 

1 .0761 

1.0816 

1.0783 

R7 

1.44419 

1.4118 

1 .3928 

1 .2428 

1.3696 

R8 

1.08037 

1 .0826 

R9 

1.08208 

1.083 

1.0866 

1 .0839 

1 .0831 

R10 

1 .52993 

1 .5237 

1 .5296 

1 .5237 

1 .5228 

R11 

1 .4502 

1 .4202 

1.3786 

1 .3976 

1.4196 

R12 

1.36113 

1 .3365 

1 .3295 

1.5175 

1 .3455 

R13 

1.08371 

1 .0844 

1 .0804 

1.0823 

1.0846 

R14 

1 .08271 

1.085 

1 .0908 

1 .0883 

1 .0845 

R15 

1.08225 

1 .0842 

1 .0893 

1 .0836 

1.084 

R16 

1.36081 

1.3354 

1.486 

1.3618 

1 .3353 

R17 

1.1968 

1.1826 

1.1876 

1.1522 

1.1803 

R18 

1.51217 

1.5117 

1.511 

1 .4847 

1 .5092 

R19 

1.19753 

1.1832 

1 .2647 

1.1745 

1.1832 

R20 

1.51332 

1.5131 

1.5217 

1.5086 

1.513 

R21 

1 .07769 

1.079 

1 .0801 

1 .0794 

1 .0792 

R22 

1 .08359 

1 .0841 

1.084 

1 .0834 

1 .0848 

R23 

1.08314 

1 .0844 

1 .0828 

1 .0834 

1.0834 

R24 

1.08328 

1.0836 

1 .0885 

1.0835 

1.0835 

R25 

1.08254 

1 .0847 

1 .0948 

1.0851 

1 .0848 

R26 

1.07822 

1.079 

1 .0875 

1.0787 

1.0791 
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TABLE  III _ 

Bond  angles  for  the  different  dimeric  species  of  VA,  calculated  with  the  UHF/6-31G(c/)  method. 


Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free  radical 

A3 

109.20119 

109.3892 

109.5052 

111.3468 

109.4869 

A4 

107.68760 

107.0329 

106.1593 

107.4943 

106.5460 

A5 

109.67980 

109.5263 

110.0410 

110.3933 

109.3951 

A6 

110.86996 

110.6728 

120.5230 

122.9585 

119.9846 

A7 

105.08688 

106.4235 

117.4401 

120.6877 

112.5518 

A8 

1 1 1 .52025 

111.2892 

A9 

110.00961 

108.8651 

108.8560 

109.0228 

108.8957 

A10 

113.59662 

114.0690 

112.3991 

111.9307 

113.2548 

A11 

104.15747 

105.1302 

107.0583 

102.8399 

105.3686 

A12 

124.38530 

123.3404 

122.2633 

119.8762 

123.1970 

A13 

110.80390 

110.8378 

107.6032 

110.7637 

111.0958 

A14 

111.23519 

111.3142 

112.5814 

111.4145 

110.9288 

A15 

109.16586 

110.2305 

110.2342 

110.3348 

109.9922 

A16 

125.74818 

125.2239 

114.661 

125.4188 

125.1943 

A17 

119.24141 

119.1289 

119.7086 

116.5776 

118.7749 

A18 

116.88147 

118.1824 

117.5256 

107.7608 

117.6845 

A19 

119.29680 

119.0126 

116.6171 

117.5297 

119.0181 

A20 

117.31459 

118.8165 

104.436 

118.3378 

118.7979 

A21 

107.87529 

107.9146 

108.1808 

108.3815 

107.9812 

A22 

1 1 1 .03209 

111.4270 

110.5046 

109.4000 

110.6888 

A23 

110.86750 

111.0104 

111.1130 

109.3901 

111.2064 

A24 

111.64095 

112.0892 

112.1481 

112.2392 

112.1051 

A25 

110.65525 

110.7329 

1 1 1 .3839 

110.7128 

110.7063 

A26 

107.61335 

107.5731 

109.1456 

107.5398 

107.5774 

column  data  correspond  to  the  initial  (AMI -opti¬ 
mized)  and  UHF/6-31G*-optimized  values  for  the 
VA  dimer,  respectively.  The  last  three  data 
columns  give  the  UHF/6-31G*-optimized  parame¬ 
ters  values  for  the  anion,  cation,  and  free-radical 
dimeric  species.  By  adopting  the  convention  given 
by  Flory  and  coworkers  [13],  the  final  geometry 
corresponds  to  a  tg  conformation.  Figure  1  shows 
the  VA  dimer  final  conformation. 

With  the  minimum-energy  conformation  for  the 
dimer  obtained  in  this  way,  the  bond  distance 
C5 — Fig  was  changed  by  increments  of  0.2  A  from 
the  equilibrium  value  to  5  A.  This  bond  enlarge¬ 
ment  is  equivalent  to  the  dimer  dissociation  in  one 
hydrogen  and  one  dimer  free  radical.  This  dissoci¬ 
ation  process  was  made  with  optimization  of  the 
geometrical  parameters  for  the  atoms  nearest  to 
the  bond  which  is  being  broken.  In  Table  III  it  is 
possible  appreciate  how  the  bond  angles  C2 — 
C5 — O7  and  C2 — C5 — Fi^  change  from  almost  te¬ 
trahedral  in  the  initial  conformation  to  almost  trig¬ 


onal  when  the  system  is  dissociated,  and  in  Table 
V,  how  the  electronic  densities  for  the  atoms 
neighboring  C5  are  substantially  changed.  With 
the  dimer  free-radical  geometry  obtained  this  way 
and  the  program  parameter  charge  made  equal  to 
zero,  one,  or  minus  one,  the  geometries  and  heats 
of  formation  were  obtained  for  the  free-radical, 
cation,  and  anion  dimeric  species,  respectively. 

For  each  one  of  these  three  species,  the  C2 — C3 
bond  distance  was  changed  from  the  equilibrium 
value  to  4.2  A  with  increments  of  0.3  A.  Figure  2 
shows  the  UHF/3-21G*  potential  energy  curves 
obtained  for  the  dissociation  of  the  dimeric  species. 
The  dissociation  products  are  one  monomer  and 
one  ionic  or  free-radical  monomeric  species. 
Around  the  distance  of  2.0  A,  the  energy  is  a 
maximum,  which  means  that  the  dimeric  species 
are  dissociated  and  the  C2 — C3  bond  is  almost 
broken.  Fiowever,  between  2.0  and  4  A,  some 
attractive  interactions  occur  principally  between 
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TABLE  IV _ _ _ _ 

Dihedral  angles  for  the  different  dimeric  species  of  VA,  calculated  with  the  UHF/6"31G(d)  method. 


Atom 

Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free  radical 

D4 

-118.03805 

-117.3563 

-116.4926 

-118.5230 

-117.2242 

D5 

124.37077 

125.3241 

125.3115 

128.0308 

124.1003 

D6 

-177.00407 

-176.0425 

-167.6353 

-165.8375 

-166.8072 

D7 

-119.10359 

-119.5823 

-141.3371 

-177.6376 

-141.3146 

D8 

121.47613 

120.4413 

D9 

-179.21453 

-177.4649 

-181.4689 

-176.8852 

-180.4869 

D10 

-124.41414 

-122.9400 

-119.3468 

-122.5343 

-122.6618 

D11 

118.08135 

117.7678 

119.0362 

118.2071 

117.8713 

D12 

-178.27760 

-177.9501 

-95.9448 

-182.6225 

-167.3026 

D13 

-175.48810 

-174.8035 

-178.4848 

-176.9629 

-177.8408 

D14 

-120.13487 

-119.7525 

-120.2503 

-119.3246 

-120.1987 

D15 

119.16572 

119.4663 

118.6905 

119.5042 

119.5933 

D16 

-151.51809 

-146.3294 

-145.8997 

-150.9679 

-147.3915 

D17 

-178.93995 

-178.8368 

-182.3227 

-359.1194 

-175.9514 

D18 

-179.80448 

-179.8345 

-180.9907 

-180.0441 

-179.6381 

D19 

-166.59531 

-171.7614 

-28.5634 

-171.9048 

-171.9182 

D20 

-178.31833 

-178.4059 

-134.4832 

-178.1465 

-178.3473 

D21 

-179.42262 

-178.4141 

-179.1716 

-179.9199 

-186.8656 

D22 

119.81004 

120.029 

120.3031 

120.6593 

119.2167 

D23 

-119.65612 

-119.6049 

-120.6860 

-120.6502 

-120.7299 

D24 

-62.28269 

-58.5701 

-51.6634 

-58.0384 

-58.2904 

D25 

120.90497 

120.7807 

120.8520 

121.2944 

120.7116 

D26 

-119.74459 

-120.1251 

-120.4455 

-119.9548 

-120.1626 

O7  and  the  hydrogens  and  H4.  In  Tables  II-IV 
are  the  geometrical  parameters  obtained  for  the 
dimer  anion,  cation,  and  free  radical  of  the  dissoci¬ 
ated  VA  dimer.  Notice  the  geometry  adopted  by 
the  monomeric  species. 

Figures  3  and  4  show  the  evolution  of  the  Mul- 
liken  and  electrostatic  potential  net  atomic  charges 
calculated  for  the  skeletal  atoms  as  a  function  of 
the  C2 — C3  bond  distance  for  the  cation  of  the 
dimer  when  calculated  with  the  UHF/3-21G(d) 
method.  It  is  necessary  to  point  out  the  sign  and 
magnitude  of  the  charges  on  the  C3  and  C5  atoms. 
In  the  final  state,  C3  has  a  positive  net  atomic 
charge  because  when  the  C2 — C3  bond  is  broken  a 
monomeric  cation  with  an  electron-deficient  car¬ 
bon  atom  is  formed.  The  cationic  character  is  on 
C3.  On  the  other  hand,  the  other  dissociation  prod¬ 
uct  is  a  monomeric  unit  with  a  double  bond  be¬ 
tween  C2  and  C5.  This  double  bond  is  evident 
from  the  net  atomic  charges  on  the  C2  and  C5 
atoms  which  have  Mulliken  atomic  charge  nega¬ 


tive  values.  Figure  4  shows  how  the  atomic  charge 
values  calculated  from  the  electrostatic  potential 
have  almost  the  same  behavior  in  respect  to  the 
C2 — C3  bond  distance.  The  geometrical  parame¬ 
ters  for  the  dimeric  species,  neutral,  anionic, 
cationic,  and  free  radical,  of  the  VA  are  shown  in 
Tables  II- IV.  Notice  the  monomer  geometry  which 
is  planar  and  the  nonplanar  geometry  for  the 
monomeric  species.  Figure  5  shows  for  the  free- 
radical  dimer  the  geometry  adopted  by  the  dissoci¬ 
ation  products. 

The  process  in  the  opposite  direction  must  be  a 
dimerization  process,  which  may  be  interpreted  as 
the  propagation  step  in  a  polymerization  mecha¬ 
nism,  The  heats  of  polymerization  calculated  for 
VA  from  the  potential  energy  curves  are  — 18.627 
kcal/mol  for  the  anionic  mechanism,  —31.778 
kcal/mol  for  the  cationic  mechanism,  and  -  23.022 
kcal/mol  for  the  free-radical  mechanism;  the  ex¬ 
perimental  value  from  the  literature  [14]  is  -21.0 
kcal/mol. 
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TABLE  V _ 

Total  atomic  charges  from  Mulliken  population  analysis  for  the  dimeric  species  of  VA, 
caicuiated  with  the  UHF/6-31G(d)  method. 


Atom 

VA 

VA  anion 

VA  cation 

VA  free  radical 

1  H 

0.205172 

0.147594 

0.287148 

0.287148 

2C 

-0.332017 

-0.357533 

-0.476120 

-0.476120 

3H 

0.144219 

0.245764 

0.154746 

0.154746 

4C 

0.207229 

0.181474 

0.290899 

0.290899 

5H 

-0.002987 

0.120137 

0.400659 

0.400659 

6C 

0.159126 

0.173557 

0.318052 

0.318052 

7H 

-0.609809 

-0.597995 

-0.514816 

-0.514816 

8C 

0.167823 

9H 

0.167434 

0.146838 

0.193760 

0.193760 

10H 

-0.498400 

-0.492737 

-0.511565 

-0.511565 

11  c 

-0.623438 

-0.735056 

-0.634492 

-0.634492 

12c 

0.774221 

0.770519 

0.751987 

0.751987 

13C 

0.176504 

0.218369 

0.223286 

0.223286 

14H 

0.161408 

0.090427 

0.154930 

0.154930 

15  H 

0.192680 

0.113955 

0.215910 

0.215910 

16C 

0.776163 

0.509799 

0.769270 

0.769270 

17C 

-0.526482 

-0.561445 

-0.399745 

-0.399745 

18C 

-0.621054 

-0.610418 

-0.579832 

-0.579832 

19  H 

-0.528501 

-0.790477 

-0.478013 

-0.478013 

20  H 

-0.624681 

-0.492259 

-0.629457 

-0.629457 

21  H 

0.229435 

0.196462 

0.276367 

0.276367 

22  H 

0.192066 

0.196666 

0.262972 

0.262972 

23  H 

0.195782 

0.215863 

0.263654 

0.263654 

24  H 

0.190545 

0.100517 

0.197866 

0.197866 

25  H 

0.197787 

0.099525 

0.202732 

0.202732 

26  H 

0.229773 

0.110453 

0.259804 

0.259804 

STYREIVE 

Table  VI  defines  the  notation  used  for  the  inter¬ 
nal  coordinates  of  the  ST  dimer  in  the  format  of 
the  z-matrix.  The  optimized  values  of  the  internu- 
clear  distances,  Rj,  the  bond  angles,  A/,  and  the 
dihedral  angles,  D/,  are  given  in  Tables  VII,  VIII, 
and  IX,  respectively.  In  the  first  and  second 
columns  are,  respectively,  the  AMI-  and  UHF/6- 
31G*-optimized  parameter  values  for  the  dimer  of 
styrene.  The  last  three  data  columns  give  the 
UHF/3-21G*-optimized  parameters  for  the  anion, 
cation,  and  free  radical  of  the  ST  dimer.  The  atomic 
numbering  is  presented  in  Figure  6. 

The  curves  of  the  C4 — bond  distance  vs.  the 
UHF/3-21G*  energy  for  the  dissociation  of  the 
three  species  of  styrene  are  shown  in  Figure  7. 
There,  the  three  curves  present  a  maximum  be¬ 
tween  2.1  and  2.4  A,  and  every  one  has  a  different 
form.  The  curve  corresponding  to  the  ST  cation 
shows  asymptotic  behavior  from  2.5  A,  which 


means  that  the  C4 — bond  is  broken,  forming  a 
styrene  monomer  with  a  double  bond  between  the 
and  the  atoms  and  an  ST  monomer  cation. 
Figure  8  shows  the  dissociation  products.  The 
cationic  character  of  the  dimer  can  be  appreciated 
in  Figure  9  by  the  net  atomic  charge  on  the  carbon 
atoms  4  and  11  before  and  after  the  dimer  dissocia¬ 
tion.  The  free-radical  curve  reaches  its  asymptoti¬ 
cal  behavior  at  approximately  2.8  A.  The  dissocia¬ 
tion  products  of  the  dimer  anion  are  a  monomeric 
anion  and  a  monomer.  The  net  atomic  charge  on 
is  from  —0.5  at  the  equilibrium  distance  value 
to  around  —0.2  for  distances  greater  than  2.8  A, 
On  the  other  hand,  the  net  atomic  charge  on  C4  is 
from  almost  zero  to  —  0.55.  This  fact  confirms  that 
in  this  case,  again,  the  dissociation  is  undertaken 
given  the  above-mentioned  products. 

When  the  C4 — C5  bond  is  breaking,  there  are 
changes  in  the  bond  and  dihedral  angles  of  the 
atoms  Cg  and  H7  which  are  bonded  to  C4.  Figure 
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FIGURE  2.  Potential  energy  curve  for  the  VA  dimeric  species,  calculated  with  the  UHF/3-21G*  model  chemistry.  The 
zero  energy  corresponds  to  606.0  au. 


FIGURE  3.  Change  of  the  Mulliken  net  atomic  charges  for  the  main-chain  atoms  during  the  dissociation  of  the  VA 
cation  of  the  dimer,  calculated  at  the  UHF/3-21G*  level  of  theory. 
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FIGURE  4.  Change  of  the  electrostatic  potential  net  atomic  charges  for  the  main-chain  atoms  during  the  dissociation 
of  the  VA  cation  of  the  dimer,  calculated  at  the  UHF/3-21G*  level  of  theory. 


FIGURE  5.  Dissociation  products  of  the  VA  free  radical  of  the  dimer,  obtained  at  the  UHF  /  3-21 G*  level  of  theory. 
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TABLE  VI _ 

ST  z-matrix  input  for  the  Gaussian-94  program; 
the  numbering  of  the  atoms  correspond  to 
Figure  6. 


H 


c 

1 

R2 

H 

2 

R3 

1 

A3 

C 

2 

R4 

1 

A4 

3 

D4 

0 

H 

2 

R5 

1 

A5 

3 

D5 

0 

C 

4 

R6 

2 

A6 

1 

D6 

0 

H 

4 

R7 

2 

A7 

6 

D7 

0 

C 

4 

R8 

2 

A8 

6 

D8 

0 

H 

6 

R9 

4 

A9 

2 

D9 

0 

H 

6 

RIO 

4 

A10 

9 

DIO 

0 

c 

6 

R11 

4 

All 

9 

Dll 

0 

c 

8 

R12 

4 

A12 

2 

D12 

0 

c 

8 

R13 

4 

A13 

12 

D13 

0 

H 

11 

R14 

6 

A14 

4 

D14 

0 

H 

11 

R15 

6 

A15 

14 

D15 

0 

C 

11 

R16 

6 

A16 

14 

D16 

0 

C 

13 

417 

8 

A17 

4 

El  7 

0 

C 

12 

R18 

8 

A18 

4 

D18 

0 

H 

12 

R19 

8 

A19 

18 

D19 

0 

H 

13 

R20 

8 

A20 

17 

D20 

0 

C 

17 

R21 

13 

A21 

8 

D21 

0 

H 

17 

R22 

13 

A22 

21 

D22 

0 

H 

18 

R23 

12 

A23 

8 

D23 

0 

C 

16 

R24 

11 

A24 

6 

D24 

0 

C 

16 

R25 

11 

A25 

24 

D25 

0 

H 

21 

R26 

17 

A26 

13 

E26 

0 

C 

25 

R27 

16 

A27 

11 

D27 

0 

C 

24 

R28 

16 

A28 

11 

D28 

0 

H 

24 

R29 

16 

A29 

28 

D29 

0 

H 

25 

R30 

16 

A30 

27 

D30 

0 

C 

27 

R31 

25 

A31 

16 

D31 

0 

H 

27 

R32 

25 

A32 

31 

D32 

0 

H 

28 

R33 

24 

A33 

16 

D33 

0 

H 

31 

R34 

27 

A34 

25 

D34 

0 

10  shows  how  the  bond  angles  of  the  Cg,  H7, 
Hjo,  and  atoms  change  from  almost  tetrahe¬ 
dral  to  trigonal  values.  The  variation  of  the  bond 
angle  defined  by  C2— C4 — is  represented  by 
the  lowest  curve  in  Figure  10.  This  curve  and  the 
curves  in  Figure  11  provide  information  about  the 
trajectory  followed  by  the  monomer  when  it  is 
moving  away  from  the  other  fragment.  Such 
changes  are  because  on  C4  and  a  hybridization 
change  is  occurring  from  SP3  to  SP2.  Figure  12 
shows  the  change  of  the  total  overlap  population 
on  the  bonds  of  the  main  chain  as  a  function  of  the 


bond  distance.  There  it  is  possible  to  appreciate  a 
change  from  a  single  to  a  double  bond  between  the 
Cg  and  atoms.  Also,  between  C4  and  Cg,  there 
is  a  change  from  a  characteristic  value  of  a  single 
bond  to  a  zero  total  overlap  population  value 
when  this  bond  has  been  broken.  The  change  in 
the  Cg  bond  angle  from  112  A  to  almost  90  A 
shows  the  trajectory  followed  by  the  monomer 
cation  when  it  is  approaching  the  molecular 
monomer.  Figure  8  presents  the  dissociation  prod¬ 
ucts. 

The  energy  difference  between  products  and 
reagents  [see  Eq.  (1)]  could  be  a  measure  of  the 
polymerization  heat.  From  the  dissociation  curves, 
the  calculated  values  for  ST  are  -16.31,  -19.45, 
and  -21.34  kcal/mol  for  the  anionic,  cationic,  and 
free-radical  mechanism,  respectively.  The  values 
calculated  for  the  cationic  species  are  in  excellent 
agreement  with  the  experimental  value  which  is 
—  16.7  kcal/mol  [15,16]. 


Conclusions 

For  VA  and  ST,  we  optimized  the  geometry  of 
monomers  and  dimers  in  their  neutral,  cationic, 
anionic,  and  free-radical  forms.  The  heats  of  poly¬ 
merization  obtained  from  the  dissociation  process 
are  in  good  agreement  with  the  values  reported  in 
the  literature.  With  the  methods  used  in  this  work, 
it  is  possible  to  appreciate  the  rearrangement  of 
the  electronic  density.  In  all  the  cases  where  a 
bond  rupture  or  formation  (ionization  and  dimer¬ 
ization)  occurred,  changes  in  the  hybridization  of 
the  atoms  directly  involved  in  such  processes  were 
observed. 

From  the  energy  differences  for  the  process  of 
dimerization,  it  is  possible  to  see  that  for  the  VA 
the  anionic  mechanism  seems  to  be  the  most  feasi¬ 
ble  one,  followed  by  the  free  radicals.  The  more- 
favored  mechanism  for  ST  is  the  anionic  followed 
by  the  cationic.  This  order  of  polymerization  is  in 
excellent  agreement  with  the  Schildknecht  classifi¬ 
cation  [17]. 

The  results  enable  us  to  postulate  that  heats  of 
polymerization  can  be  reasonably  estimated  by  ab 
initio  quantum  mechanical  calculations.  Many  pos¬ 
sibilities  may  be  suggested  for  the  reaction  path¬ 
way  followed  by  the  monomer  in  their  approach  to 
the  other  species,  but  only  one  path  is  possible  in 
the  dissociation  process.  The  path  which  we  are 
proposing  is  the  dimerization  path  for  ST  and  VA. 
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TABLE  VII _ 

Internuclear  distances  for  the  different  dimeric  species  of  styrene,  caiculated  with  the  UHF  /  6-31  G(d)  and 
UHF/3-21G*  methods. 


Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free  radical 

R2 

1.08313 

1 .0845 

1 .087351 

1.085147 

1 .082979 

R3 

1.08376 

1 .0859 

1 .084398 

1 .082008 

1.084316 

R4 

1 .54506 

1 .5365 

1 .548771 

1.541442 

1.541876 

R5 

1.08437 

1 .0847 

1 .083480 

1.081592 

1.084130 

R6 

1 .54822 

1.5415 

1 .565072 

1 .592307 

1 .560874 

R7 

1 .0857 

1.0875 

1 .087424 

1 .082980 

1 .083802 

R8 

1.524 

1.523 

1.512671 

1.517996 

1.521031 

R9 

1 .08331 

1.086 

1 .083338 

1 .079065 

1.083519 

R10 

1.08414 

1 .0871 

1 .083540 

1.081076 

1 .084238 

R11 

1 .5496 

1 .5372 

1.510959 

1 .476950 

1 .509654 

R12 

1 .39065 

1 .3935 

1 .465396 

1 .399079 

1 .399970 

R13 

1.38684 

1 .3893 

1.384125 

1 .395863 

1 .397356 

R14 

1.08352 

1.0846 

R15 

1.08539 

1.0871 

1 .076574 

1.076165 

1 .074842 

R16 

1.51699 

1.6146 

1 .409728 

1 .383034 

1.410167 

R17 

1 .38483 

1.3872 

1 .389970 

1.393763 

1 .393798 

R18 

1 .38284 

1 .3837 

1.394176 

1.391641 

1.391450 

R19 

1.07168 

1 .0757 

1 .074704 

1 .073076 

1.071672 

R20 

1 .07365 

1 .0764 

1.082014 

1 .073428 

1 .073298 

R21 

1 .38267 

1 .3831 

1 .456959 

1.391121 

1 .392209 

R22 

1 .07227 

1 .0758 

1 .074224 

1.071714 

1.072019 

R23 

1 .07202 

1 .0758 

1 .080686 

1.071540 

1 .072622 

R24 

1 .38826 

1.3903 

1 .428397 

1.421720 

1 .427760 

R25 

1.38713 

1.3908 

1 .428047 

1.421996 

1 .427277 

R26 

1.07112 

1 .0754 

1.075916 

1 .070472 

1 .072221 

R27 

1.38324 

1.3851 

1 .390244 

1.364232 

1 .389586 

R28 

1 .38389 

1.3856 

1.388108 

1 .367788 

1 .387982 

R29 

1 .07367 

1.0768 

1.073511 

1.073147 

1 .073661 

R30 

1 .07377 

1 .0768 

1.070176 

1 .070342 

1.071485 

R31 

1 .38479 

1.3855 

1.402132 

1.396969 

1.401893 

R32 

1 .07259 

1 .0757 

1.073199 

1 .069944 

1 .072030 

R33 

1.072 

1 .0758 

1 .073939 

1 .069775 

1.072197 

R34 

1.0713 

1 .0754 

1 .072277 

1 .070921 

1.072131 
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TABLE  VIII _ 

Bond  angles  for  the  different  dimeric  species  of  ST,  calcuiated  with  the  UHF/6-31G(d)  and  UHF/3-21G* 
methods. 


initial  Optimized 

parameters  parameters  Anion  Cation  Free  radical 


A3 

108.33837 

107.7667 

108.787 

108.644 

110.504 

A4 

111.21367 

111.4655 

111.976 

111.722 

108.427 

A5 

108.10089 

107.6871 

108.791 

107.786 

111.214 

A6 

112.17282 

122.9059 

109.928 

1 1 1 .643 

108.076 

A7 

108.11570 

107.2820 

107.699 

108.915 

111.382 

A8 

110.49859 

111.2918 

112.138 

112.551 

108.608 

A9 

108.76368 

108.5197 

107.480 

108.076 

108.428 

A10 

109.62103 

109.8119 

107.938 

107.689 

112.622 

A11 

113.64087 

114.3750 

114.024 

109.519 

121.499 

A12 

121.19954 

121.5492 

120.837 

121.427 

120.421 

A13 

120.48590 

120.4949 

121.337 

119.796 

117.560 

A14 

110.12286 

110.1620 

A15 

109.14581 

109.1545 

117.966 

116.240 

124.834 

A16 

110.65706 

112.3793 

125.044 

127.225 

121.026 

A17 

121.04501 

121.2240 

123.675 

120.617 

120.847 

A18 

120.83009 

121.0439 

118.444 

120.390 

119.901 

A19 

119.93723 

119.9504 

120.941 

120.520 

119.400 

A20 

119.46750 

119.4909 

118.015 

119.900 

120.091 

A21 

120.13621 

120.1610 

118.221 

120.0460 

119.780 

A22 

119.76853 

119.7094 

120.924 

119.755 

119.777 

A23 

119.76843 

119.6806 

118.159 

119.723 

119.896 

A24 

120.63336 

120.8650 

120.366 

118.895 

123.000 

A25 

120.70228 

120.8867 

122.670 

122.381 

120.277 

A26 

120.32113 

120.3923 

120.420 

120.212 

120.995 

A27 

120.81487 

120.9931 

120.957 

120.179 

121.364 

A28 

120.75128 

121.0002 

121.503 

120.585 

118.672 

A29 

119.53357 

119.5145 

118.575 

119.108 

119.625 

A30 

119.45174 

119.5118 

119.215 

119.852 

120.672 

A31 

120.12089 

120.1800 

120.856 

119.721 

119.577 

A32 

119.86709 

119.7680 

119.449 

120.517 

119.801 

A33 

119.82719 

119.7669 

119.860 

120.665 

120.237 

A34 

120.18628 

120.2854 

120.313 

119.193 
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TABLE  IX _ 

Dihedral  angles  for  the  different  dimeric  species  of  ST,  calculated  with  the  UHF/6-31G(d)  and  UHF/3-21G* 
methods. 


Initial 

parameters 

Optimized 

parameters 

Anion 

Cation 

Free  radical 

D4 

-121.79377 

-122.5815 

121.771 

-123.139 

-121.644 

D5 

117.06081 

115:9720 

117.265 

116.908 

117.271 

D6 

62.09327 

62.6393 

56.961 

65.152 

60.288 

D7 

-118.90094 

-117.9309 

-116.470 

-118.285 

-117.901 

D8 

123.61427 

125.5284 

123.991 

121.429 

123.633 

D9 

172.42559 

174.3642 

174.229 

175.667 

173.099 

D10 

-116.94578 

-115.6387 

-114.803 

117.712 

-115.854 

D11 

120.83665 

121.3542 

123.273 

122.755 

123.011 

D12 

-62.04618 

-61.8644 

-60-204 

-54.350 

-60.001 

D13 

-179.74842 

-179.8569 

-179.021 

-179.694 

-179.184 

D14 

61 .72438 

62.4989 

D15 

-117.86062 

-116.7008 

94.894 

88.511 

94.818 

D16 

121.17856 

121.7599 

-177.612 

-177.834 

-177.681 

D17 

179.79928 

179.8637 

179.136 

179.804 

179.304 

D18 

-179.77884 

-179.8371 

-179.140 

-179.670 

-179.275 

D19 

179.91523 

179.9692 

-179.789 

-179.533 

-179.949 

D20 

-179.99439 

-180.0385 

179.836 

179.180 

179,839 

D21 

-0.05688 

-0.016 

-0.092 

-0.140 

-0.113 

D22 

179.95525 

180.0192 

179.779 

179.726 

179.984 

D23 

179.93643 

179.9625 

-179.989 

-179.749 

179.972 

D24 

-90.34242 

-91.1110 

1 79.493  ( 

-182.503 

178.103 

D25 

177.99809 

178.6660 

179.775 

179.912 

179.842 

D26 

179.98703 

180.0002 

-179.949 

179.912 

179.978 

D27 

-178.02109 

-178.556 

-179.508 

179.595 

-179.866 

D28 

178.02595 

178.5297 

179.477 

179.552 

179.862 

D29 

179.68947 

179.7631 

-1 79.970  ( 

179.579 

-179.918 

D30 

-179.85802 

-179.7743 

-179.371 

-179.715 

179.758 

D31 

-0.00875 

-0.0313 

-0.058 

0.928 

-0.020 

D32 

-179.84312 

-179.7941 

-179.720 

179.527 

179.913 

D33 

179.84029 

179.8574 

-179.915 

-179.395 

179.961 

D34 

-179.89313 

-179.8424 

179.985 

179.380 

-179.936 

FIGURE  6.  Dimer  of  ST;  optimized  structure  with  the  HF/6-31G*  method,  showing  the  numbering  used  throughout 
the  text. 
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FIGURE  7.  Potential  energy  curve  for  the  dimeric  species  of  ST,  calculated  with  the  UHF/3-21G*  model  chemistry. 
The  zero  energy  corresponds  to  612.0  au. 


FIGURE  8.  Dissociation  products  of  the  dimer  free  radical  of  ST,  obtained  at  the  UHF  /  3-21 G*  level  of  theory. 
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Bond  distance (angstroms) 


FIGURE  12.  Bond  distance  vs.  half  of  the  total  overlap  population  calculated  for  the  ST  cation  of  the  dimer. 
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ABSTRACT:  The  reaction  mechanism  and  relative  stabilities  of  the  intermediates  and 
transition  states  in  the  reaction  of  ^02  plus  ethylene  molecule  using  ab  initio  molecular 
orbital  (MO)  theories  at  several  levels  of  theory  with  the  correction  of  the  nondynamic 
and  dynamic  electron  correlation  effects  were  systematically  investigated.  Full  geometry 
optimizations  of  the  corresponding  biradical  (BR)  intermediates,  perepoxide  (PE)  and 
1,2-dioxetane  (DO)  were  performed  by  complete-active-space  CASSCF{2,2}/6-31G* 
method  with  nondynamic  electron  correlation  effect  and  Moller-Plesset  MP2/6-31G* 
method  with  dynamic  electron  correlation  effect.  For  the  1/4-biradical  intermediates,  new 
gauche- type  ^BR  state  was  found  at  both  CASSCF  and  MP2  levels,  corresponding  to  the 
transition  state  of  the  rotation  motion  of  the  O2  moiety.  It  was  found  from  the  intrinsic- 
reaction-coordinate  (IRC)  study  that  another  gauche-type  ^BR  transition  state  connects 
smoothly  to  the  reactant  system  ^02  +  H2C  =  CH2  and  the  gauche  minimum  ^BR  state, 
showing  that  the  reaction  through  the  1,4-biradical  intermediates  initially  proceeds 
through  the  gauche  transition  state  to  form  the  gauche  minimum  ^BR  state,  following 
that  the  free  rotation  of  O2  moiety  occurs  due  to  the  energy  barrier  less  than  4.0 
kcal /mol.  The  stability  of  the  perepoxide  is  surprisingly  sensitive  to  the  levels  of  the 
theory  and  the  basis  sets  employed.  The  coupled-cluster  methods,  CCSD  and  CCSD(T), 
gave  the  reasonable  stabilities  of  1,4-biradical  intermediates,  perepoxide,  and  dioxetane 
as  a  reaction  product.  From  the  results  of  the  CCSD  and  CCSD(T)  methods,  the  reaction 
of  ^02  +  H2C  =  CH2  proceeds  by  a  two-step  mechanism  through  the  1,4-biradical 
intermediates  rather  than  through  the  perepoxide.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J 
Quant  Chem  65:  787-801,  1997 
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YOSHIOKA  ET  AL 


The  additions  of  the  singlet  molecular  oxygen 
to  various  types  of  the  olefin  derivatives  have 
been  extensively  studied  over  30  years  [1,  2].  A 
large  number  of  accumulated  data  suggested  three 
types  of  the  reaction  modes:  (I)  1,2-addition  to  give 

1,2-dioxetane;  (II)  ene  reaction  to  give  carbonyl 
fragments  and  allylic  hydroperoxides;  (III)  1,4-ad¬ 
dition  to  give  endperoxides.  For  these  reaction 
modes,  the  following  five  mechanisms  are  sug¬ 
gested  as  shown  schematically  in  Figure  1:  (i) 
concerted  mechanism  which  proceeds  without 
well-defined  intermediates;  (ii)  mechanism  through 
a  homopolar  biradical  (BR)  intermediates;  (iii) 
mechanism  through  a  zwitterionic  (ZW)  interme¬ 
diates;  (iv)  mechanism  through  a  one-electron 
transfer  (ET)  biradical  intermediates;  (v)  mecha¬ 
nism  through  a  perepoxide  (PE)  intermediates  [3, 
4].  The  experimental  studies  on  the  reaction  mech¬ 
anisms  [3,  4]  have  clarified  that  energy  differences 
between  these  reaction  mechanisms  are  not  so  large 
even  in  a  gas  phase  and  highly  dependent  on  the 
chemical  conditions  such  as  solvent  polarity  em¬ 
ployed,  Several  reaction  mechanism  are,  then,  op¬ 
erative  in  each  reaction  mode. 

On  the  other  hand,  theoretical  studies  [5,  6] 
showed  that  the  reacting  complexes  between 
molecular  oxygen  and  various  olefins  are  regarded 
as  typical  quasi-degenerate  systems  in  energy.  This 
indicates  that  the  reaction  complexes  are  very  sen- 


O 


FIGURE  1.  Typical  five  reaction  mechanisms  of  singlet 
oxygen  molecule  and  ethylene  derivatives. 


sitive  for  introducing  substituents  and/or  chang¬ 
ing  reaction  conditions  such  as  solvents,  and  inter¬ 
conversion  of  reaction  mechanisms  easily  occurs. 
This  also  suggests  the  possibility  that  the  relative 
stabilities  of  electronic  states  on  the  reaction  path¬ 
way  are  highly  sensitive  to  the  level  of  the  theory 
with  and  without  correlation  effects  and  basis  sets 
employed. 

Harding  and  Goddard  supported  the  biradical 
mechanism  such  as  CH2CH2OO  with  extensive 
calculations  at  the  GVB-CI  level  with  a  double-^ 
plus  polarization  (DZP)  basis  set  [7].  The  perepox¬ 
ide  were  located  as  a  minimum  with  higher  en¬ 
ergy  than  biradical  minimum  by  about  8  kcal/mol. 
On  the  basis  of  the  MINDO/3  level  of  theory, 
Dewar  et  al.  studied  the  reaction  of  O2  with  vari¬ 
ous  substituted  alkenes  [8].  They  found,  in  the  case 
of  ethene  itself,  that  the  reaction  proceeds  through 
a  polar  perepoxide,  which  in  turn  rearranges  to 

1,2-dioxetane  in  a  rate-determining  step. 

Hotokka  et  al.  have  performed  the  calculations 
of  reaction  intermediates  and  transition  states 
optimized  with  constraints  of  the  symmetry  and 
geometry  at  the  level  of  CASSCF  and  CCI, 
concluding  that  the  formation  of  the  perepoxide 
intermediate  is  easier  than  passing  through  a  C^, 
syn-biradical  pathway  [9].  Robb  and  co-workers 
suggested  from  the  calculations  at  the  level  of 
CASSCF{6,6}/4-31G  that  the  1,2-dioxetane-forming 
reaction  is  described  as  a  two-step  reaction  which 
proceeds  through  a  gauche  biradical  intermediate 
[10].  It  is,  however,  noteworthy  that  these  theoreti¬ 
cal  results  may  not  be  conclusive  because  the 
CASSCF  do  not  involve  the  correction  of  the  dy¬ 
namic  electron  correlation.  In  fact,  the  heat  of  the 

1,2-dioxetane  formation  is  too  small  (12.8  [9]  and 
4.0  [10b]  kcal/mol)  compared  with  the  value  from 
thermochemical  estimation  (30  kcal/mol).  These 
theoretical  results,  therefore,  show  that  the  reac¬ 
tion  mechanism  is  still  not  conclusive  for  the  reac¬ 
tion  mode  as  well  as  the  experimental  data. 

In  this  article,  we  systematically  investigate  the 
reaction  mechanism  and  relative  stabilities  of  the 
intermediates  and  transition  states  in  the  reaction 
of  ethylene  molecule  plus  ^©2  using  ab  initio 
molecular  orbital  (MO)  theories  at  several  calcula- 
tional  levels  with  the  correction  of  the  nondynamic 
and  dynamic  electron  correlation  effects  [11,  12]. 
Full  geometry  optimizations  of  the  corresponding 
BR  intermediates,  PE,  and  1,2-dioxetane  (DO)  are 
performed  by  CASSCF{2,2} /6-31G*  level  with  non¬ 
dynamic  electron  correlation  effect  and  Hartree- 
Fock  [HF;  restricted  (R)  or  unrestricted  (U)] 
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MP2/6-31G*  level  with  dynamic  electron  correla¬ 
tion  effect.  The  relative  stabilities  of  the  reaction 
intermediates  and  transition  states  are  estimated 
again  by  the  approximately  spin-projected  but 
size-consistent  UHF  scheme  (APUHF)  [13],  the  ap¬ 
proximate  spin  projection  scheme  (APUMP2  and 
APUMP4)  [6],  and  the  RHF  or  UHF  coupled-clus¬ 
ter  [(R  or  U)CCSD  and  (R  or  U)CCSD(T)]  [6d,  14] 
methods,  CASMP2  [15]  method,  and  CASPT2  [12] 
method  at  the  optimized  geometries  using  the 
same  6-31G*  basis  set.  The  reaction  mechanisms 
through  1,4-biradical  intermediates  and  perepox- 
ide  will  be  discussed  and  suggested.  The  program 
package  Gaussian  94  [15]  was  employed  to  evalu¬ 
ate  the  above. 


Calculational  Procedures 

Calculations  for  the  singlet  biradical  intermedi¬ 
ates  were  carried  out  at  UHF,  UHF  natural  orbital 
(UNO)  CASSCF  [16,  17]  and  UMP2  [18]  levels  of 
theory  with  6-31G*  basis  set.  As  described  in  the 
previous  works  [19]  the  RHF  is  not  suitable  for 
estimating  the  electronic  structures  of  the  species 
in  the  singlet  biradical  states  due  to  the  triplet 
instability  [13].  The  singly  occupied  MOs  (SOMO) 
with  radical  characteristics  are,  then,  induced  by 
mixing  highest  occupied  (HO)  and  lowest  unoccu¬ 
pied  (LU)  MOs  of  the  RHF  solution,  resulting  in 
the  different  orbitals  for  different  spins  (DODS) 
MO  picture  by  the  singlet  UHF  solution.  The  sin¬ 
glet  DODS  solution  constructed  is  not  a  pure 
eigenfunction  of  the  spin  angular  momentum  due 
to  the  contamination  from  triplet  excited  wave 
function.  The  removement  of  the  contamination 
result  in  the  approximately  spin-projected  UHF 
(APUHF)  [13].  The  correlation  correction  by  the 
APUHF  method  for  the  case  of  tricentric  biradical 
such  as  O3,  CH2NHCH2,  CH2OO,  OCH2O,  and 
CH2CH2O  gives  the  simple  relation  as  follows 
[19a]: 

^UNO  CASCI{2,2}  =  ^APUHF  =  ^UNO  CASSCF{2,2)  •  (1) 

Similarly,  UMP2  is  not  also  a  pure  spin  state  with 
the  contamination  from  higher  excited  states.  The 
approximate  spin  projection  to  the  singlet  state 
gives  the  APUMP2  wave  function.  As  described  in 
a  previous  work  [19b],  the  APUMP2  method  is 
simply  related  by  the  CASMP2  method  from  the 
behavior  of  potential  energy  curve  for  the  rotation 


of  the  ethylene  molecule,  as  follows: 

^UNO  CASMP2{2,2}  ^APUMP2  +  A,  (2) 

where  A  is  term  corresponding  to  the  orbital  relax¬ 
ation  energy. 

The  geometry  optimizations  for  the  singlet  bi¬ 
radical  species  were  carried  out  by  two  different 
procedures:  (i)  the  natural  orbitals  are  constructed 
by  DODS  MOs  obtained,  following  that  the  UNO 
CASSCF{2,2}  calculations  were  performed  in  an 
active  space  of  two  SOMOs  and  two  biradical 
electrons  to  include  the  nondynamic  electron  cor¬ 
relation  effect  and  (ii)  UMP2  calculations  were 
performed  by  using  DODS  MOs  of  the  UHF  solu¬ 
tion  to  include  the  dynamic  electron  correlation 
effect  [11].  The  usual  RHF  picture  was  used  for 
CASSCF{2,2}  and  MP2  (RMP2)  to  calculate  the 
singlet  closed-shell  species  such  as  perepoxide, 
1,2-dioxetane,  and  so  on. 

All  geometrical  parameters  of  the  reaction  inter¬ 
mediates  and  transition  states  were  fully  opti¬ 
mized  by  the  above  two  methods  without  con¬ 
straints  of  the  symmetry  and  geometry.  For  the 
stationary  point,  the  vibrational  frequency  analy¬ 
ses  were  carried  out  to  clarify  the  intermediate, 
transition  state  (TS)  or  higher-order  saddle  point 
on  the  potential  energy  surface.  The  relative  stabil¬ 
ities  of  the  reaction  intermediates  and  transition 
states  are  estimated  again  by  the  APUHF, 
APUMP2APUMP4,  UCCSD,  UCCSIXT),  CASMP2, 
and  CASPT2  methods  at  the  optimized  geometries. 


Results  and  Discussion 

REACTANTS 

The  geometry  optimizations  of  O2  (^A^)  at  the 
CASSCF{2,2}/6-31G*  level  gives  the  0—0  bond 
distances  of  1.210  A,  being  in  quite  good  agree¬ 
ment  with  the  experimental  value  of  1.216  A  [20a]. 
The  0—0  bond  distance  of  CASSCF{2,2}/6-31G* 
is  smaller  by  0.048  A  than  1.258  A  of  CASSCF/4- 
31G  by  Robb  et  al.  [10b].  For  the  triplet  O2  (^S^), 
the  O — O  bond  distances  were  estimated  to  be 
1.168  and  1.247  A  by  C ASSCF{2,2}  ^  and  UMP2, 
respectively,  being  smaller  by  0.040  A  and  longer 
by  0.039  A  than  the  experimental  value  of  1.208  A. 
The  ^A^  state  at  CASSCF{2,2}  level  is  higher  in 
energy  by  0.62  eV  (14.2  kcal/mol)  than  the  ^2^ 
state,  being  quite  smaller  than  the  experimental 
value  of  1.07  eV. 
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In  the  ethylene  molecule,  the  RMP2/6-31G* 
method  gives  the  optimized  geometrical  parame" 
ters  of  1.336  A  for  C — C  bond  distance,  1.085  A  for 
C — H,  and  121.7°  for  C — C — H  bond  angle,  being 
in  quite  reasonable  agreement  with  the  experimen¬ 
tal  values  [20b]  of  1.330  A,  1.076  A,  and  121.7°, 
respectively. 

REACTIOIV  PATHWAY  THROUGH 

1,4-BIRADICAL  INTERMEDIATES 

Optimized  1,4-Biradical  Intermediates 

The  (2  s  +  2s)-type  attacks  of  singlet  molecular 
oxygen  CO2)  to  ethylene  molecule  have  the  possi¬ 
bility  to  provide  intermediates  with  1,4-biradical 
orbitals  on  terminal  carbon  and  oxygen  atoms  in 
the  s-cis  type  of  the  coplanar  geometry.  Four  types 
of  the  singlet  BR  intermediates  are  expected  by  the 
rotation  of  the  terminal  methylene  group  and/or 
oxygen  atom,  that  is,  Vcr-,  Vtt-,  Vcr-,  and  Vtt-BR 
intermediates.  The  C — O  bond  cleavage  of  1,2-di- 
oxetane  (DO)  leads  the  Vcr-BR  intermediate  with 
a--type  radical  orbital  on  terminal  carbon  atom  and 
cr-type  radical  orbital  on  another  terminal  oxygen 
atom.  The  W-BR  intermediate  has  higher  energy 
by  12.9  kcal/mol  than  Vtt-BR  at  APUHF/4-31G 
level,  and  the  Vcr-BR  state  is  also  more  unstable 
by  13.3  kcal/mol  [21].  Though  the  V77-BR  state  is 
comparable  in  energy  with  the  Vtt-BR  state  [21], 
the  terminal  methylene  group  is  rotated  by  90°  in 
the  Vtt-BR  state.  This  means  that  the  reaction  to 
1,2-dioxetane  proceeds  through  the  energy  barrier 
due  to  the  rotation  of  the  terminal  methylene 
group.  In  this  study,  we  investigated  the  Vtt-BR 
states  as  the  reaction  intermediates  to  the  1,2-di- 
oxetane.  Although  only  s-cis  conformation  is  noted, 
the  s-trans  and  gauche  conformations  are  equally 
possible  for  1,4-peroxy  biradicals.  Full  geometry 
optimizations  of  the  corresponding  s-cis  conforma¬ 
tion,  the  s-trans  and  gauche  conformations  were 
carried  out  at  the  levels  of  CASSCF{2,2}/6-31G* 
and  UMP2/6-31G*. 

Four  optimized  structures  with  the  geometries 
of  s-cis,  two  gauche's,  and  s-trans  types  in  the  Vtt 
BR  state  were  found  by  UMP2/6-31G*  and 
CASSCF{2,2}/6-31G*  methods.  Their  conforma¬ 
tions  with  optimized  geometrical  parameters  are 
shown  in  Figure  2.  Their  relative  stabilities  to  the 
gauche-type  ^BR(G1)  state  are  summarized  in  Table 
I.  The  corresponding  values  by  the  CASSCF/4-31G 
method  by  Robb  et  al.  [10b]  are  also  given  for 
comparison. 


As  can  be  seen  from  Table  I,  the  most  stable 
structures  of  the  singlet  BR  state  is  the  gauche-type 
BR(G1),  which  was  optimized  by  both  UMP2/6- 
31G*  and  CASSCF{2,2}/6-31G*  methods.  This 
state  is  a  real  local  minimum  (stationary  point)  on 
the  potential  energy  surface  with  all  positive 
eigenvalues  of  Hessian  matrix  =  65  cm”^  by 
UMP2  method),  being  in  good  agreement  with  the 
result  of  Robb  et  al.  [10b].  The  bond  distances  of 
C — O  and  O — O  given  by  UMP2  are  slightly  larger 
than  those  by  CASSCF{2,2},  while  the  C — C  bond 
distance  of  UMP2  is  slightly  shorter  than  that  of 
CASSCF{2,2).  These  tendencies  are  observed  in  an¬ 
other  three  optimized  structures.  For  the  bond 
distances  of  O — O  and  C — 0, 1.310  and  1.472  A  of 
UMP2  and  1.301  and  1.432  A  of  CASSCF{2,2}  using 
the  6-31G*  basis  set  are  quite  shorter  than  1.376 
and  1.530  A  of  CASSCF{6,6)/4-31G  by  Robb  et  al. 
[10b].  These  differences  may  be  due  to  the  inclu¬ 
sion  of  the  polarization  functions  and  the  use  of 
unrestricted  natural  orbital  (UNO)  for  the  CASSCF 
method.  The  twisted  angles  (82.8°  and  87.1°)  of 
O— O— C— C  by  both  UMP2  and  CASSCF{2,2} 
are  slightly  larger  than  79.6°  of  Robb  et  al.  [10b]. 

Next,  the  stationary  point  is  the  s-trans  Vtt-BR 
state  with  higher  energy  by  only  0.6  and  0.3 
kcal/mol  than  the  most  stable  gauche-type  BR(G1) 
state  by  UMP2/6-31G*  and  CASSCF{2,2}/6-31G* 
methods,  respectively.  This  state  possesses  the 
lowest  vibrational  frequencies  of  21  cm“^  at 
UMP2/6-31G*  level,  indicating  that  the  Vtt-BR 
stationary  point  is  really  a  shallow  minimum  on 
the  potential  energy  surface.  These  results  are  also 
in  quite  good  agreement  with  the  results  of  Robb 
et  al.  [10b].  The  bond  distances  of  O — O,  C — O, 
and  C — C  are  quite  similar  to  the  values  of  the 
most  stable  gauche-type  BR(G1)  state  at  both 
UMP2/6-31G*  and  CASSCF{2,2}/6-31G*  levels. 
The  O — O  and  C — O  bond  distances  are  also 

o 

shorter  by  0.071-0.095  A  compared  with  those  of 
Robb  et  al.  [lObj. 

Each  of  UMP2/6-31G*  and  CASSCF{2,2}/6- 
31 G*  methods  gives  two  transition  states,  the  sad¬ 
dle  point  on  the  potential  energy  surface,  with  one 
negative  eigenvalue  of  Hessian  matrix;  one  is  a 
s-cis  Vtt-BR  state  and  another  is  a  gauche  ^BR(G2) 
state.  The  s-cis  Vtt-BR  state  has  higher  energy  by 
2.5  and  3.5  kcal/mol  than  those  of  the  most  stable 
gauche-type  BR(G1)  state  at  UMP2/6-31G*  and 
CASSCF{2,2}/6-31G*  levels,  respectively,  com¬ 
pared  with  2.0  kcal/mol  of  Robb  et  al.  [10b].  Robb 
et  al.  found  three  optimized  geometries  with  the 
structures  of  s-cis-type  1,4-biradical  conforma- 
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FIGURE  2,  Optimized  geometrical  parameters  of  the  singlet  biradical  states  obtained  by  UMP2/6-31G*  method  and 
UNO  CASSCF{2,2} /6-31G*  method  in  parenthesis. 


tions;  one  is  obtained  by  both  4“31G  and  STO-3G 
basis  sets  using  CASSCF  method;  others  are  given 
by  only  STO-3G  basis  set.  Two  structures,  which 
are  the  second-order  saddle  points  on  the  113  and 
04  energy  surface  (Fig.  5  in  Ref.  [10b]),  have  the 
C— O  bond  distances  of  1.842  A  (1.874  A  by  4-31G) 
and  1.759  A  given  by  STO-3G  basis  set  employed, 
being  quite  different  from  our  results  of  1.478  A 
(UMP2)  and  1.491  A  (CASSCF).  Robb  et  al.  [10b] 
also  showed  that  the  third  structure  with  the  C — O 
bond  distance  of  1.538  A  by  STO-3G  is  a  local 
minimum  on  the  113  energy  surface.  This  structure 
corresponds  to  our  structure.  Their  conclusion  is, 
however,  in  conflict  with  our  result  that  the  s-cis 


Vtt-BR  state  is  the  saddle  point  on  the  potential 
energy  surface,  namely  the  transition  state.  The 
s-cis  Vtt-BR  and  gauche-type  BR(G2)  states  have 
really  imaginary  frequencies  of  125/  and  44/  cm“^ 
at  UMP2/6-31G*  level,  respectively. 

It  should  be  also  noted  that  the  gauche-type 
BR(G2)  state  obtained  here  is  far  from  the  C^ 
gauche  first-order  saddle  point  given  by  Robb  et 
al.  [10b]  in  the  molecular  geometry  and  relative 
stability  to  the  gauche-type  ^BR(G1)  state.  The 
C — O  distances  are  1.472  and  1.432  A  and  the 
relative  energies  to  most  stable  gauche-type  BR(Gl) 
state  are  estimated  to  be  0.7  kcal/mol  by  both 
UMP2/6-31G*  and  CASSCF{2,2}/6-31G*,  while 
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TABLE  I 


Relative  stabilities  (Real  /  mol)  of  optimized  structures  of  reactants,  reaction  intermediates, 
transition  states,  and  products. 

States  \  Methods 

CASSCF{2,2} 

MP2 

CASSCF^ 

O^CAg)  +  HgC  =  CH2 

-18.1 

-22.8  (-17.7) 

OgfSg)  +  H2C  =  CHj 

VttBR  (s-cis) 

-32.3 

-30.1 

3.5 

2.5 

(2.0) 

^BR  (gauche!) 

0.0*’ 

0.0® 

0.0  (0.0) 

’BR  (gauche2) 

0.7 

0.7 

0.6  (0.4) 

VttBR  (s-trans) 

0.3 

0.6 

^BR  (gauche  TS) 

19.9 

6.7  (11.1) 

V17BR  (s-trans  SP2)'' 

18.7® 

21.0 

9.6  (12.1) 

Perepoxide  (PE) 

3.2 

-0.6 

-39.0  (32.3) 

PE  (SP2)'' 

44.4 

12.9 

-25.4  (45.4) 

Dioxetane  (’DO) 

-51.8' 

-51.5 

-26.8  (-41.8)*’ 

’do  (TS) 

-51.89 

-50.8 

2H2C  =  0 

-84.5' 

-105.0 

^CASSCF/4-31G  and  {CASSCF /  STO-3G)  in  Ref.  [10b]  of  Robb  et  al. 
‘^Total  energy  is  -227.598231  a.u. 

'^Total  energy  is  -228.168207  a.u. 

'^SP2  means  a  second-order  saddle  point. 

®CASSCF{2,2}  value  at  the  geometry  optimized  by  CASSCF{4,4}. 
^Total  energy  is  -227.680865  a.u. 

^Total  energy  is  -227.680805  a.u. 

^C2v  symmetry  not  C2. 

'The  formaldehyde  dimer  with  C-C  distance  of  10  A  was  calculated. 


Robb  et  al.  [10b]  gave  the  C — O  distance  of  1.850 
A  and  the  relative  energy  of  6.7  kcal/mol  by  4-31G 
basis  set.  This  shows  that  the  present  gauche-type 
BR(G2)  state  is  apparently  different  from  the  tran¬ 
sition  state  leading  to  1,2-dioxetane  suggested  by 
Robb  et  al.  [10b],  even  if  the  gauche-type  BR(G2) 
state  has  single  negative  eigenvalue  of  the  Hessian 
matrix. 

The  potential  energy  curves  of  the  singlet  BR 
states  for  C — O  rotation  at  UMP2  and  CASSCF 
levels  are  depicted  in  Figure  3.  Four  singlet  BR 
states  obtained  here  are  smoothly  connected,  indi¬ 
cating  that  the  present  gauche-type  BR(G2)  state  is 
apparently  the  transition  state  of  the  rotation  mo¬ 
tion  of  1,4-biradical  states.  The  rotational  energy 
barriers  are  within  3.5  kcal/mol,  showing  that  the 
free  rotation  easily  occurs  in  the  singlet  state.  The 
carbon-carbon  bond  distances  of  four  structures 
are  fallen  into  small  regions  from  1.478  to  1.482  A, 
being  larger  than  1.336  A  of  ethylene  molecule  at 
RMP/6-31G*  level.  This  indicates  that  the  radical 
characteristics  on  a  terminal  carbon  atom  with 
cleavage  of  tt  bonding  orbital  is  kept  throughout 
the  rotation  motion.  Similarly,  the  oxygen-oxygen 
bond  distances  of  1.306,  1.310,  1.311,  and  ^1.311  A 
are  longer  than  1.208  A  (X^2”)  and  1.216  A 


of  oxygen  molecule  estimated  by  the  experiment, 
reflecting  the  biradical  characteristics  on  terminal 
oxygen  atom.  The  C — O  distances  are  between 
1.472  and  1.495  A  for  the  rotation  motion.  The 
UMP2  method  gives  longer  distances  than  the 
CASSCF{2,2}  method  due  to  inclusion  of  dynamic 
electron  correlation  effects  except  for  C — C  dis¬ 
tance  of  s-trans  Vtt-BR  state. 

Relative  Stabilities  of  1,4-Biradical 

Intermediates  at  Various  Calculational  Levels 

Table  II  summarizes  the  relative  stabilities  of 
four  1,4-biradical  states  estimated  by  various  lev¬ 
els  of  theory  based  on  the  geometries  optimized  by 
UNO  CASSCF{2,2}  and  UMP2  method  using  6- 
31 G*  basis  set.  The  UMP4  with  higher-order  corre¬ 
lation  correction  indicates  that  the  most  stable  1,4- 
biradical  state  is  the  gauche-type  BR(Gl)  state  at 
the  geometries  optimized  by  both  CASSCF{2,2} 
and  UMP2  methods.  The  relative  stabilities  of  four 
states  gives  the  same  ordering  to  those  of  UMP2 
levels,  indicating  that  BR(Gl)  and  Vtt-BR  (s-trans) 
states  are  the  local  minimum  and  BR(G2)  and 
Vtt-BR  (s-cis)  are  the  first-order  saddle  points  on 
the  potential  energy  surface. 
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Rotational  angle  (degree) 


FIGURE  3.  Potential  energy  curves  of  the  singlet  bIradical  states  for  the  rotation  ofC  —  C  —  O  —  0  estimated  by 
UMP2/6-31G*  and  UNO  CASSCF{2,2} /6-31G*  methods. 


The  approximately  spin-projected  (AP)UHF  and 
(AP)UMPn  levels  are  constructed  by  removing  the 
contamination  of  the  triplet  exited  wave  function. 
The  APUHF  gives  the  same  ordering  of  the  rela¬ 
tive  stabilities  at  four  biradical  states  with  the 


geometries  optimized  by  CASSCF{2,2}  and  UMP2 
methods.  As  is  shown  in  Eq.  (1),  and  presented  in 
the  previous  work  [19],  the  APUHF  gave  the  corre¬ 
lation  correction  nearly  equal  to  those  of  the 
CASCI{2,2}  and  CASSCF{2,2}  in  the  case  of  tri- 


TABLE  II _ 

Relative  stabilities  (kcal  /  moi)  of  four  1 ,4-biradlcal  intermediates  estimated  by  various  levels  of  theory 
based  on  the  geometries  optimized  by  CASSCF{2,2}  and  UMP2  using  6-31 G*  basis  set. 


States  Geom  \ 
Methods 

VttBR  (s-cis) 

^BR  (G1)^ 

^BR  (G2)® 

VttBR  (s-trans) 

CASSCF 

UMP2 

CASSCF 

UMP2'= 

CASSCF 

UMP2 

CASSCF 

UMP2 

UHF 

2.11 

3.08 

-0.81 

0.0 

-0.18 

0.36 

-0.97 

-0.08 

APUHF 

2.64 

3.73 

-0.56 

0.0 

0.12 

0.85 

-0.30 

0.68 

CASSCF 

2.50 

3.62 

-1.00 

0.0 

-0.29 

0.74 

-0.73 

0.33 

UMP2 

3.53 

2.53 

1.07 

0.0 

1.67 

0.67 

-1.61 

0.64 

APUMP2 

4.38 

3.52 

1.26 

0.0 

1.90 

1.31 

2.53 

1.68 

UMP4 

3.99 

2.59 

1.44 

0.0 

2.18 

0.72 

2.00 

0.65 

APUMP4 

4.88 

3.61 

1.58 

0.0 

2.41 

1.40 

2.98 

1.76 

CCSD 

4.07 

2.86 

1.27 

0.0 

2.08 

0.70 

1.65 

0.50 

CCSD(T) 

4.31 

2.78 

1.53 

0.0 

2.36 

0.75 

2.02 

0.58 

CASMP2 

3.97 

3.16 

1.08 

0.0 

1.75 

1.06 

2.01 

1.20 

CASPT2 

10.34 

9.48 

-0.33 

0.0 

1.17 

4.93 

8.48 

7.59 

^Gauche  geometry,  G1  is  a  local  minimum  and  G2  is  a  transition  state. 

'"Total  energies  in  units  of  a.u.  are  -227.606549  for  UHF,  -227.606588  for  APUHF,  -227.596633  for  CASSCF,  -228.168207  for 
UMP2,  -228.168429  for  APUMP2,  -228.224839  for  UI\/IP4,  -228.225177  for  APUMP4,  -228.212413  for  CCSD,  -228.229669 
for  CCSD(T),  -228.175792  for  CASMP2,  and  -228.193027  for  CASPT2. 
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centric  biradical  intermediates  such  as  O3, 
CH2NHCH2,  CH2OO,  OCH2O,  and  CH2CH2O.  It 
seems  that  these  relationships  are  also  supported 
in  the  case  of  tetracentric  biradical  species. 

However,  the  APUMPn  (n  =  2,4)  gives  the  re¬ 
versed  stabilities  that  the  gauche-type  BR(G2)  state 
is  more  stable  than  the  s-trans  Vtt-BR  state,  con¬ 
flicting  with  above  results  that  the  s-trans  Vtt-BR 
state  is  the  local  minimum  and  the  gauche-type 
BR(G2)  state  is  the  first-order  saddle  point  on  the 
potential  energy  surface.  These  results  show  that 
the  approximately  spin-projected  (AP)  procedure 
is,  in  some  cases,  not  effective  for  describing  the 
energy  differences  of  the  energetically  quasi-de¬ 
generate  chemical  system.  This  may  be  due  to  fact 
that  the  APUMPn  methods  include  the  small  con¬ 
tribution  of  the  higher  excited  states  than  triplet 
excited  state.  The  APUMPn  methods  should  be 
carefully  used  to  describe  the  chemical  systems 
and  intermediates  with  strong  biradical  character 
as  well  as  APUHF. 

Two  coupled-cluster  CCSD  and  CCSD(T)  meth¬ 
ods  with  dynamic  electron  correlation  show  that 
the  relative  stabilities  of  four  1,4-biradical  state 
give  the  same  ordering  to  those  optimized  by 
CASSCF{2,2}  and  UMP2  methods.  The  energy  gaps 
between  four  BR  states  fall  into  within  a  few 
kcal/mol,  giving  the  reasonable  chemical  pictures. 

However,  in  the  CASMP2  method,  the  charac¬ 
teristics  of  four  states  such  as  minimum  and  tran¬ 
sition  state  on  the  potential  energy  surface  are  lost 
as  well  as  the  APUMP2  and  APUMP4  methods, 
indicating  that  the  inclusion  treatment  of  the  dy¬ 
namic  electron  correlation  based  on  the  CASSCF 
wave  function  is  not  effective  for  1,4-biradical 
species.  The  behaviors  of  the  CASMP2  and 
APUMP2  methods  support  the  relationship  given 
in  Eq.  (2).  Though  the  CASPT2  method  gives  larger 
energy  gaps  between  four  1,4-biradical  states  than 
the  CASMP2,  the  CASPT2  has  similar  tendencies 
on  the  characteristics  of  the  potential  energy  sur¬ 
face  to  the  CASMP2. 

From  Table  II,  following  conclusions  are  de¬ 
scribed  for  the  chemical  systems  which  are  quasi¬ 
degenerate  in  energy  and  have  the  strong  biradical 
character.  The  dynamic  electron  correlation  is  es¬ 
sentially  necessary  to  elucidate  the  electronic 
structures.  The  inclusion  of  the  dynamic  electron 
correlation  should  be  done  carefully  and  ade¬ 
quately.  The  geometry  optimization  should  be 
done  to  represent  the  energetics  in  quasi-degener¬ 
ate  chemical  system  and  the  electronic  structures 


with  the  biradical  character  even  if  the  high-qual¬ 
ity  method  is  employed. 

Transition  States  with 

1 ,4-Bira(lical  Characteristics 

Figure  4  shows  optimized  geometry  of  the  Cj 
gauche  transition  state  with  just  one  imaginary 
frequency  of  1191/  cm-1  at  UMP2/6-31G*  level. 
These  structures  are  corresponding  to  the  transi¬ 
tion  state  given  by  the  CASSCF  calculation  of 
Robb  et  al.  [10b].  As  is  indicated  in  Table  I,  the 
relative  energies  of  19.9  kcal/mol  at  UMP2  level  is 
larger  than  6.7  kcal/mol  of  Robb  et  al.  [lObj.  The  C 
— C  and  C — O  bond  distances  of  1.373  and  1.873  A 

o 

are  shorter  by  only  0.028  A  and  longer  by  only 
0.023  A  of  Robb  et  al.  [10b],  being  similar  tenden¬ 
cies  in  the  case  of  the  1,4-biradical  states. 

Robb  et  al.  [10b]  indicated  the  qualitative  de¬ 
scription  of  the  potential  energy  surfaces  (Fig.  5  in 
Ref.  [10b]).  They  described  that  the  gauche  TS  is 
connected  to  the  gauche  1,4-biradical  minimum 
(Gl).  They  also  suggested  that  the  avoided  cross¬ 
ing  corresponding  to  the  closure  of  the  gauche 
1,4-biradical  minimum  to  1,2-dioxetane  exists  and 
the  transition  state  is  located  far  from  the  gauche 
TS  obtained.  However,  they  did  not  discuss  the 
path  from  the  gauche  TS  to  the  1,2-dioxetane 
molecule  concretely. 

The  intrinsic-reaction-coordinate  (IRC)  calcula¬ 
tion  was  performed  to  elucidate  the  characteristics 
of  the  gauche  transition  state.  Figure  5  shows  the 


^BR(G  TS) 

FIGURE  4.  Optimized  geometrical  parameters  of  the 
gauche  transition  state  obtained  by  UMP2/6-31G* 
method. 
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FIGURE  5.  Energy  variation  from  the  gauche  transition  state  along  the  intrinsic  reaction  coordinate  using  UMP2/ 
6-31 G*  method. 


energy  variation  from  the  gauche  transition  state 
along  the  intrinsic  reaction  coordinate.  The  energy 
profile  smoothly  connects  to  both  directions  of  two 
different  states.  The  geometries  on  the  terminal 
points  of  the  reverse  and  forward  directions  of 


intrinsic  reaction  coordinate  are  depicted  in  Figure 
6.  It  is  found  from  Fig.  6(a)  that  the  geometry 
obtained  from  the  reverse  direction  is  almost  equal 
to  that  of  the  gauche  ^BR  minimum  shown  in 
Figure  2,  indicating  that  the  gauche  transition  state 


(A)  Reverse  direction  (B)  Forward  direction 

FIGURE  6.  Geometrical  parameters  of  the  states  obtained  by  the  intrinsic  reaction  coordinate  calculation  from  the 
gauche  transition  state  using  the  UMP2/6-31G*  method:  (a)  reverse  direction,  (b)  forward  direction. 
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is  smoothly  connected  to  the  gauche  ^BR  state. 

Figure  6(b)  represents  the  geometry  obtained  from 

the  forward  direction.  The  C — O  distance  is  2.710 
0 

A,  being  quite  longer  as  a  chemical  bonding.  The 
C — C  bond  distance  in  the  ethylene  moiety  is  1.310 
A,  being  nearly  equal  to  1.336  A  of  the  ethylene 
molecule  optimized  by  RMP2/6-31G*  method. 
Apparently,  the  gauche  transition  state  has  a 
reaction  path  to  connect  with  the  dissociation  limit 
of  ’O2  +  HjC  =  CH^. 

The  reaction  of  ^©2  +  H2C  =  CH2  through  the 
1,4-biradical  intermediates  initially  proceeds 
through  the  gauche  transition  state  to  form  the 
gauche  ^BR  minimum  state,  following  that  the  free 
rotation  of  O2  moiety  along  the  C — O  bond  axis 
occurs  due  to  the  energy  barrier  less  than  4.0 
kcal/mol.  The  free  rotation  brings  about  the  possi¬ 
bility  to  form  the  1,2-dioxetane  molecule. 

The  transition  state  corresponding  to  the  gauche 
^BR(G  TS)  state  was  not  found  at  the  UNO 
CASSCF{2,2}  level.  The  occupation  numbers  of  the 
UNOs  are  1.998,  1.874,  1.052,  0.948,  0.126,  and 
0.002  for  HOMO  -  2,  HOMO  -  1,  HOMO,  LUMO, 
LUMO  -h  1,  LUMO  +  2,  respectively,  showing  that 
the  active  space  in  CASSCF  should  be  composed 
of  four  orbitals  and  four  electrons  at  least.  How¬ 
ever,  at  the  UNO  CASSCF{4,4}  level,  the  transition 
state  corresponding  to  the  ^BR(G  TS)  state  was  not 
also  found.  This  indicates  that  the  wide  active 
space  with  more  orbitals  and  more  electrons  may 
be  necessary  [19].  On  the  contrary,  the  occupation 
numbers  of  the  gauche  minimum  ^BR  state  are 
given  by  1.998,  1.996,  1.063,  0.937,  0.004,  and  0.002 
at  above  same  ordering,  showing  that  two  radical 
orbitals  are  concentrated  to  HOMO  and  LUMO. 
Similarly,  for  the  s-cis  Vtt-BR  state,  the  occupation 
numbers  of  HOMO  -  1,  HOMO,  LUMO,  and 
LUMO  -h  1  are  1.997,  1.0,  1.0,  and  0.003,  even  the 
transition  state  of  the  rotation  motion.  In  the  previ¬ 
ous  work  of  tricentric  bisdesmiphiles  such  as  O3, 
CH2OCH2,  CH2ONH,  and  HNOO  [19a],  it  was 
shown  that  the  CASSCF{2,2}  method,  in  which  the 
occupation  numbers  of  only  HOMO  and  LUMO 
suggest  the  radical  orbitals,  gives  the  qualitatively 
and  quantitatively  correct  description  of  the  sin¬ 
glet  TTTT-radical  states.  Therefore,  the  active  space 
of  two  orbitals  and  two  electrons  are  reasonable  to 
represent  the  1,4-biradical  states  except  for  the 
case  of  the  gauche  ^BR  transition  state. 

Another  optimum  state  with  the  s-trans  geome¬ 
try  was  found  as  shown  in  Figure  7.  This  structure 
corresponds  to  the  anti-first-order  saddle  point 
of  Robb  et  al.  [10b],  Our  structures  obtained  by 
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FIGURE  7.  Optimized  geometrical  parameters  of  the 
s-trans-type  second-order  saddle  point  obtained  by 
UMP2  /  6-31 G*  method  and  UNO  CASSCF{4,4}  /  6-31 G* 
method  in  parenthesis. 


UMP2/6-31G*  are,  however,  a  second-order  sad¬ 
dle  point  on  the  potential  energy  surface  with  two 
imaginary  frequencies  of  1267/  and  37/  cm“h  The 
total  energy  is  decreased  with  rotation  of  the  oxy¬ 
gen  molecule  along  the  C — O  bond  axis,  approach¬ 
ing  to  the  Cl  gauche  transition  state  of  the  first- 
order  saddle  point. 

The  geometrical  parameters  of  CASSCF  shown 
in  Figure  7  is  obtained  by  the  active  space  of  four 
orbitals  and  four  electrons,  since  the  optimized 
geometry  was  not  found  at  the  CASSCF{2,2}  level. 
The  relative  energy  in  Table  I  was  estimated  by 
the  CASSCF{2,2}  method  at  the  geometry  opti¬ 
mized  by  the  CASSCF{4,4}  method.  The  occupa¬ 
tion  numbers  of  the  s-trans  ^BR(SP2)  state  are 
given  by  1.997,  1.832,  1.0,  1.0,  0.168,  and  0.003  at 
same  ordering  to  the  above,  while  the  occupation 
numbers  of  the  s-trans  minimum  ^BR  state  are 
1.997,  1.0,  1.0,  and  0.003  for  HOMO  -  1,  HOMO, 
LUMO,  and  LUMO  4-  1,  respectively. 

REACTION  PATHWAY  THROUGH 

PEREPOXIDE  INTERMEDIATES 

As  shown  in  Figure  8,  two  optimized  structures 
of  the  perepoxide  type  were  found  at  both 
RMP2/6-31G*  and  CASSCF{2,2}/6-31G*  levels. 
One  (a)  is  a  true  minimum  on  the  potential  energy 
surface,  another  (b)  is  a  second-order  saddle  point. 
These  behaviors  are  very  similar  to  those  of 
CASSCF/4-31G  and  STO-3G  by  Robb  et  al.  [10b]. 
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(A)  (B) 

FIGURE  8.  Optimized  geometrical  parameters  of  (a)  perepoxide  and  (b)  perepoxide-like  (second-order  saddle  point) 
obtained  by  UMP2/6-31G*  method  and  CASSCF{2,2} /6-31G*  method  in  parenthesis. 


However,  the  geometrical  parameters  obtained  are 
quite  different  from  each  other  even  at  the  CASSCF 
levels.  For  the  minimum  C^  perepoxide,  the 
CASSCF{2,2}  gives  the  shorter  bond  distances;  for 
example,  1.433  A  of  C — O  and  1.532  A  of  O — O 
for  1.559  and  1.634  A  of  Robb  et  al.  [10b],  respec¬ 
tively.  The  RMP2  method,  which  gives  a  longer 
C — O  bond  distance  and  shorter  O — O  bond  dis¬ 
tance  than  those  of  the  CASSCF  method,  has  also 
similar  tendencies.  On  the  other  hand,  for  the 
second-order  saddle  point,  the  MP2  method  gives 
the  bond  distances  of  1.748  and  1.288  A  of  C — O 
and  0—0,  CASSCF/4-31G  gives  1.821  and  1.467 
A,  respectively. 

As  can  be  seen  from  Table  I,  the  relative  stabili¬ 
ties  of  the  perepoxide  based  on  the  gauche  mini¬ 
mum  ^BR  state  are  surprisingly  conflicting  each 
other  at  the  calculational  levels  and  basis  sets 
employed.  The  minimum  basis  set  by  CASSCF 
gives  positive  relative  energy,  the  double-zeta  ba¬ 
sis  set  gives  negative  one,  and  the  inclusion  of  the 
d-polarization  switches  the  sign  back  to  positive, 
giving  the  relative  energy  nearly  equal  to  the 
gauche  minimum  ^BR  state.  By  the  correction  of 
the  dynamic  electron  correlation,  the  stability  of 
the  perepoxide  falls  into  the  compatibility  with  the 
gauche  minimum  ^BR  state. 


The  perepoxidic  intermediates  have  been  stud¬ 
ied  by  semiempirical  and  ab  initio  molecular  or¬ 
bital  works.  The  CNDO/2  study  [22]  suggested 
that  the  perepoxide  was  a  quasi-intermediate  with 
the  shallow  minimum  on  the  potential  energy  sur¬ 
face.  It  is  also  suggested  by  the  MINDO/3  study 
[8]  that  the  reaction  of  ^©2  +  H2C  =  CH2  pro¬ 
ceeds  through  the  perepoxide  rather  than  the  1,4- 
biradical  intermediates.  The  MNDO  study  [10b] 
indicated  that  the  perepoxide  is  a  local  minimum 
on  the  potential  energy  surface  with  the  higher 
energy  by  32.6  kcal/mol  than  the  dissociation  limit 
of  ^02  +  H2C  =  CH2.  On  the  other  hand,  the 
extensive  calculations  at  the  GVB-CI  level  with  the 
DZP  basis  set  by  Harding  and  Goddard  [7]  sup¬ 
ported  the  biradical  mechanism  from  the  result 
that  the  perepoxide  was  located  as  a  minimum 
with  higher  energy  than  biradical  minimum  by 
about  8  kcal/mol.  The  ab  initio  MO  MCSCF  and 
CCI  study  of  Roos  et  al.  [9]  concluded  that  the 
perepoxide  is  a  preferable  reaction  pathway  rather 
the  1,4-biradical  intermediates.  At  last,  we  can  see 
the  results  by  Robb  [10b]  shown  in  Table  I. 

It  is  apparent  from  above  discussion  that  the 
stability  of  the  perepoxide  is  very  sensitive  to  the 
basis  sets  and  calculational  levels  employed  in 
describing  the  electronic  structure.  This  means  that 
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the  relative  stability  to  the  1,4-biradical  states  and 
the  reasonable  reaction  pathway  are  not  conclusive 
within  the  previous  investigations  and  our  calcula- 
tional  levels,  indicating  that  more  high-quality  cal- 
culational  levels  are  necessary  to  elucidate  the 
clear  reaction  mechanism  of  ^©2  +  H2C  =  CH2. 
These  will  be  discussed  later  in  this  text. 

1,2-DIOXETANE  AS  THE 

REACTION  PRODUCT 

Figure  9  shows  the  most  stable  geometries  of 

1,2-dioxetane  optimized  at  both  RMP2/6-31G*  and 
CASSCF{2,2}/6-31G*  levels  with  all  positive  vi¬ 
brational  frequencies.  This  structure  has  the  C2 
symmetry  with  twisted  angles  of  19.2°  and  10.1°  at 
RMP2  and  CASSCF  levels,  respectively,  being  con¬ 
sistent  with  that  of  Robb  et  al.  [23].  On  the  other 
hand,  the  RHF/6-31G*  gives  the  structure  with 
C2J;  symmetry  as  a  local  minimum  on  the  potential 
energy  surface  with  the  lowest  positive  frequency 
of  54  cm" \  as  shown  in  Figure  10.  The  RMP2  and 
CASSCF  methods  have  also  the  similar  structures 
shown  in  Figure  10.  These  states  are,  however,  the 
first-order  saddle  point  with  single  imaginary  fre¬ 
quency  and  have  higher  energy  by  0.68  and  0.01 
kcal/mol  than  the  state  of  the  C2  symmetry  at 
RMP2  and  CASSCF  levels,  respectively,  indicating 
that  the  C — O  bonds  in  the  ground  state  of  the 

1,2-dioxetane  swing  alternately. 

As  shown  in  Table  I,  the  relative  stability  be¬ 
tween  the  ^BR(Gl)  and  the  1,2-dioxetane  at  the 
CASSCF{2,2}/6-31G*  level  gives  comparable  value 


1.521 


to  that  at  the  MP2/6-31G*  level,  being  quite  dif¬ 
ferent  from  27  kcal/mol  of  CASSCF/4-31G  by 
Robb  et  al.  [10b].  The  heat  of  1,2-dioxetane  forma¬ 
tion  at  CASSCF{2,2}  level  is  estimated  to  be  33.7 
kcal/mol  compared  with  12.8  [9]  and  4.0  [10b] 
kcal/mol,  being  reasonable  value  with  30  kcal/mol 
of  thermochemical  estimation.  This  indicates  that 
the  polarization  function  and  correlation  effect  play 
important  roles  for  describing  the  electronic  struc¬ 
ture  of  the  1,2-dioxetane. 

Robb  et  al.  [23]  studied  the  reaction  mechanism 
of  the  chemiluminescent  decomposition  of  1,2-di¬ 
oxetane  by  using  CASSCF/4-31G  method.  They 
gave  the  relative  energies  between  1,2-dioxetane 
and  two  formaldehydes  by  57.1  kcal/mol  at 
CASSCF  level  and  61.4  kcal/mol  at  MP2  level 
using  the  optimized  geometries  of  CASSCF.  As  is 
shown  in  Table  I,  our  relative  energy  is  32.7 
kcal/mol  by  CASSCF{2,2}/6-31G*  method,  being 
smaller  than  their  CASSCF  results.  Flowever,  the 
relative  energy  of  53.5  kcal/mol  at  the  MP2/6- 
31G*  level  is  comparable  with  their  result.  Since 
discussion  of  the  decomposition  of  1,2-dioxetane  is 
not  part  of  this  study,  the  investigation  of  this 
problem  should  be  left  in  future. 

From  Figures  9  and  10,  the  carbon-carbon  bond 
distance  of  1,2-dioxetane  at  the  MP2  level  is  given 
by  1.509  A,  which  is  shorter  than  the  standard 
single-bond  length  of  C — C  single  bond.  The  oxy¬ 
gen-oxygen  bond  distance  is  1.521  A,  being  quite 
longer  than  1.421  A  given  by  geometry  optimiza- 


1.513 

(1.556) 


FIGURE  9.  Optimized  geometrical  parameters  of 
1 ,2-dioxetane  with  C2  symmetry  obtained  by 
RMP2  /  6-31 G*  method  and  CASSCF{2,2}  /  6-31 G* 
method  in  parenthesis. 


FIGURE  1 0.  Optimized  geometrical  parameters  of 
1,2-dioxetane  with  C2V  symmetry  obtained  by 
RMP2/6-31G*,  CASSCF{2,2}/6-31G*  in  parenthesis, 
and  RHF/6-31G*  in  square  brackets. 
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tion  of  RHF/6-31G*,  indicating  that  the  corre¬ 
lation  effect  plays  an  important  role  to  describe  the 
electronic  structure  of  1,2-dioxetane.  The 
carbon-oxygen  bond  distance  also  changes  from 
1.423  to  1.458  A  by  inclusion  of  correlation  effects. 

RELATIVE  STABILITIES  OF  REACTIOIV 
INTERMEDIATES  AT  VARIOUS 

CALCULATIONAL  LEVELS 

Table  III  summarizes  the  relative  energies  of 
reaction  intermediates,  transition  state,  and  prod¬ 
ucts  estimated  by  various  levels  of  theories  using 
the  geometries  optimized  at  the  MP2/6-31G*  level. 
The  relative  energies  of  the  perepoxide  are  surpris¬ 
ingly  sensitive  to  the  calculational  levels  of  theo¬ 
ries. 

Compared  with  the  energy  barrier  height  of  18.0 
kcal/mol  from  ^BR(G1)  to  *BR(G  TS)  by  the  UHF 
method,  the  APUHF  method  increases  to  28.2 
kcal/mol,  while  the  CASSCF  method  with  the 
nondynamic  electron  correlation  does  not  almost 
increase,  that  is,  19.9  kcal/mol.  Though  the  dy¬ 
namic  electron  correlation  is  important  to  describe 
more  precisely  the  singlet  biradical  states,  the  rela¬ 
tive  energies  of  UMP2  and  UMP4  are  not  almost 
changed.  The  relative  energy  by  the  APUHF, 
APUMP2,  and  APUMP4  methods  are  largely  in¬ 
creased  as  well  as  the  CASMP2.  The  coupled-clus¬ 
ter  methods  decrease  the  barrier  height  to  9.9  and 
6.7  kcal/mol  by  CCSD  and  CCSD(T),  respectively. 

On  the  other  hand,  as  is  shown  in  Table  III,  the 
perepoxide  is  largely  fluctuated  by  the  calcula¬ 
tional  levels.  Compared  with  results  of  the  RHF 


method,  the  nondynamic  and  dynamic  electron 
correlation  decreases  the  relative  energies  between 
^BR(Gl)  state  and  the  perepoxide.  At  the  MP2  and 
MP4  levels,  the  stabilities  of  the  perepoxide  are 
compatible  with  the  ^BR(G1)  state.  The  coupled- 
cluster  method  unstabilized  the  perepoxide  to  6-8 
kcal/mol. 

The  relative  stabilities  of  the  1,2-dioxetane  are 
not  so  sensitive  to  the  calculational  levels,  falling 
into  between  40  and  52  kcal/mol.  However,  the 
CASMP2  method  decreases  it  to  about  33  kcal/mol. 
The  energy  differences  between  DO  and  two 
formaldehyde  are  nearly  equal  to  50-60  kcal/mol 
except  for  the  CASSCF  and  CASMP2  methods. 

It  is  apparent  from  Table  III  that  the  reaction 
pathways  from  the  singlet  oxygen  plus  ethylene  to 
1,2-dioxetane  will  be  differently  described  by  the 
calculational  levels.  In  the  UHF  and  CASSCF 
methods,  the  reaction  pathway  through  1,4-biradi- 
cal  states  is  preferable  rather  than  through  the 
perepoxide  intermediate  due  to  the  greater  stabil¬ 
ity  of  the  BR  state  than  the  perepoxide.  At  the  MP2 
level,  though  the  transition  state  from  ^BR(Gl) 
state  or  perepoxide  to  the  1,2-dioxetane  is  not 
found,  two  reaction  pathways  might  be  competi¬ 
tive  with  each  other.  At  the  coupled-cluster 
method,  the  ^BR(TS)  state  and  the  perepoxide, 
which  is  a  local  minimum  on  the  potential  energy 
surface,  are  iso-energetics,  indicating  that  the  pos¬ 
sibility  of  reaching  the  singlet  BR(Gl)  state  is  higher 
than  the  perepoxide  intermediate. 

As  was  examined  previously  [6d,  13,  19a]  a 
potential  energy  curve  for  the  dissociation  process 
of  covalent  bond  in  a  bicentric  system  is  divided 


TABLE  III _ 

Relative  stabilities  (kcal  /  mol)  of  reaction  intermediates  estimated  by  various  levels  of  theory  based  on  the 
geometries  optimized  by  RMP2  or  UMP2  using  6-31 G*  basis  set. 


Methods  \  States 

^BR  (G1)^ 

^BR  (G  TS) 

Perepoxide 

DO 

2H2C  =  0 

R(U)HF 

0.0 

18.0 

30.4 

-11.5 

-75.7 

APUHF 

0.0 

28.2 

CASSCF 

0.0 

19.9 

4.9 

-51.0 

-82.1^ 

R(U)MP2 

0.0 

19.9 

-0.6 

-51.5 

-105.0 

APUMP2 

0.0 

34.6 

R(U)MP4 

0.0 

16.4 

2.2 

-46.9 

-99.9 

APUMP4 

0.0 

29.3 

CCSD 

0.0 

9.9 

8.1 

-40.9 

-93.7 

CCSD(T) 

0.0 

6.7 

5.7 

-43.5 

-95.1 

CASMP2 

0.0 

35.5 

-4.6 

-33.4 

-109.3^ 

CASPT2 

0.0 

17.0 

5.0 

-42.9 

-100.6^ 

^Total  energies  are  given  in  footnote  in  Table  II. 

^The  formaldehyde  dimer  with  C  —  C  distance  of  10  A  was  calculated. 
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into  three  characteristic  regions:  (I)  stable  region, 
(II)  intermediate  (transition)  region,  and  (III)  un¬ 
stable  (magnetic)  region.  As  is  well  known,  the  full 
configuration  interaction  (FCI)  provides  a  reason¬ 
able  potential  curve  over  the  whole  region.  The 
RHF  and  RMP  methods  can  be  applied  to  species 
in  region  I,  but  they  usually  give  an  unreasonable 
result  in  region  III.  The  RHF  coupled-cluster  (RCC) 
method  is  suitable  to  species  in  region  I,  usually 
reliable  in  region  II,  and  unreasonable  in  region  III. 
On  the  other  hand,  the  UHF  method  provides  a 
proper  dissociation  curve,  but  it  suffers  the  so- 
called  spin  contamination  error,  giving  the  higher 
energy  than  FCI  over  the  whole  region.  The  UMPn 
method  provides  the  potential  curve  close  to  FCI, 
but  usually  fails  for  the  description  of  species  in 
region  IT  However,  the  potential  curve  and  wave 
function  presented  by  the  UHF  coupled-cluster 
(UCC)  should  be  very  close  to  that  by  FCI  if  the 
exponential  part  is  taken  to  involve  sufficient 
higher  excitation  operators.  In  fact,  the  spin  con¬ 
tamination  errors  on  the  potential  curve  were  not 
serious  even  at  the  truncated  UCCSIXT)  level  in 
the  bicentric  systems  examined  before  [14]. 

It  seems,  therefore,  that  the  coupled-duster 
method  is  reliable  and  necessary  to  describe  the 
reactions  including  the  unstable  or  radical  inter¬ 
mediate.  The  perepoxide  and  1,2-dioxetane  con¬ 
structed  of  the  covalent  bond  should  be  located  in 
region  I,  while  the  singlet  biradical  intermediates 
and  transition  state  with  strong  biradical  character¬ 
istics  are  in  region  III.  Therefore,  the  coupled-clus¬ 
ter  method  is  reliable  and  necessary  to  describe  the 
reactions,  including  the  unstable  or  radical  inter¬ 
mediate,  It  is  concluded  from  Table  III  that  the 
singlet  oxygen  reaction  to  ethylene  molecule  will 
proceed  by  a  two-step  mechanism  through  1,4-bi¬ 
radical  intermediates,  though  the  transition  state 
should  be  located  by  coupled-cluster  treatment  or 
more  sophisticated  method. 


Conclusion 

The  reaction  mechanism  and  relative  stabilities 
of  the  intermediates  and  transition  states  in  the 
reaction  of  ethylene  molecule  plus  ^02  using  ab 
initio  MO  theories  at  several  levels  of  theory  with 
the  correction  of  the  nondynamic  and  dynamic 
electron  correlation  effects  were  systematically  in¬ 
vestigated.  Full  geometry  optimizations  of  the  cor¬ 
responding  biradical  (BR)  intermediates,  perepox¬ 


ide  (PE),  and  1,2-dioxetane  (DO)  were  performed 
by  CASSCF{2,2}/6-31G*  method  with  nondy¬ 
namic  electron  correlation  effect  and  RHF  or  UHF 
(R  or  U)MP2/6-31G*  method  with  dynamic  elec¬ 
tron  correlation  effect. 

Four  optimized  geometries  of  s-cis,  two 
gauche's,  and  s-trans  types  in  the  ^BR  states  were 
found  by  both  UMP2/6-31G*  and  CASSCF 
{2,2} /6-31G*  methods,  possessing  biradical  char¬ 
acteristics  with  terminal  methylene  moiety  and 
terminal  oxygen  atom.  One  of  two  gauche's  and 
s-trans  types  are  local  minimums  on  the  potential 
energy  surface.  For  the  rotation  around  the  C — O 
bond  axis,  four  ^BR  states  are  smoothly  cormected 
with  the  rotation  barrier  less  than  4  kcal/mol, 
indicating  that  the  free  rotation  easily  occurs  in  the 
singlet  state.  One  of  two  gauche  types,  which  was 
found  as  a  new  geometry  of  1,4-biradical  singlet 
state,  and  s-cis-type  geometry  are  the  transition 
states  of  the  rotation  motion. 

The  Cl  gauche  geometry  was  also  found  as  a 
transition  state  on  the  potential  surface.  The  IRC 
study  showed  that  it  connects  smoothly  to  the 
gauche  ^BR  minimum  state  for  the  reverse  direc¬ 
tion  and  the  dissociation  limit  of  ^©2  +  H2C  = 
CH2  for  the  forward  direction.  It  is,  therefore, 
concluded  that  the  reaction  of  ^02  +  H2C  =  CH2 
through  the  1,4-biradical  intermediates  initially 
proceeds  through  the  gauche  transition  state  to 
form  the  gauche  ^BR  minimum  state,  following 
that  the  free  rotation  of  O2  moiety  along  the  C — O 
bond  axis  occurs  due  to  the  energy  barrier  less 
than  4  kcal /mol. 

The  relative  stabilities  of  four  1,4-biradical 
states,  the  transition  states,  perepoxide,  and  1,2-di¬ 
oxetane  were  estimated  again  by  the  APUHF  [13], 
APUMPn  in  =  2,4)  [6],  CCSD,  CCSD(T)  [6d,  14], 
CASMP2  [15],  and  CASPT2  [12]  methods  at  the 
geometries  optimized  at  both  CASSCF{2,2}  / 6-31G 
and  MP2/6-31G*  levels.  The  APUHF  gives  the 
same  ordering  of  the  relative  stabilities  for  four 
1,4-biradical  states  at  the  geometries  optimized  by 
CASSCF{2,2}  and  UMP2  methods,  while  the 
APUMP  (n  =  2,4)  provides  the  reverse  stabilities 
for  1,4-biradical  states.  It  may  be,  therefore,  noted 
that  both  APUHF  and  APUMP  methods  are  suit¬ 
able  and  unsuitable  to  describe  the  qualitative 
pictures  of  the  species  and  quantitative  pictures, 
respectively.  The  description  of  the  perepoxide  is 
very  sensitive  for  basis  sets  employed  and  levels  of 
theory,  causing  the  difficulties  determining  the 
reaction  mechanism  of  ^©2  +  H2C  =  CH2.  For  the 
description  of  the  1,2-dioxetane,  the  correction  of 
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the  dynamic  electron  correlation  such  as  MP2 
treatment  is  necessary  at  least. 

As  is  mentioned  in  the  previous  section,  the 
RHF  coupled-cluster  (RCC)  method  is  suitable  for 
species  in  the  stable  region  I  and  usually  reliable 
in  intermediate  region  11.  The  potential  curve  and 
wave  function  presented  by  the  UHF  coupled- 
cluster  (UCC)  should  be  very  close  to  that  by  FCI 
over  the  whole  regions  [6d,  12].  It  seems,  therefore, 
that  the  CCSD  and  CCSD(T)  methods  are  neces¬ 
sary  to  elucidate  the  reaction  mechanism  including 
the  unstable  and  radical  intermediates.  From  the 
data  of  the  CCSD  and  CCSD(T)  methods,  the  reac¬ 
tion  of  ^©2  +  H2C  =  CH2  proceeds  by  a  two-step 
mechanism  through  the  1,4-biradical  intermediates 
rather  than  through  the  perepoxide. 
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ABSTRACT:  Molecular  dynamics  simulations  of  clusters  of  Li'*'  and  Be^"^  cations 
with  up  to  12  water  molecules  were  performed  calculating  the  particle  trajectories  using 
Hartree-Fock-derived  forces.  It  was  found  that  independent  of  the  starting  configuration 
tetrahedral  clusters  are  obtained  in  less  than  1  ps.  Only  for  Li'^  clusters,  transition  states 
with  five  and  three  water  molecules  are  found.  We  discuss  the  structures  of  the  clusters 
and  the  vibrational  dynamics  of  the  water  molecules  and  compare  them  with  other  ab 
initio  simulations,  with  simulations  using  analytical  potential  functions,  and  with  static 
calculations.  Water-dissociation  (hydrogen  transfer)  reactions  which  take  place  at  elevated 
temperatures  were  investigated.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  803-816, 
1997 


Introduction 

The  hydration  of  ions  plays  a  key  role  in  many 
processes  occurring  in  solution  chemistry  and 
is  of  great  importance  in  many  areas  of  physics. 
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biochemistry,  and  biology.  The  molecular  dynam¬ 
ics  (MD)  method  has  established  itself  as  a  valu¬ 
able  tool  to  obtain  insight  into  the  properties  of 
aqueous  solutions  and  hydrated  ions  at  a  micro¬ 
scopic  level  [1-3]. 

In  this  article,  we  report  structural  and  dynami¬ 
cal  results  from  MD  simulations  of  clusters  of  Li'^ 
and  Be^’^  cations  with  water  molecules.  Many- 
body  effects  were  incorporated  into  the  simulation 
by  using  energies  and  forces  from  Hartree-Fock 
wave  functions  to  propagate  the  trajectories.  Due 
to  the  strong  interactions  caused  by  the  small  and 
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highly  charged  cation,  aqueous  Be^“^  clusters  are 
ideal  systems  for  studying  strong  nonadditive  in¬ 
teractions.  Comparisons  are  made  with  clusters  of 
the  singly  charged  Li'^  cation  for  which  nonaddi¬ 
tive  effects  are  less  important  but  for  which  only  a 
few  theoretical  studies  on  clusters  exist. 

Experimental  [4-6],  quantum  chemical  [7-9], 
and  simulation  [6,  10,  11]  studies  on  aqueous  salt 
solutions  containing  Be^^  all  report  or  predict 
a  coordination  number  of  four  for  not-too- 
concentrated  solutions.  For  example,  O-NMR  [5] 
experiments  on  a  Be(C104)2  solution  at  295  K  gave 
a  coordination  number  of  4.1  ±  0.2. 

It  has  been  shown  by  ab  initio  calculations  of 
clusters  [7-9]  that  the  high  charge  density  on  the 
Be^"^  ion  leads  to  strong  nonadditive  interactions. 
A  comparison  of  classical  molecular  dynamics 
simulations  with  pair  potentials  only  and  with 
three-body  effects  incorporated  [10,11]  showed 
that  in  correctly  determining  the  coordination 
number  nonadditive  interactions  play  a  crucial 
role. 

Recently,  calculations  on  Be^^(H20)j^  systems 
were  also  carried  out  using  the  so-called 
Car-Parinello  (CP)  molecular  dynamics  method 
by  Marx  et  al.  [12-14].  Their  results  are  compared 
with  our  findings  in  more  detail  in  the  subsequent 
sections. 

Li"^  ions  in  water  were  among  the  first  molecu¬ 
lar  liquids  studied  in  computer  simulations  [15] 
and  there  exist  many  MD  simulation  studies  of  Li 
ions  in  various  concentrations  [16-18].  While  the 
coordination  number  of  Li"^  in  dilute  aqueous 
solutions  is  still  inconclusive,  most  of  the  newer 
studies  assume  a  sixfold  coordination  to  be  most 
likely. 

In  the  following  sections  we  discuss  at  first  the 
details  of  our  calculations.  Then,  structural  proper¬ 
ties  of  the  clusters,  simulation  experiments  per¬ 
formed  in  order  to  study  the  dissociation  of  water 
molecules,  and  vibrational  properties  are  dis- 
cussed.  Finally,  we  compare  the  method  used  in 
the  present  work  with  alternative  ones  and  sum¬ 
marize  our  conclusions. 


Details  of  Calculations 

In  most  MD  simulations,  analytical  potentials 
are  employed  to  derive  the  interparticle  forces  and 
most  of  the  analytical  potentials  assume  a  pairwise 
additivity  of  the  interactions,  at  least  between  un¬ 


bonded  atoms.  While  this  is  often  a  good  approxi¬ 
mation,  it  becomes  inaccurate  or  even  fails  in  cases 
where  effects  like  polarization,  charge  transfer,  or 
bond-breaking/bond  formation  are  relevant.  In 
principle,  analytical  potentials  can  be  constructed 
to  account  also  for  such  effects  and  many  efforts 
have  been  made  along  these  lines  [19-22].  An 
alternative  that  became  viable  in  recent  time  is  to 
employ  quantum  chemical  methods  to  treat  the 
system  as  a  whole  when  calculating  the  forces 
acting  on  the  particles. 

For  the  MD  simulations  reported  here,  the  parti¬ 
cle  trajectories  were  calculated  via  forces  from 
Hartree-Fock  Hamiltonians  with  small  basis  sets. 
The  wave  function  was  optimized  at  every  time 
step.  The  equations  of  motion  were  solved  using  a 
velocity- Verlet  algorithm  [23]  and  all  degrees  of 
freedom  were  kept  active.  For  the  constant- 
temperature  simulations,  the  Berendsen  thermo¬ 
stat  [24]  was  used.  A  time  step  of  0.3  fs  was 
sufficient  to  resolve  also  the  hydrogen  motions 
and  a  typical  MD  run  was  performed  for  6000  time 
steps,  corresponding  to  1.8  ps  elapsed  time.  One 
time  step  of  a  complex  with  12  water  molecules 
took  about  180  s  on  a  single  MIPS  8000  processor 
of  an  SGI  power  challenge  computer.  The  time  is 
almost  exclusively  spent  in  the  force  calculation 
while  the  time  for  obtaining  the  new  positions  and 
velocities  is  negligible.  The  forces  were  obtained 
from  a  slightly  modified  Gaussian  94  computer 
code  [25]  which  we  interfaced  to  our  MD  routines. 

THEORETICAL  LEVEL 

In  principle,  any  suitable  wave  function — or 
density  functional-based  method  for  which  energy 
gradients  are  available — can  be  used  in  an  ab 
initio  MD  approach.  We  decided  to  work  at  the 
Hartree-Fock  (HF)  level  since  for  the  systems 
studied  here  electron  correlation  plays  a  minor 
role  and  a  comparison  of  the  computer  codes  ac¬ 
cessible  to  us  showed  the  calculations  of  forces 
to  be  about  twice  as  fast  for  HF  as  for  density 
functional-based  methods  like  B3LYP  [26,  27]  or 
for  MP2.  Density  functionals  without  gradient  cor¬ 
rection  are  considerably  faster  but  they  fail  to 
reproduce  hydrogen  bonding  correctly. 

To  be  able  to  handle  systems  with  two  hydra¬ 
tion  shells,  we  are  restricted  to  use  rather  small 
basis  sets.  For  a  comparison  of  basis-set  quality, 
we  carried  out  geometry  optimizations  on  the  sys¬ 
tems  Be^'^(H20)„  (n  =  1-4)  and  list  the  results  in 
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Table  L  (The  binding  energy  per  water  molecule  is 
given  by  A£  =  [ £([Be(OH2)J^^)  -  £(Be^+)  - 
n£(H20)]/n). 

The  CEP  [29]  basis  set  in  single-zeta  contraction 
compares  energetically  well  with  larger  ones.  On 
the  HE  level,  it  gives  A  £  values  of  -136.6,  -129.6, 
—116.3,  and  —103.1  kcal/mol  for  the  mono-  to 
tetrahydrates,  compared  with,  e.g.,  -135.6,  -128.0, 
-107.9,  and  -96.3  kcal/mol  for  the  ''DZP"  basis 
set  and  it  is  (accidentally)  even  closer  to  the  6- 
311G**  basis  at  the  MP2  level  (-147.3,  -136.7, 
-119.0,  and  —103.2  kcal/mol).  We  found  that  small 
double-zeta  basis  sets  like  3-21G  give  binding 
energies  somewhat  too  high. 

Table  I  also  shows  that  the  CEP  basis  set  in  a 
single-zeta  contraction  leads  to  Be — O  distances 
about  0.07  A  too  long.  Also,  the  O — H  distances 
are  slightly  too  long.  These  errors  stay  constant 
during  the  series  of  hydrates.  The  use  of  CEP  in  a 
double-zeta  contraction  would  have  improved 


these  distances  at  the  expense  of  a  fivefold  in¬ 
crease  of  computing  time.  The  H — O — H  angle  is 
in  good  agreement  with  more  expensive  basis  sets 
and  methods.  We  finally  decided  to  use  the  CEP 
basis  set  in  a  single-zeta  contraction  for  the  MD 
simulations.  Table  II  lists  the  results  for  the 
Li^(H20)„  (n  =  1-4)  clusters  with  the  CEP  basis 
set.  Table  III  shows  the  harmonic  vibrational  fre¬ 
quencies  for  Be^'^(H20)„  (n  =  1-4)  compared  to 
two  other  basis  sets  and  methods.  The  shifts  in  the 
O — H  stretching  vibrations  are  the  largest  for  n  = 
1  and  decrease  then  with  increasing  charge  trans¬ 
fer  to  the  cation.  Most  relevant  are  the  differences 
between  the  O — H  stretching  vibrations  of  the 
isolated  H2O  molecule  and  the  fourfold  coordi¬ 
nated  clusters.  For  the  CEP  basis  set,  the  slight 
average  upshift  of  the  symmetric  (^  10  cm”^)  and 
the  large  downshift  of  the  asymmetric  stretching 
mode  (~— 290  cm“^)  are  in  reasonable  agreement 
with  the  3-21G  values  and  also  with  shifts  found 


TABLE  I 


Binding  energies  and  optimized  geometries  of  with  n  =  1-4. 


Method  and 

basis  set 

n 

X 

1  0^ 

0 

ZHOH 

(degree) 

r Be— 0  (A) 

A£ 

(kcal  /  mol) 

Ref. 

HF 

CEP-4G 

1 

1.033 

107.5 

1.635 

-136.6 

This 

2 

1.027 

108.3 

1.644 

-129.6 

work 

3 

1.020 

108.9 

1.682 

-116.2 

4 

1.014 

108.8 

1.736 

-103.1 

HF 

DZP 

1 

0.976 

108.0 

1.536 

-135.6 

[7] 

2 

0.968 

108.3 

1.537 

-128.0 

3 

0.961 

107.9 

1.613 

-107.9 

4 

0.956 

108.0 

1.662 

-96.3 

HF 

3-21 G 

1 

0.999 

108.7 

1.517 

-169.2 

[8] 

2 

0.991 

109.6 

1.533 

-158.0 

3 

0.981 

110.2 

1.585 

-140.0 

4 

0.975 

110.6 

1.645 

-123.1 

HF 

3-21 G* 

1 

0.999 

108.0 

1.503 

-161.3 

[8] 

2 

0.990 

108.4 

1.528 

-149.0 

3 

0.980 

108.4 

1.580 

-130.8 

4 

0.974 

108.2 

1.648 

-114.1 

DFT 

Plane  wave 

1 

0.991 

108.0 

1.490 

-149.6 

[12] 

basis  set 

2 

0.982 

108.7 

1.525 

-140.2 

-KECP 

3 

0.974 

108.6 

1.585 

-127.9 

4 

0.970 

108.6 

1.668 

-117.7 

MP2 

6-31 1G** 

1 

0.991 

108.0 

1.507 

-147.3 

[28] 

2 

0.983 

108.1 

1.526 

-136.7 

3 

0.974 

107.8 

1.579 

-119.0 

4 

0.970 

107.0 

1.668 

-103.2 
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TABLE  II 


Hartree- 

-Fock  energies  and  optimized  geometries  of  Li^CHgO)^  with  n  =  ‘ 

1-4  (CEP  basis  set). 

n 

^0— H 

(A) 

^HOH 

(degree) 

^Li  — 0 

(A) 

AE 

(kcal  /  mo!) 

1  (C2,) 

1.0090 

107.2 

1.945 

-41.7 

2  (D2,) 

1.0081 

107.5 

1.961 

-39.9 

3(03) 

1 .0074 

107.9 

1.994 

-36.6 

4  (Dg  J 

1 .0067 

108.0 

2.043 

-32.9 

for  other  cations  [31].  Interestingly,  the  reported 
shifts  from  the  DFT  calculations  with  a  plane-wave 
basis  set  [12]  are  zero. 


Structural  Results 

In  contrast  to  Be^'^  where  tetrahedral  coordina¬ 
tion  is  expected  if  nonadditive  effects  are  incorpo¬ 
rated  in  the  simulations,  this  is  not  so  clear  for  Li"^. 
For  example,  a  neutron  diffraction  study  of  a  3.57 
m  LiCl  solution  resulted  in  a  coordination  number 
of  5.5  ±  0.3  [32]  and  MD  simulations  of  aqueous 
solutions  at  different  concentrations  but  with  pair 
potentials  also  seem  to  indicate  that  in  infinitely 
diluted  systems  an  octahedral  tetrahedral  coordi¬ 
nation  is  preferred  [16-18].  On  the  other  hand, 
simulations  on  the  system  Li'^/Fl20  with  nonad¬ 


ditive  potentials  [33,  34]  resulted  in  tetrahedral 
coordination  of  the  cation. 

We  performed  simulations  with  different  initial 
configurations.  Independent  of  the  initial  state,  te- 
trahedrally  coordinated  complexes  are  obtained. 
The  same  starting  configurations  for  Be^'^(H20)„ 
and  Li'^(H20)„  (n  =  6,8,10,12)  were  used.  The 
simulations  were  performed  at  300  K  using  a 
Berendsen  thermostat  [24]  for  equilibration. 

Figure  1  shows  the  different  starting  configura¬ 
tions  of  the  Be^"^  clusters.  In  the  center  of  the 
figure,  a  representative  configuration  resulting 
from  the  simulations  Is  shown.  The  tetrahedral 
coordination  of  the  oxygen  atoms  of  the  inner-shell 
water  molecules  (plotted  in  black)  can  be  seen 
clearly.  This  particular  configuration  originates 
from  the  initial  structure  Be^'^(H20)2(H20)8  (c) 
but  the  other  three  starting  configurations  would 


TABLE  III _ - _ 

Harmonic  frequencies  of  the  most  important  modes  in  the  Be^^CHgO)^  ciusters  with  n  =  0-4. 


HgO 

n  =  ^ 

n  =  2 

n  =  3 

n  =  4 

Mode 

CEP 

(this 

work) 

3-21 G 
[8] 

Plane 

wave 

[12] 

CEP 

(this 

work) 

3-21 G 
[8] 

Plane 

wave 

[12] 

CEP 

(this 

work) 

3-21 G 
[8] 

Plane 

wave 

[12] 

CEP 

(this 

work) 

3-21 G 
[8] 

Plane 

wave 

[12] 

CEP 

(this 

work) 

3-21 G 
[8] 

Plane 

wave 

[12] 

Be— (OH2) 
bending 

888- 

913 

944- 

955 

710- 

816 

831- 

905 

866- 

950 

640- 

840 

718- 

912 

751- 

938 

580- 

740 

636- 

783 

657- 

785 

490- 

600 

Be— 0 
stretch 

963 

1126 

1060 

1213 

1376 

1210 

1028 

1129 

980 

877- 

982 

1007- 

1018 

830 

H— 0— H 
bending 

1727 

1799 

1610 

1752 

1820 

1700 

1754 

1824- 

1827 

1730 

1761- 

1764 

1834- 

1840 

1730 

1767- 

1783 

1830- 

1840 

1730 

0— H 
sym.  str. 

3900 

3812 

3710 

3676 

3500 

3490 

3742 

3588- 

3610 

3600 

3835- 

3853 

3703- 

3726 

3680 

3903- 

3919 

3775- 

3796 

3710 

0— H  4246 

asym.  str. 

3945 

3830 

3709 

3511 

3520 

3803 

3630 

3620 

3933- 

3935 

3764- 

3966 

3750 

4033- 

3936 

3853- 

3857 

3830 

The  experimental  values  for  HgO  are  1595,  3657,  and  3756  cm"’'  for  the  anharmonic  and  1648,  3832,  and  3943  cm  ^  for  the 
harmonic  frequencies  [30].  For  the  larger  complexes,  the  lowest  and  highest  frequency  of  a  set  of  modes  are  given,  if  available. 
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FIGURE  1.  Different  starting  configurations  of  the  Be^’'  (respectively,  Li ■')-water  clusters:  (a)  Be^''(H20)6; 

(b)  Be^''(H20)8;  (c)  Be^’^(H20)2(H20)8;  (d)  Be^‘'(H20)4(H20)8;  (center)  atypical  Be^''(H20)i2  configuration  resulting 
from  (c).  Black  spheres  represent  oxygen  atoms  of  water  molecules  in  the  first  hydration  shell. 


lead  to  similar  final  configurations  (as  is  symbol¬ 
ized  by  the  four  arrows  in  Fig.  1),  qualitatively 
only  differing  in  the  number  of  second-shell 
water  molecules.  Configurations  with  Li''  as  a 
central  cation,  on  average,  show  the  same  fourfold 
coordination. 

First,  we  discuss  the  simulations  involving 
beryllium,  then  the  ones  with  lithium  cation: 

Be 

(a)  The  initial  configuration  denoted  (a) 
in  Figure  1  is  a  Be^''(H20)6  cluster  which  was 
obtained  from  the  Tf^  symmetrically  optimized 
geometry  [8]  and  which  was  also  used  in  [12]. 
There,  it  was  found  that,  starting  from  an  octa¬ 
hedral:  cluster  at  20  K,  a  fourfold  hydrated  struc¬ 
ture  is  obtained  after  0.2  ps.  We  found  that  a 
similar  tetrahedral  structure  is  reached  after  0.1  ps 


(by  slightly  displacing  the  cation  from  the  origin, 
we  had  removed  the  symmetry  from  the  system). 
Then,  we  carried  out  equivalent  simulations  with 
Be^''(H20)8  (b),  a  hexahedral  initial  configuration 
(eight  water  molecules  cubically  arranged  in  the 
first  shell),  with  Be^''(H20)io  (c),  two  water 
molecules  in  the  first  shell  and  eight  in  the  second 
and  with  Be^''(H20)i2  (d),  four  water  molecules  in 
the  first  and  eight  water  molecules  in  the  second 
shell.  All  these  simulations  resulted  in  fourfold 
coordinated  first  hydration  shells  after  similar 
times  ( 0.1  ps).  No  exchange  of  water  molecules 
between  the  first  and  the  second  hydration  shell 
was  observed  after  this  time. 

A  quantitative  picture  of  the  solution  structure 
can  be  obtained  from  the  radial  distribution  func¬ 
tions.  In  the  following  discussion,  we  refer  to 
Be^*'(H20)i2  unless  mentioned  otherwise.  Because 
finite  clusters  are  investigated,  the  radial  distribu- 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


807 


BISCHOF  ET  AL. 


tion  functions  are  not  normalized  by  as  is 
usually  done  for  periodic  ^sterns.  The  first  maxi¬ 
mum  of  1.72  A  (Fig.  1,  solid  line)  can 

be  compared  to  1.65  A  [13]  (CP  simulation)  and 
1.75  A  (MD  study  with  a  threebody  potential  [11]). 
The  value  obtained  from  the  X-ray  diffraction  study 
[6]  of  a  5.3  molal  aqueous  BeCl2  solution  is  1.67  A. 
The  Be — O  distances  in  crystals  with  fourfold  co¬ 
ordinated  Be^'^  ions  are  shorter  and  lie  between 
1.60  and  1.63  A  [35].  Considering  that  the  CEP 
basis  set  leads  to  an  overestimation  of  the  Be — O 

o 

distance  of  about  0.07  A  (Table  I),  the  values  from 
X-ray  [6]  and  the  CP  simulation  [13]  are  probably 
more  accurate  than  in  the  MD  study  [11]  and  are 
consistent  with  our  data.  The  peak  maximum  of 
1.72  A  can  further  be  compared  with  the  values  of 
the  optimized  cluster  with  (1.714  A,  Table  IV)  and 
without  (1.736  A)  the  eight  second-shell  water 
molecules,  showing  that  the  dynamic  motion  has  a 
smaller  influence  on  the  average  Be — O  distance 
than  on  the  second  hydration  shell. 

The  second  maximum  of  gge— o  ^  devi¬ 

ates  more  from  the  potential  minimum  at  3.957  A. 
For  the  three  systems  with  fewer  water  molecules, 
values  between  3.6  and  4.4  A  are  obtained.  The 
first  and  second  peaks  of  gee— o  completely 
separated  and  are  well  defined,  similar  to  what 
was  found  in  the  MD  study  [11]  with  periodic 


boundaries.  These  data  cannot  be  compared  fur¬ 
ther  to  the  CP  simulation  study  [13]  because  the 
small  periodic  box  of  about  10  A  containing  31 
water  molecules  in  [13]  already  disturbs  the  sec¬ 
ond  hydration  shell,  leading  to  an  unstructured 
gBe— o(^)  curve  beyond  3.5  A. 

The  Be — H  distribution  function  has  its  first 
maximum  at  2.42  A  (Fig.  2,  dotted  lines).  This  is 
again  about  0.07  A  more  than  the  value  obtained 
in  [13].  It  is  again  close  to  the  values  of  the  poten¬ 
tial  minimum  (2.457  A),  indicating  that  the  water 
molecules  in  our  simulation  are,  on  average,  only 
slightly  tilted  away  from  a  planar  Be — H2O  sub¬ 
configuration. 

The  integral  values  are  also  plotted  in  figure  2 
for  O  and  H  and  show  the  expected  plateaus  for  4 
and  12  oxygen  atoms  or  8  and  24  hydrogens, 
respectively.  The  distribution  of  the  H — O — H  an¬ 
gles  for  the  first  and  second  hydration  shell  have 
maxima  at  111.5°  and  108.3°,  respectively  (opti¬ 
mized  values:  109,9°  and  108.1°)  and  thus  show 
that  the  angle  is  little  influenced  by  the  cation.  The 
distribution  of  the  intramolecular  O — H  distances 
in  the  first  shell  (Fig.  3,  solid  line)  gives  a  peak 
with  a  maximum  between  1.00  and  1.04  A,  nearly 
symmetrically  around  the  potential  minimum  of 
1.023  A.  Compared  to  an  isolated  H2O  molecule, 
the  bonds  are  elongated  by  about  0.01  A.  The 


FIGURE  2.  Be— O  radial  distribution  function  gBe~o(^)  running  coordination  number  nQ^—Qir)  (solid  curves) 
and  Be— H  radial  distribution  function  gge-H^^)  with  running  coordation  number  nQQ_^{r)  (dotted  curves)  of 
Be2‘^(H20)i2. 
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TABLE  IV 

MOLECULAR  DYNAMICS  OF  HYDRATED  Li+  AND 

CATIONS 

Optimized  geometries  of  the  Be 

"+(H20)4(H20)8  and  the  Lr(H20)4(H20)8 

clusters  (HF  with  CEP  basis  set). 

/'O  —  H 

ZHOH 

^Be  — 0 

(A) 

(degree) 

(A) 

Be2  +  (H20)4(H20)8 

First  shell 

1.023 

109.9 

1.714 

Second  shell 

1.012 

108.1 

3.957 

Li2  +  (H20)4(H20)8 

First  shell 

1.008 

108.3 

2.033 

Second  shell 

1.011 

108.0 

4.476 

average  distance  between  the  oxygens  in  the  first 
and  second  hydration  shell  in  our  the  simulation 
(2.35  A)  is  about  0,1  A  longer  than  the  optimized 
value  (2.243  A),  indicating  a  large  dynamic 
contribution. 

Li 

Simulations  with  the  Li^  ion  were  performed 
with  the  same  set  of  starting  configurations 
as  described  above.  A  tetrahedrally  hydrated 
Li^  ion  emerges  as  the  most  stable  subsystem 


but  its  formation  is  much  slower  than  for 
Be^"^  and  we  observed  '"transition  states"  with 
different  coordination  numbers  in  the  initial  states 
of  the  simulations.  It  took  between  0.45  ps  [from 
Li+(H20)2(H20)8]  and  0.70  ps  [from  Li+(H20)8] 
until  the  tetrahedral  complex  was  formed. 

Figure  4  shows  two  examples  of  intermediate 
structures:  (a)  The  complex  with  a  threefold  coor¬ 
dinated  Li"^  was  formed  from  the  Li'^(H20)6  clus¬ 
ter  after  0.16  ps.  (b)  A  complex  with  five  water 
molecules  was  formed  from  Li'^(H20)2(H20)8 
after  0.25  ps.  Similar  complexes  with  three-  and 


FIGURE  3.  Distribution  of  the  intramolecular  O — H  distances  in  the  first  hydration  shells  of  Be^'^(H20)i2  (solid  curve) 
and  Li‘^(H20)i2  (dotted  curve). 
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FIGURE  4.  Snapshots  of  (a)  an  intermediate  state  with  threefold  coordinated  Li"^  after  simulating  for  0.16  ps  from  the 
Li'^(H20)6  configuration  and  (b)  an  intermediate  state  with  fivefold  coordinated  Li"^  after  simulating  for  0.25  ps  starting 
from  Li  +  (H20)2(H20)8. 


fivefold  coordination  were  also  found  for  the  other 
starting  configurations.  Since  we  used  no  periodic 
boundary  conditions  or  any  potentials  confining 
the  particles  in  the  simulations,  water  molecules 
can — and  will  finally— evaporate  from  the  second 
hydration  shell.  In  the  Be^^/H20  system,  the 
strong  attraction  of  the  water  molecules  makes  this 
a  process  too  slow  to  be  observed,  but  for 
Li^/H20,  such  evaporation  is  observed  in  all 
simulations. 

Figure  5  shows  the  radial  distribution  functions 
gLi-o(^)  for  Li+(H20)4(H20)8.  The 

first  peak  of  the  Li — O  distance  distribution  has  its 
maximum  at  2.08  A.  The  corresponding  value  of 
the  potential  minimum  is  2.03  A  (Table  IV).  In 
simulations  studies  with  nona^dditive  potentials 
[33,  34],  values  of  1.98  and  2.0  A^were  found.  The 
Li — O  distances  are  about  0.35  A  longer  than  the 
Be — O  distances,  which  directly  reflects  the  differ¬ 
ent  ionic  radii  of  the  ions  (Li^:  0.59  A  vs.  Be^'^: 
0.27  A)  and  shows  that  the  ion-water  attraction 
and  not  the  water-water  repulsion  are  dominat¬ 
ing.  The  g(r)  curves  decrease  nearly  to  zero  be¬ 
tween  their  first-  and  second-shell  maxima.  The 
first  peak  again  integrates  to  four. 

The  second  hydration  shell  of  Li"^  leads  to  a 
broad  peak  between  4.0  and  5.0  A  with  a^  maxi¬ 
mum  at  4.57  A  (potential  minimum:  4.48  A).  The 


tail  of  the  function  is  caused  by  the  slowly  evapo¬ 
rating  water  molecules.  The  maximum  of  gu  —  n  is 
located  at  2.72  A.  This  is  about  0.15  A  longer  than 
that  foimd  in  a  simulation  carried  out  with  pair 
potentials  and  periodic  boundaries  [36].  From  our 
geometry  optimization,  a  value  of  2.75  A  is  ob¬ 
tained.  The  maxima  of  the  distribution  of  the 
H — O — H  angle  are  found  at  108.5°  and  107.5°  for 
first  and  second  shells,  respectively. 

The  intramolecular  O — H  distance  distribution 
(Fig.  3,  dashed  line)  has  its  maximum  between 
0.993  and  1.020  A  for  the  first-shell  water 
molecules.  They  are  thus  slightly  less  elongated 
than  in  the  complexes  with  Be^"^. 

The  distance  between  the  oxygens  in  the  first 
and  second  hydration  shell  from  the  simulation  is 
about  0.05  A  longer  than  between  the  correspond¬ 
ing  potential  minima.  While  this  is  only  about  half 
of  the  value  found  for  Be^'^,  it  is  still  a  large 
deviation. 


Dissociation  of  Water  Molecules 

Be^'^  can  cause  abstraction  of  hydrogen  from 
water  molecules,  giving  BeOH^.  This  is  the  reason 
why  solutions  of  Be^'^  salts  are  acidic  (a  0.5  molal 
solution  of  BeCl2  has  an  approximate  pFl  value  of 
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FIGURE  5.  Li — O  radial  distribution  function  gLj_o(r')  with  running  coordination  number  n^_^_Q{r)  (solid  curves)  and 
Li — H  radial  distribution  function  with  running  coordination  number  (dotted  curves)  of  LI'^(H20)i2. 


3.5).  Ab  initio  MD  allows  the  simulation  of  such 
dissociation  reactions.  At  room  temperature,  this 
reaction  is  rather  infrequent  in  the  liquid  phase 
and  will  be  even  more  so  in  small  clusters.  In  the 
case  of  elevated  temperatures,  however,  the  reac¬ 
tion  takes  place  more  rapidly.  We  performed  the 
following  experiments: 

(a)  Starting  from  Be^'^(H20)8  with  velocities  cor¬ 
responding  to  20  K,  we  found  water  dissocia¬ 
tion  according  to 

^  Be+(H20)20H 
+  H30^+  4H2O 

after  about  0.17  ps.  The  potential  energy  at  the 
beginning  of  the  simulation  led  to  a  tempera¬ 
ture  of  about  2000  K  during  this  time.  A  typi¬ 
cal  configuration  of  the  products  is  shown  in 
Figure  6. 

(b)  The  acidity  of  Be^"^  can  be  expected  to  be 
much  higher  if  it  is  not  saturated  with  four  (or 
more)  water  molecules.  Indeed,  a  small  model 
system  consisting  of  one  molecule  of  water  in 
the  first  hydration  shell  and  one  hydrogen 
bonded  to  the  first  one  reacts  after  0.3  ps  at 
300  K  according  to 

Be2+(OH2)OH2  ^  BeOH++  H30''. 

(c)  MD  simulations  starting  with  the  equilibrated 
Be^^ (1120)4(1120)8  configuration  were  per- 


FIGURE  6.  Typical  configuration  after  dissociation  of  a 
water  molecule  from  the  Be^'^(H20)8  cluster  after  0.45 
ps  simulation  time. 


formed  at  different  temperatures  with  intervals 
of  about  1000  K.  At  5000  K,  we  observed  water 
dissociation  after  0.45  ps. 

Figure  7  shows  the  distance  between  the  hydro¬ 
gen  atom  which  is  being  transferred  and  both  of 
the  oxygen  atoms  involved  in  the  dissociation  re¬ 
action.  At  start  of  the  simulation,  this  hydrogen 
and  the  oxygen  atom  of  the  water  molecule,  which 
will  be  protonated,  are  about  6  A  apart  (light 
curve).  The  hydrogen  exchange  at  0.45  ps  happens 
within  about  two  cycles  of  the  O — H  vibrations 
and  no  protonated  water  dimer  with  a  shared 
hydrogen  or  a  similar  species  is  formed.  The  vibra- 
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FIGURE  7.  Hydrogen  transfer  in  Be^'^(H20)i2  at  high  temperatures.  The  distance  between  a  hydrogen  atom  and  the 
oxygen  atom  in  the  same  water  molecule  (black  curve)  and  the  oxygen  atom  to  which  the  hydrogen  is  transferred  (light 
curve)  are  shown  as  a  function  of  time. 


tional  amplitudes  of  the  old  and  the  new  O — H 
bonds  are  similar.  After  the  transfer,  the  two  frag¬ 
ments,  OH"  (as  part  of  BeOH'*')  and  H30^,  imme¬ 
diately  move  away  from  each  other.  While  these 
nonequilibrium  simulation  experiments  on  water 
dissociation  are  clearly  not  statistically  representa¬ 
tive  for  water  dissociation,  in  general,  in  clusters 
or  in  the  liquid  phase,  there  are  only  a  few  cases 
where  such  events  have  been  observed  in  simula¬ 
tion  studies. 


Vibrational  Dynamics  of  the 
Water  Molecules 

From  an  experimental  point  of  view,  the  shifts 
of  the  O — H  vibrations  of  water  molecules  which 
become  members  of  ionic  clusters  or  hydration 
shells  are  highly  interesting  since  they  allow  one  to 
probe  the  molecular  environment  [37]. 

From  equilibrated  Be^^(H20)4(H20)g  and 
Li^(H20)4(H20)8  clusters,  simulations  of  about 
5000  time  steps  each  were  performed.  The  veloci¬ 
ties  at  every  time  step  were  used  to  calculate  the 
vibrational  modes.  The  results  are  listed  in  Table 
V.  The  vibrations  of  an  isolated  water  molecule 
calculated  from  a  room-temperature  MD  trajectory 
include  anharmonic  contributions  but  differ  only 
by  a  few  wave  numbers  from  harmonic  values 


obtained  from  the  matrix  of  second  derivatives  at 
equilibrium  geometry.  It  can  be  seen  from  Table  V 
that  even  for  Be^"^ — H2O  the  difference  is 
small  compared  to  the  frequency  shift  upon 
complexation. 

The  MD  frequency  shift  of  the  asymmetric  O — H 
stretching  mode  for  the  Be^^  (H20)4  system  is  -543 
cm"^  and  the  one  of  the  symmetric  O — H  stretch¬ 
ing  mode  is  -244  cm“^  The  corresponding  har¬ 
monic  values  are  —537  and  —224  cm"T  For  com¬ 
parison,  a  Hartree-Fock  study  of  Mg^"^ — H2O  and 
Al^"^ — H2O  [31]  gave  O — H  (decoupled)  fre¬ 
quency  shifts  of  220  cm"^  for  Mg^^  and  670  cm“^ 
for  AP^.  It  seems  reasonable  that  the  OH  shift 
resulting  from  the  Be^^  ion  is  located  between  the 
values  of  Mg^^  and  AP^. 

In  contrast  to  the  Be^^  ion,  Li"^  causes  no  such 
strong  frequency  shift  of  the  OH  stretching  modes 
compared  to  the  single  water  OH  stretching  modes 
(symmetric  stretch:  +54  cm"^;  asymmetric  stretch: 
-54  cm"^)  The  corresponding  value  in  [31]  was 
-70  cm"^  (O — H  uncoupled). 

These  test  calculations  serve  as  a  basis  for  inter¬ 
preting  the  results  for  larger  clusters.  In  Figure  8, 
the  power  spectra  Kv)  as  calculated  from  the 
velocity  vectors  v(t)  of  the  atoms  are  plotted  (in¬ 
tensities  are  arbitrary  and  do  not  reflect  IR 
intensities;  ^  denotes  the  Fourier  transform;  this 
formula  is  equivalent  to  calculating  the  Fourier 
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TABLE  V _ 

Molecular  dynamics-derived  and  harmonic  frequencies  (in  cm  of  the  isolated  water  molecule  and  water 
In  the  Be^  V  H2O  and  Li  "7  HgO  systems  with  1  and  12  water  molecules  (the  given  frequencies  refer  to  the 
molecules  in  bold  typeface). 


H— O— H  O— H  O— H 

bending  sym.  stretch  asym.  stretch 


H20 

Harmonic 

1727 

3900 

4246 

H20 

MD 

1736 

3906 

4232 

Be^+OHa 

Harmonic 

1752 

3676 

3709 

Be^+OHa 

MD 

1736 

3662 

3689 

irOHa 

Harmonic 

1782 

3955 

4152 

Li+OHa 

MD 

1790 

3960 

4178 

Be2+(HaO)4(H20)8 

Harmonic 

1845 

3696 

3725 

Be2  +  (HaO)4(H20)3 

MD 

1845 

3689 

3710 

Be2+(HaO)4(H20)8 

Harmonic 

1743 

3912 

4211 

Be2MHaO)4(HaO)8 

MD 

1736 

3900 

4198 

LiMHaO)4(H20)8 

Harmonic 

1792 

3952 

4141 

LiMHaO)4(H20)8 

MD 

1790 

3981 

4239 

LiMH20)4(H20)6 

Harmonic 

1729 

3913 

4242 

LiMH20)4(H20)8 

MD 

1763 

3927 

4252 

transform  of  the  velocity  autocorrelation  function): 

I(v) 

Since  no  splitting  into  symmetric  and  asymmetric 
modes  is  visible,  the  following  procedure  [38]  was 
employed  to  separate  them:  If  the  hydrogen  veloci¬ 
ties  relative  to  the  oxygen  velocities  are  projected 


onto  the  O — H  bonds,  their  sums  and  differences 
should  correspond  to  the  symmetric  and  asymmet¬ 
ric  modes.  For  Figure  8  and  Table  V,  this  scheme  is 
used  to  separate  the  modes.  It  can  be  seen  that 
their  maxima  nearly  coincide. 

The  first-shell  bending  mode  appears  at  1845 
cm“^.  For  the  second  shell,  the  bending  mode  is 
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FIGURE  8.  Power  spectra  of  the  symmetric  and  asymmetric  0— H  stretching  modes  of  the  water  molecules  in  the  first 
(solid  curves)  and  second  (dashed  curves)  hydration  shells  of  Be^‘*'(H20)i2. 
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located  at  1736  cm“^;  the  symmetric  O — H  stretch¬ 
ing  mode,  at  3900  cm"^,  and  the  asymmetric  OH 
stretching  mode,  at  4198  cm“^  (Table  V).  The  fre¬ 
quency  shift  between  the  two  shells  is  thus  211 
cm"^  for  the  symmetric  O — H  stretching  mode 
and  488  cm”^  for  the  asymmetric  stretching  mode. 
Xhe  O — H  modes  of  the  second  shell  agree  within 
a  few  wavenumbers  with  the  OH  stretching  modes 
of  an  isolated  water  molecule. 

Figure  9  shows  the  power  spectrum  for 
Li‘^(H20)4(H20)8.  The  symmetric  O — H  stretch¬ 
ing  mode  of  the  first  shell  is  located  at  3981  cm“\ 
and  the  asymmetric  stretching  mode,  at  4239  cm”^ 
(Table  V).  The  second-shell  frequencies  are  just 
slightly  different  (3927  and  4259  cm“\  respec¬ 
tively).  There  is  no  strong  frequency  shift  of  the 
OH  stretching  modes  between  the  two  shells  in 
contrast  to  the  Be^^  system.  It  could  perhaps  have 
been  expected  that  the  agreement  between  har¬ 
monic  frequencies  and  simulation-derived  values 
is  less  close  for  the  larger  complexes  than,  i.e.,  for 
the  more  rigid  monohydrates.  However,  as  can  be 
seen  from  Table  V,  at  least  for  the  O — H  modes, 
this  is  not  the  case. 


Summary  and  Conclusions 

The  data  presented  show  that  many  but  not  all 
qualitative  aspects  of  hydrated  Be^'^  clusters  can 


approximately  be  derived  from  optimized  geome¬ 
tries.  However,  without  the  corresponding 
simulations,  this  fact  would  not  have  been 
easy  to  conclude.  Besides  providing  the  statis¬ 
tical  distributions,  the  MD  simulations  show  the 
occurrence  of  intermediate  coordination  states  in 
Li"^  clusters  and  the  dynamical  effect  on  the  dis¬ 
tance  of  the  two  hydration  shells.  The  simulations 
involving  hydrogen  transfer  indicate  that  it  would 
be  possible  to  study  statistical  features  of  these 
reactions  by  performing  simulations  with  an  ex¬ 
tended  set  of  systematically  varying  initial  condi¬ 
tions.  The  evolution  of  the  Li"^  clusters  from  their 
initial  configurations  shows  that  in  aqueous 
clusters  of  this  cation  the  most  stable  hydration 
number  of  four  can  easily  be  disturbed. 

We  found  MD  simulations  with  forces  derived 
from  ab  initio  wave  functions  to  be  a  practical 
method  for  studying  these  systems.  While  this 
method  is  traditionally  used  for  geometry  opti¬ 
mization,  it  is  not  applied  as  often  as  the  so-called 
Car-Parrinello  (CP)  scheme  for  molecular  dynam¬ 
ics.  In  the  latter,  the  electron  density  is  tradition¬ 
ally  expanded  in  a  plane-wave  basis  set.  More 
characteristically,  however,  the  orbital  coefficients 
are  treated  as  part  of  the  coordinates  and  the 
combined  gradient  of  the  orbital  coeofficients  and 
of  the  nuclear  coordinates  with  respect  to  the  en¬ 
ergy  is  used  for  propagating  the  system. 


FIGURE  9.  Power  spectra  of  the  symmetric  and  asymmetric  O — H  stretching  modes  of  the  water  molecules  in  the  first 
(solid  curves)  and  second  (dashed  curves)  hydration  shells  of  Li'^(H20)i2. 
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At  first  it  might  seem  that  using  the  optimized 
wave  function  must  be  slower  since  iterations  are 
needed  at  every  time  step,  but  this  disadvantage  is 
partially  alleviated  by  the  possibility  of  using  a 
larger  time  step,  since  the  electronic  degrees  of 
freedom  must  be  combined  with  rather  low  ficti¬ 
cious  masses  which  also  complicate  frequency  cal¬ 
culations.  The  time  step  used  in  the  present  work 
(0.3  ps)  is  at  least  twice  as  large  as  the  one  typi¬ 
cally  employed  in  CP  simulations.  A  second  argu¬ 
ment  why  the  CP  method  could  be  advantageous 
is  its  reportedly  better  energy  conservation  [39]. 
Indeed,  for  perfect  energy  conservation,  extremely 
accurate  nuclear  forces  are  necessary  and  they 
might  be  difficult  to  obtain  from  a  wave  function 
optimized  to  some  threshold  criterion  than  from 
the  set  of  generalized  coordinates.  We  did  not, 
however,  observe  any  problems  related  to  this 
circumstance. 

From  a  pragmatic  point  of  view,  it  seems  that 
for  the  simulation  of  periodic  systems  there  is 
presently  no  alternative  to  using  plane  waves  as  a 
basis  set.  For  nonperiodic  systems,  simple 
"hierarchical"  MD  simulation  schemes  like  the 
one  we  used  here,  where  the  wave  function  is 
optimized  at  every  step  and  its  gradient  calcu¬ 
lated,  might  be  equally  economic.  Finally,  it  shall 
be  mentioned  that  there  exist  also  successful  at¬ 
tempts  to  combine  Hartree-Fock-type  calculations 
with  the  generalized  coordinate  treatment  [40]  as 
well  as  the  "opposite"  combination,  using 
plane-wave  basis  sets  within  a  traditional  DFT 
scheme  [41]. 
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ABSTRACT:  The  geometry  and  vibrational  spectrum  of  TeCl4  was  calculated  with 
various  quantum  chemical  methods  [Hartree-Fock,  second-order  Moller-Plesset  (MP2) 
and  generalized  gradient  approximation  density  functional  theory  (GGA-DFT)].  Five 
different  basis-set  combinations  were  tested:  the  relativistic  effective  core  potentials  with 
double-zeta  split  valence  basis  (RECP)  of  Hay  &  Wadt  (HW)  and  Stevens  et  al.  (CEP);  the 
above  RECP  basis  sets  extended  with  polarization  functions  for  Te  and  using  a  6-3 IG* 
basis  for  Cl  (HW*  and  CEP*);  a  medium-size  all-electron  basis  set  (ALL).  The  quality  of 
the  calculated  data  was  assessed  by  comparison  with  recent  experimental  results.  The 
Hartree-Fock  method  combined  with  the  HW  and  CEP  basis  sets  provided  a  very  good 
approximation  of  the  experimental  vibrational  spectra.  The  quality  of  the  results  is 
comparable  to  those  of  the  best  methods  (MP2,  B3-P,  B3-PW  with  HW*  and  CEP*  and 
B3-LYP,  B3-P,  B3-PW  with  the  ALL  basis  set).  However,  the  HW  and  CEP  basis  sets 
provided  very  poor  geometry  and  vibrational  frequencies  when  they  were  used  in 
combination  with  any  correlated  method  in  this  work.  Similarly,  the  DFT  methods  using 
Becke's  1988  exchange  functional  (B-LYP,  B-P,  B-PW)  without  the  inclusion  of  the  exact 
exchange  let  to  very  poor  results  with  the  basis  sets  used  in  this  study.  ©  1997  John  Wiley 
&  Sons,  Inc.  Int  J  Quant  Chem  65:  817-826,  1997 


Contract  grant  sponsor:  Hungarian  Scientific  Research  Foun¬ 
dation. 

Contract  grant  number:  F022170. 


International  Journal  of  Quantum  Chemistry,  Vol.  65,  817-826  (1997) 
©  1 997  John  Wiley  &  Sons,  Inc. 


CCC  0020-7608  /  97  /  05081 7-1 0 


KOVACS  AND  CSONKA 


Introduction 

Vibrational  data  are  of  great  importance  at 
various  fields  of  chemistry  and  physics  such 
as  molecular  mechanics  and  molecular  dynamics, 
statistical  thermodynamics,  gas  electron  diffraction 
analysis,  etc.  However,  their  experimental  deter¬ 
mination  for  inorganic  molecules  is  difficult.  High 
temperatures,  long  path  lengths,  and  sensitive  de¬ 
tectors  are  needed  to  detect  the  (often  low-intense) 
vibrational  bands  of  the  low-volatile  compounds. 
Additionally,  the  interpretation  of  the  high-tem- 
perature  spectra  is  hampered  by  the  broad  rota¬ 
tional  contours,  hot  bands,  and  possible  presence 
of  other  components  in  the  gas  phase  (decomposi¬ 
tion  and  association  products).  Hence,  results  from 
other  techniques  such  as  matrix  isolation  spec¬ 
troscopy  and  theoretical  methods  are  acknowl¬ 
edged  to  extend  the  high-temperature  experimen¬ 
tal  data. 

The  quantum  chemical  calculations  have  been 
successfully  applied  for  the  interpretation  and  pre¬ 
diction  of  vibrational  spectra  of  organic  com¬ 
pounds  [1].  However,  the  application  for  inorganic 
molecules  is  more  difficult.  First  of  all,  these  latter 
molecules  are  usually  built  up  out  of  heavier  ele¬ 
ments,  rendering  the  computations  with  all-elec¬ 
tron  basis  sets  very  expensive.  Furthermore,  the 
errors  arising  from  the  neglect  of  relativistic  effects 
are  important  from  the  fourth  row  of  the  periodic 
table.  These  two  problems  can  be  solved  by  using 
relativistic  effective  core  potentials  (RECP)  [2-8], 
which  are  extensively  used  in  Hartree-Fock  (HF) 
and  post-HF  calculations. 

The  density  functional  (DFT)  methods  are 
known  to  provide  good  estimates  of  the  vibra¬ 
tional  spectra  of  organic  compounds.  Recent  DFT 
studies  of  transition-metal  complexes  also  have 
shown  promising  agreement  with  the  experimen¬ 
tal  infrared  (IR)  frequencies  and  intensities  [9,  10]. 
However  less  information  is  available  about  the 
performance  of  the  DFT  methods  with  effective 
core  potentials.  The  use  of  DFT  functionals  with 
ECPs  generated  previously  from  Hartree-Fock 
atomic  calculations  is  not  free  of  problems.  Since 
the  exchange  and  correlational  functionals  are  non¬ 
linear  in  the  density,  the  core  and  valence  densities 
are  not  strictly  separable.  Hence,  the  use  of  ECPs 
generated  from  HF  calculations  should  be  care¬ 
fully  tested  for  their  applicability  in  DFT  work. 
Promising  results  have  been  published  recently  by 


Russo  et  al.  [11].  Investigating  a  small  but  repre¬ 
sentative  set  of  transition-metal  compounds 
[SCF3,  TiF4,  VF5,  TiO,  Ni(CO)4,  CuF],  they  found 
that  the  RECPs  of  Hay  and  Wadt  [2-4]  can  be  used 
in  DFT  methods,  as  well. 

In  the  present  study  we  test  the  performance  of 
currently  most  successful  generalized  gradient  ap¬ 
proximation  (GGA)  and  hybrid  DFT  methods  us¬ 
ing  effective  core  potentials  as  compared  to 
Hartree-Fock  and  second-order  Moller-Plesset 
(MP2)  methods  as  well  as  to  all-electron  basis  sets. 

The  tellurium  tetrachloride  molecule  was  cho¬ 
sen  as  a  model  for  this  study.  Its  molecular  geome¬ 
try  (determined  by  gas  electron  diffraction)  and 
experimental  vibrational  spectra  are  known  from  a 
recent  study  [12].  The  TeCl4  molecule  has  a  trigo¬ 
nal  bipyramid  structure  with  a  vacant  equatorial 
position  symmetry).  The  molecular  geometry 
is  in  accordance  with  the  valence-shell  electron-pair 
repulsion  (VSEPR)  theory  [13].  Characteristics  are 
the  axial  Te — Cl  bonds  being  longer  than  the  equa¬ 
torial  bonds  and  the  axial  chlorines  being  slightly 
bent  toward  the  equatorial  ones.  Concerning  the 
vibrational  data,  eight  of  the  nine  fundamentals  of 
TeCl4  were  found  in  the  experimental  specta  and  a 
good  estimate  [based  on  a  scaled  quantum  me¬ 
chanical  (SQM)  [14]  force  field]  is  available  for  the 
unobserved  (P4)  mode  [12]. 


Computational  Details 

Geometries,  Cartesian  harmonic  force  constants, 
and  IR  intensities  of  TeCl4  were  calculated  at  the 
HF,  MP2  [15],  and  DFT  levels  of  theory  using  the 
Gaussian  94  program  [16].  Six  exchange-correla¬ 
tion  density  functionals  were  employed  in  the  pre¬ 
sent  study:  Becke's  1988  (B)  [17]  and  Becke's  three 
parameter  (B3)  [18]  gradient-corrected  exchange 
functionals  were  combined  with  the  Lee- Yang- 
Parr  (LYP)  [19],  Perdew  (P)  [20],  and  Perdew- 
Wang  (PW)  [21]  correlation  functionals.  The  de¬ 
fault  grid  (75,302)  implemented  in  the  Gaussian  94 
program  was  used  in  the  DFT  calculations.  We 
used  five  different  basis  set  combinations: 

1.  HW:  the  Hay  &  Wadt  RECPs  with  double- 
zeta  valence  basis  sets  for  both  Cl  and  Te  [3] 

2.  CEP:  the  RECPs  of  Stevens  et  al.  and  the 
corresponding  double-zeta  valence  basis  sets 
for  both  Cl  and  Te  [5,  6] 

3.  HW* :  the  6-31G(d)  basis  set  for  chlorine  and 
the  Hay  &  Wadt  RECP  with  double-zeta  va- 
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lence  basis  set  [3]  extended  with  a  single  set 
of  d-type  polarization  functions  [22]  for  tel¬ 
lurium 

4.  CEP* :  the  6-31G(d)  basis  set  for  chlorine  and 
the  RECP  of  Stevens  et  al.  with  double-zeta 
valence  basis  set  [6]  extended  with  a  single 
set  of  d-type  polarization  functions  [22]  for 
tellurium 

5.  ALL  (all-electron  basis  set):  the  standard  6- 
31G(d)  basis  set  for  chlorine  and  a  3-21G(d) 
basis  [23]  for  tellurium 

Harmonic  vibrational  frequencies  and  IR  inten¬ 
sities  were  computed  for  the  fully  optimized  ge¬ 
ometries.  The  first  derivatives  of  the  potential  en¬ 
ergy  with  respect  to  the  nuclear  coordinates  were 
calculated  analytically  at  all  quantum  chemical 
levels.  The  second  derivatives  were  obtained  ana¬ 
lytically  for  the  all-electron  basis  set  and  numeri¬ 
cally  for  the  RECP  basis  sets. 


Results  and  Discussion 

GEOMETRY 

The  computed  geometrical  parameters  of  TeCl4 
obtained  at  various  levels  of  theory  are  compared 
to  the  experimental  geometry  in  Table  1. 

The  DFT  methods  containing  Becke's  1988  ex¬ 
change  functional  (B-LYP,  B-P,  and  B-PW)  with 
the  most  economic  HW  and  CEP  basis  sets  found 
the  geometry  as  a  single  minimum  on  the 
potential  energy  surface.  All  the  other  calculations 
converged  to  the  right  €2^  structure  resulting 
longer  axial  Te — Cl  bonds  with  respect  to  the 
equatorial  ones  in  accordance  with  the  experimen¬ 
tal  data.  The  methods  resulting  in  false  geometries 
were  omitted  from  the  further  studies. 

Figures  1(a)  and  1(b)  show  the  deviations  be¬ 
tween  the  calculated  and  experimental  geometrical 
parameters.  The  most  significant  difference  be¬ 
tween  the  computed  results  is  the  very  poor  per¬ 
formance  of  the  HW  and  CEP  basis  sets  for  the 
bond  distances  [cf.  Fig.  1(a)]. 

The  calculated  bond  lengths  are  longer  than  the 
experimental  ones,  with  one  exception  of  the 
HF/HW*  result  for  the  Te — Cl^^  bond.  In  gen¬ 
eral,  the  computations  underestimate  the  Cl^^ — Te 
— Clgq  bond  angle,  however,  differ  in  the  character 
of  the  Clg^ — Te — Cl^^  angle  (cf.  Table  I).  The  ex¬ 
perimental  tilt  of  the  axial  chlorines  toward  the 
equatorial  ones  is  reproduced  by  the  HF  and  MP2 
methods  (except  the  MP2/CEP  level)  and  by  the 
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B3-type  DFT/ALL  calculations.  In  all  the  other 
cases  no  tilt,  or  a  small  opposite  tilt,  was  obtained. 
The  latter  bond  angle  is  generally  overestimated 
by  the  various  methods,  except  the  HF  with  all  the 
basis  sets  and  the  MP2/ALL  [cf.  Fig.  1(b)].  The 
deviations  for  all  the  geometrical  parameters  are 
generally  larger  than  the  experimental  errors. 

In  general,  the  computed  bond  lengths  and  the 
— Te — Clgq  angle  increase  in  the  order  HF  < 
MP2  <  B3-P  -  B3-PW  <  B3-LYP  <  B-P  B-PW  < 
B-LYP.  [The  only  significant  deviation  from  this 
trend  was  found  for  the  bond  lengths  obtained  at 
the  MP2/HW  and  CEP  levels;  cf.  Fig.  1(a)].  We 
note  that  similar  behavior  has  been  found  earlier 
for  C — C  and  C — O  bond  lengths  of  sugars 
and  1,2-ethanediol  [24,  25].  The  computed 
Clgq — Te — Clgq  bond  angle  varies  in  a  narrower 
range.  The  above  trend,  but  with  negative  sign 
(because  of  the  underestimating  character  of  the 
computations),  can  be  recognized,  except  for  the 
results  obtained  by  using  the  ALL  basis  set. 

Summing  up,  the  experimental  geometry  of 
TeCl4  is  best  reproduced  by  the  HF  and  MP2 
methods.  The  DFT  methods  provided  poorer  re¬ 
sults,  especially  those  that  contain  the  B-type  ex¬ 
change  functional.  The  HW*,  CEP*,  and  ALL  ba¬ 
sis  sets  are  of  similar  quality,  HW*  being  slightly 
superior;  the  HW  and  CEP  basis  sets  are  much 
poorer. 

FREOUEIVCIES 

The  observed  and  computed  fundamentals  of 
TeCl4  are  compiled  in  Table  II.  The  deviations 
between  selected  computed  and  the  experimental 
frequencies  are  shown  in  Figure  2.  We  present  here 
the  results  obtained  at  various  levels  of  theory 
using  the  HW  and  HW*  basis  sets.  They  resemble 
the  results  obtained  with  the  other  related  basis 
sets  (HW  to  CEP,  HW*  to  CEP*  and  ALL).  The 
following  trend  can  be  observed:  The  B3-type 
DFT/HW  and  B-type  DFT/HW*  frequencies  are 
considerably  underestimated,  while  the  HF  and 
MP2/HW*  ones  are  overestimated.  The  HF/HW 
and  most  of  the  B3-type  DFT/HW*  frequencies 
vary  around  the  experimental  values.  We  note  the 
significant  underestimation  of  the  and  Vg 
stretching  fundamentals  by  the  DFT  calculations. 
The  computed  frequencies  increase  in  the  order 
B-LYP  <  B-P  ~  B-PW  <  B3-LYP  <  B3-PW  -  B3-P 
<  MP2  <  HF.  This  trend  is  just  the  opposite  to 
that  seen  for  most  of  the  geometrical  parameters 
(cf.  Fig.  1). 
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TABLE  I _ 

Molecular  geometry  of  TeCl4. 


Bond  length  (A)  Bond  angle  (deg) 


Basis® 

Method 

Te-Cleq 

Clax 

Cleq  Te  Clgq 

Cl^^  — Te  — Cl, 

Exp.^ 

2.430(6) 

2.298(6) 

175.9(6) 

103.0(7) 

88.7^ 

HW 

HF 

2.537 

2.409 

172.8 

101.4 

87.7 

MP2 

2.588 

2.468 

177.8 

100.5 

89.3 

B-LYP^ 

2.585 

109.5 

B-P^ 

2.568 

109.5 

B-PW^ 

2.566 

109.5 

B3-LYP 

2.578 

2.481 

174.3 

99.3 

91.9 

B3-P 

2.559 

2.463 

176.6 

99.6 

90.9 

B3-PW 

2.560 

2.466 

177.6 

99.9 

90.8 

CEP 

HF 

2.568 

2.431 

174.4 

100.6 

88.2 

MP2 

2.626 

2.501 

177.9 

98.6 

90.7 

B-LYP^ 

2.607 

109.5 

B-P^ 

2.587 

109.5 

B-PW^ 

2.586 

109.5 

B3-LYP 

2.600 

2.505 

171.0 

98.7 

92.9 

B3~P 

2.579 

2.486 

174.8 

99.2 

91.7 

B3-PW 

2.583 

2.491 

173.9 

99.1 

92.0 

HW* 

HF 

2.440 

2.290 

175.2 

100.7 

88.1 

MP2 

2.441 

2.313 

176.6 

100.2 

88.9 

B-LYP 

2.507 

2.389 

172.5 

99.4 

92.4 

B-P 

2.485 

2.369 

174.6 

99.5 

91.7 

B-PW 

2.479 

2.367 

175.9 

100.0 

91.3 

B3-LYP 

2.472 

2.346 

179.0 

100.1 

90.4 

B3-P 

2.453 

2.330 

179.6 

100.2 

89.9 

B3-PW 

2.455 

2.333 

179.8 

100.1 

89.9 

CEP* 

HF 

2.461 

2.313 

175.5 

100.6 

88.5 

MP2 

2.463 

2.339 

176.8 

100.4 

89.0 

B-LYP 

2.522 

2.415 

172.2 

99.9 

92.5 

B-P 

2.500 

2.395 

174.9 

100.1 

91.6 

B-PW 

2.499 

2.393 

174.9 

100.1 

91.7 

B3-LYP 

2.491 

2.371 

178.8 

100.2 

90.4 

B3-P 

2.471 

2.354 

179.7 

100.3 

89.9 

B3-PW 

2.475 

2.357 

179.8 

100.3 

90.1 

ALL 

HF 

2.449 

2.317 

173.5 

100.7 

87.9 

MP2 

2.448 

2.338 

174.7 

100.5 

88.3 

B-LYP 

2.509 

2.410 

178.5 

102.3 

90.4 

B-P 

2.481 

2.383 

179.6 

102.2 

89.9 

B-PW 

2.480 

2.382 

179.8 

102.1 

89.9 

B3-LYP 

2.477 

2.368 

177.8 

101.7 

89.3 

B3-P 

2.452 

2.345 

176.8 

101.5 

89.0 

B3-PW 

2.456 

2.348 

176.9 

101.5 

89.0 

®  For  details  of  basis  sets  HW,  CEP,  HW^,  CEP^,  and  ALL  see  text. 
^From  a  gas  electron  diffraction  study,  Ref.  [12],  Tg  values. 
^Calculated  from  the  data  given  in  Ref.  [12]. 

^  The  computed  geometry  has  tetrahedral  symmetry. 
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FIGURE  1a.  Deviations  between  computed  and  experimental  geometrical  parameters:  (a)  bond  lengths;  (b)  bond 
angles. 


The  performance*  of  the  computational  levels 
can  be  assessed  and  compared  by  the  root-mean- 
square  (rms)  and  maximal  deviations  between  the 

*  By  performance  we  mean  here  how  the  different  theoreti¬ 
cal  levels  are  able  to  estimate  the  experimental  (anharmonic) 
vibrational  spectra.  We  do  not  intend  to  investigate  the  quality 
of  the  calculated  harmonic  frequencies. 


experimental  and  theoretical  frequencies  (Fig. 

The  rms  deviations  are  scattered  within  the  8-53 
cm”^  range,  while  the  maximal  deviations  (ab¬ 
solute  values)  vary  from  15  to  80  cm  “L 

^  The  mode  was  not  involved  in  the  evaluation  of  these 
deviations. 
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HW  CEP  HW*  CEP*  ALL  HW  CEP  HW*  CEP*  ALL 


decf^®‘^*eq 

FIGURE  1b.  (continued) 


On  Figure  3  a  significant  trend  can  be  observed 
for  both  the  theoretical  levels  and  the  basis  sets,  a 
trend  which  is  reflected  at  both  the  rms  and  maxi¬ 
mal  deviations. 

The  performance  of  the  HW  and  CEP  basis  sets 
used  with  the  correlated  methods  is  very  poor, 
however,  it  is  surprisingly  good  with  the  Hartree- 
Fock  theory.  The  HF/HW  level  belongs  even  to 
the  best  methods.  The  HW*,  CEP*,  and  ALL  basis 


822 


sets  performed  well  with  the  MP2  and  B3-type 
DFT  levels,  while  about  two-three  times  larger 
deviations  can  be  observed  for  the  results  obtained 
at  the  B-type  DFT  calculations.  It  is  noteworthy 
that  the  MP2/HW*  and  CEP*  frequencies  are  in 
very  good  agreement  with  the  experimental  data, 
however,  the  agreement  worsened  using  the  all¬ 
electron  (ALL)  basis  set.  Here  the  B3-type  DFT 
levels  are  definitely  superior.  We  note  also  the 
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TABLE  II _ 

Experimental  and  calculated  fundamentals  (cm~^)  and  calculated  IR  intensities  (km  mol~^)  of  TeCl4. 


rms  max. 


Basis® 


HW 


CEP 


HW* 


CEP* 


ALL 


Method 

^1 

^2 

^^3 

^^4 

^5 

^6 

^7 

^8 

^9 

dev. 

dev. 

Exp.^ 

72 

104 

158 

(150°  ±  10) 

165 

290 

312 

378 

382 

HF 

73(3) 

107  (13) 

150  (19) 

160  (0) 

174  (8) 

285  (1) 

307  (224) 

366  (21) 

359  (9) 

10.5 

21 

MP2 

57(3) 

82(9) 

125  (11) 

133  (0) 

147  (4) 

254  (1) 

299  (145) 

319  (24) 

315  (10) 

38.0 

67 

B3-LYP 

50  (3) 

61  (6) 

109  (6) 

113  (0) 

128  (2) 

258  (4) 

300  (129) 

308  (33) 

309  (14) 

46.8 

73 

B3-P 

52(2) 

67(6) 

133  (7) 

118  (0) 

133  (2) 

265  (4) 

309  (134) 

320  (34) 

321  (15) 

37.3 

61 

B3-PW 

53(2) 

68  (6) 

111  (7) 

121  (0) 

135  (3) 

266  (4) 

310(133) 

316  (35) 

318  (13) 

40.8 

64 

HF 

73(4) 

102  (13) 

147  (18) 

153  (0) 

168  (6) 

271  (1) 

300  (279) 

365  (22) 

354  (11) 

15.4 

28 

MP2 

54(4) 

69  (8) 

117(8) 

121  (0) 

135  (2) 

238  (2) 

294  (162) 

303  (29) 

306  (14) 

48.2 

76 

B3-LYP 

40(3) 

54  (5) 

103  (4) 

108  (0) 

123  (1) 

250  (6) 

300  (154) 

298  (38) 

306  (16) 

52.7 

80 

B3-P 

48(3) 

63  (6) 

108  (6) 

115  (0) 

130  (2) 

258  (6) 

310  (161) 

309  (41) 

317  (16) 

44.7 

69 

B3-PW 

47(3) 

61  (6) 

107  (6) 

115  (0) 

130(2) 

257  (6) 

308  (160) 

307  (40) 

315(16) 

46.0 

71 

HF 

96(3) 

128  (10) 

181  (14) 

184  (0) 

201  (10)301  (2) 

306  (341) 

408  (60) 

415  (23) 

25.3 

36 

MP2^ 

85(2) 

113(7) 

163  (9) 

166  (0) 

183  (5) 

291  (3) 

322  (256) 

376  (59) 

385  (21) 

9.4 

18 

B-LYP 

65(2) 

72(4) 

126  (4) 

127  (0) 

146  (1) 

258  (8) 

290  (186) 

315  (62) 

326  (22) 

37.2 

63 

B-P 

67(2) 

79(4) 

131  (5) 

134  (0) 

153  (1) 

270  (7) 

303  (194) 

331  (64) 

341  (23) 

27.1 

47 

B-PW 

70(1) 

86  (4) 

132  (5) 

137  (0) 

155  (2) 

271  (7) 

305  (196) 

330  (64) 

342  (22) 

26.0 

48 

B3-LYP 

72(2) 

95  (5) 

145  (7) 

149  (0) 

167  (2) 

275  (5) 

305  (229) 

345  (62) 

357  (22) 

16.7 

33 

B3-P 

73(2) 

100  (6) 

149  (7) 

154  (0) 

172  (3) 

286  (5) 

317  (234) 

358  (65) 

370  (22) 

9.6 

20 

B3-PW 

78(2) 

100  (6) 

150  (7) 

153  (0) 

171  (3) 

284  (5) 

315  (234) 

357  (64) 

369  (22) 

10.0 

21 

HF 

92  (3) 

122  (9) 

177  (15) 

176  (0) 

195  (8) 

304  (2) 

314  (353) 

400  (61) 

407  (25) 

20.3 

30 

MP2 

80  (2) 

107(7) 

158  (10) 

158  (0) 

175  (5) 

291  (3) 

328  (256) 

368  (59) 

376  (22) 

8.4 

16 

B-LYP 

59  (2) 

73(3) 

118  (4) 

122  (0) 

139  (1) 

262  (8) 

299  (183) 

309  (60) 

320  (22) 

40.3 

69 

B-P 

64(2) 

79(4) 

124  (5) 

128  (0) 

146  (2) 

273  (8) 

312  (194) 

325  (64) 

335  (22) 

30.6 

53 

B-PW 

64(2) 

80  (4) 

126  (5) 

130  (0) 

147  (2) 

273  (8) 

312  (196) 

326  te4) 

336  (22) 

29.8 

52 

B3-LYP 

72(2) 

92(5) 

141  (7) 

142  (0) 

160  (2) 

277  (6) 

313  (229) 

340  (62) 

352  (22) 

19.3 

38 

B3-P 

74(2) 

96  (5) 

145  (8) 

147  (0) 

164  (3) 

288  (5) 

324  (237) 

354  (65) 

365  (23) 

12.5 

24 

B3-PW 

74(2) 

95  (5) 

145  (8) 

146  (0) 

164  (3) 

286  (6) 

322  (237) 

352  (65) 

363  (23) 

13.3 

26 

HF 

96  (2) 

131  (8) 

181  (13) 

183  (0) 

200  (11)307  (2) 

327  (326) 

411  (64) 

416  (26) 

27.0 

35 

MP2 

86(1) 

119(6) 

166  (9) 

167  (0) 

184  (6) 

299  (3) 

341  (252) 

384  (62) 

389  (23) 

15.2 

29 

B-LYP 

52  (2) 

80  (4) 

125  (5) 

138  (0) 

155  (1) 

238  (3) 

267  (197) 

335  (65) 

338  (23) 

36.5 

52 

B-P 

64(1) 

95  (4) 

135  (5) 

147  (0) 

164  (2) 

266  (4) 

301  (205) 

353  (67) 

355  (23) 

18.5 

27 

B-PW 

73(1) 

99  (4) 

139  (5) 

142  (0) 

157  (3) 

279  (6) 

321  (207) 

341  (68) 

348  (23) 

19.9 

37 

B3-LYP 

70(2) 

104  (5) 

146  (7) 

155  (0) 

172  (3) 

275  (4) 

309  (233) 

365  (66) 

369  (23) 

9.8 

15 

B3-P 

74  (1) 

111  (5) 

152(7) 

161  (0) 

177(4) 

294  (4) 

332  (239) 

380  (68) 

384  (24) 

9.1 

20 

B3-PW 

79  (1) 

112(5) 

154  (7) 

156  (0) 

171  (5) 

293  (4) 

333  (238) 

364  (68) 

375  (24) 

10.4 

21 

®  For  details  of  basis  sets  HW,  CEP,  HW*,  CEP*,  and  ALL  see  text, 
^  From  Ref.  [12]. 

^SQM  value  using  the  MP2/HW*  harmonic  force  field. 


significant  improvement  of  the  results  obtained  at 
the  B3-LYP  level  used  with  the  ALL  basis  set 
compared  to  those  with  HW*  and  CEP*.  The 
results  of  the  B3-P  and  B3-PW  levels  used  with 
CEP*  worsened  somewhat  compared  to  those  us¬ 
ing  the  HW*  basis  and  to  MP2/CEP*.  The  perfor¬ 
mance  of  the  Hartree-Fock  theory  with  these  three 
larger  basis  sets  is  similar  to  that  of  the  B-type  DFT 
methods. 


The  best  agreement  between  the  experimental 
and  computed  frequencies  was  obtained  at  the 
MP2/CEP*  level,  however,  only  slightly  worse 
are  the  HF/HW,  the  MP2,  B3-P  and  B3-PW/HW*, 
and  the  B3-P,  B3-LYP  and  B3-PW/ALL  results 
(rms  deviation  less  than  11  cm“\  maximal  devia¬ 
tion  not  exceeding  21  cm”^). 

The  computed  vibrational  data  of  TeCl4  are 
suitable  to  assist  the  assignment  of  the  bands  in 
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B3-P 


FIGURE  2.  Deviations  between  calculated  and  experimental  vibrational  frequencies  obtained  at  various  levels  of  theory  using  basis  sets 
(a)  HW  and  (b)  HW*. 
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FIGURE  3.  (a)  Rms  and  (b)  maximal  deviations  (absolute  values)  between  the  calculated  and  experimental  vibrational 
frequencies. 


the  experimental  spectra  (cf.  Table  II).  The  stretch¬ 
ing  fundamentals  can  be  assigned  unambiguously 
by  the  computed  IR  intensities.  In  most  cases  the 
relative  sequence  of  the  frequencies  is  in  accord 
with  the  experimental  observations,  and  only  the 


computed  and  Vg  stretching  fundamentals  are 
interchanged  at  the  HF/HW,  HF/CEP,  and 
MP2/FIW  levels  as  well  as  the  calculated  Py  and 
p^  modes  at  the  B3-P  and  B3-PW /CEP  levels.  The 
deformation  modes  have  similar  (small)  computed 
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IR  intensities,  and  the  relative  order  of  their  fre¬ 
quencies  agrees  with  that  of  the  experimental  ones. 
The  unobserved  mode  was  positioned  between 
and  by  all  the  quantum  chemical  calcula¬ 
tions  except  for  B3-P/HW  and  HF/CEP*,  how¬ 
ever  the  experimental  values  of  these  fundamen¬ 
tals  are  also  very  close  [12]. 


Conclusions 

1.  The  molecular  geometry  of  TeCl4  is  best  re¬ 
produced  at  the  HF  and  MP2  levels  using  the 
HW*,  CEP*,  and  ALL  basis  sets.  The  density 
functional  methods  gave  rise  to  poorer  re¬ 
sults,  especially  those  using  Becke's  1988  ex¬ 
change  functional  The  RECP  basis  sets  (HW 
and  CEP)  provided  poor  results,  especially 
for  the  bond  distances, 

2.  Generally,  the  vibrational  frequencies  calcu¬ 
lated  by  the  MP2  and  B3-type  DFT  methods 
with  basis  sets  HW*,  CEP*,  and  ALL  (except 
MP2/ALL  and  B3-LYP/HW*  and  CEP*)  as 
well  as  at  the  HF/HW  level  are  in  good 
accord  with  the  experimental  data.  The 
agreement  is  good  enough  to  assist  the  as¬ 
signment  of  the  measured  vibrational  spec¬ 
tra. 

3.  The  HW  and  CEP  basis  sets  compensate  ef¬ 
fectively  the  errors  of  the  HF  method  for 
TeCl4.  The  HW*  and  CEP*  basis  sets  are 
best  combined  with  MP2,  B3-P  and  B3-PW. 
Regarding  the  all-electron  basis  set  a  combi¬ 
nation  with  the  B3-type  DFT  methods  is  su¬ 
perior. 

4.  Based  on  their  performance  and  cost  effec¬ 
tiveness  the  methods  using  relativistic  effec¬ 
tive  core  potentials  are  advantageous  for 
large-scale  computations. 
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ABSTRACT:  A  general  strategy  for  computing  the  vibrational  frequencies  of 
adsorbates  on  surfaces  is  presented.  Rather  than  use  numerical  fits  to  the  surface  energy, 
we  advocate  diagonalizing  a  mass  weighted  energy  second-derivative  matrix.  The 
computation  of  energy  derivatives  has  several  advantages:  It  facilitates  a  more  accurate 
treatment  of  the  adsorbate/substrate  mass  ratios.  The  normal  modes  of  the  surface 
vibrations  are  given  automatically  in  the  diagonalization  step.  Finally,  a  major  advantage 
of  using  energy  derivatives  is  that  they  provide  estimates  of  the  range  of  the 
adsorbate/substrate  interactions  which  can  be  used  to  ensure  that  the  frequency  values 
are  converged.  We  illustrate  the  strategy  for  cluster  calculations  which  simulate  the 
H/A1(100),  0/Al(lll),  H20/A1(111),  and  Al/Si(lll)  surfaces.  ©  1997  John  Wiley  &  Sons, 
Inc.  Int  J  Quant  Chem  65:  827-838,  1997 


Introduction 

High-resolution  electron  energy  loss  spec¬ 
troscopy  (HREELS)  [1,  2]  and  reflection  ad¬ 
sorption  infrared  spectroscopy  (RAIRS)  [1,  3]  are 
vibrational  spectroscopic  techniques  which  now 
play  a  valuable  role  in  identifying  and  characteriz¬ 
ing  structural  information  on  adsorbates,  such  as 
reactive  intermediates,  on  surfaces.  Generally, 
symmetry  arguments  coupled  with  experimental 
experience  from  related  better-understood  systems 
enables  vibrational  assignments  to  be  correlated 
with  a  specific  atomic  or  molecular  adsorbate  at  a 
particular  surface  site.  Unfortunately,  this  empiri¬ 
cal  approach  does  not  always  facilitate  an  unam¬ 
biguous  assignment  of  the  adsorbate  identity  and 
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structure  to  the  data.  For  example,  we  have  re¬ 
cently  argued  that  water  molecularly  adsorbs  on 
the  Al(lll)  surface  at  low  temperatures  [4,  5], 
while  the  experimental  HREELS  data  was  origi¬ 
nally  interpreted  with  water  dissociation  taking 
place  [6].  Our  argument  was  based  on  ab  initio 
cluster  calculations  where  we  computed  the  geom¬ 
etry  and  vibrational  frequencies  for  the  chemi¬ 
sorbed  water.  We  were  able  to  show  that  the  two 
high-frequency  vibrational  modes  observed  at  3450 
and  3720  cm“^  by  HREELS  could  be  assigned, 
respectively,  to  H-bonded  and  non-H-bonded  OH 
stretches  in  water  [5]  and  did  not  require  the 
presence  of  a  hydroxyl  group  on  the  Al(lll)  sur¬ 
face  as  proposed  earlier  [6]. 

There  have  been  a  large  number  of  ab  initio 
calculations  used  to  predict  vibrational  frequencies 
for  molecules.  Such  calculations  are  powerful  be- 
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cause  they  can  readily  provide  useful  vibrational 
information  about  both  well  and  poorly  character¬ 
ized  molecular  structures.  A  useful  overview  of 
the  accuracy  of  the  vibrational  frequencies  com¬ 
puted  by  Hartree-Fock  (HF),  Moller-Plesset 
(MP2),  and  density  functional  theory  (DFT)  was 
given  recently  by  Scott  and  Radom  [7]  for  a  series 
of  small  polyatomic  molecules  containing  up  to 
four  heavy  atoms.  Ab  initio  vibrational  frequency 
calculations  for  quite  large  molecules  have  also 
been  reported.  For  example,  Pulay  and  co-workers 
recently  published  DFT  calculations  which  com¬ 
puted  vibrational  frequencies  for  free-base  por¬ 
phyrin  [8].  However,  the  computational  require¬ 
ments  of  vibrational  calculations  must  obviously 
scale  rapidly  with  system  size,  making  their  use  in 
analyzing  adsorbates  on  surfaces  challenging. 
Cluster  calculations  are  one  of  the  common  tech¬ 
niques  for  theoretically  investigating  adsorption 
on  surfaces.  The  cluster  approach  assumes  that  the 
dominant  chemical  interactions  are  local  in  nature 
and  tend  to  involve  only  a  few  of  the  substrate 
atoms  close  to  the  adsorbate  [9].  It  is  the  local 
nature  of  the  adsorbate  interactions  that  enables  a 
reduction  in  the  computational  effort  since  only  a 
subset  of  vibrational  modes  of  the  cluster  need  to 
be  calculated.  Apart  from  some  recent  calculations 
involving  relatively  small  clusters  which  have  ob¬ 
tained  adsorbate  vibrational  frequencies  by  explic¬ 
itly  computing  a  second  derivative  matrix  [10,  11], 
the  usual  approach  to  computing  the  adsorbate 
vibrational  frequencies  has  been  by  making  nu¬ 
merical  fits  to  the  variation  in  the  total  energy  of 
the  cluster  as  the  adsorbate  atoms  are  displaced 
along  expected  normal  mode  directions  [12].  This 
energy-fitting  procedure  has  also  been  used  for 
slab  calculations  [13].  The  calculations  treat  the 
substrate  mass  as  being  infinitely  larger  than  that 
of  the  adsorbate,  but,  as  we  show  below,  this  can 
lead  to  large  errors  in  the  adsorbate  frequencies.  In 
addition  to  the  substrate  mass  approximation,  the 
energy-fitting  procedure  requires  some  approxi¬ 
mation  to  the  form  of  the  normal  modes  when 
considering  a  polyatomic  adsorbate.  For  example, 
both  Rosch  and  Bagus  et  al.  obtained  a  stretching 
frequency  for  CO  adsorbed  on  a  surface  by  fixing 
the  CO  center  of  mass  at  the  equilibrium  distance 
above  the  surface  and  fitting  to  the  energy  varia¬ 
tion  with  CO  distance  [12].  To  avoid  making  as¬ 
sumptions  about  the  form  of  the  normal  modes, 
we  have  started  computing  vibrational  frequencies 
for  an  adsorbate  on  a  cluster  by  diagonalizing  the 
mass-weighted  energy  second-derivative  matrix 


computed  for  a  subset  of  atoms  composing  the 
cluster.  In  the  case  of  a  single  adatom,  the  energy-fit 
procedure  and  the  energy  second  derivatives  for 
the  adatom  alone  result  in  identical  adsorbate  vi¬ 
brational  frequencies.  A  disadvantage  of  our  strat¬ 
egy  over  numerical  fits  is  that  the  more  computa¬ 
tionally  demanding  energy  derivatives  now  need 
to  be  calculated.  However,  by  computing  esti¬ 
mates  of  the  energy  second  derivatives  between 
the  adsorbate  and  substrate  and  using  a  frequency 
correction  formula,  originally  proposed  by  Black 
[14],  one  can  more  realistically  incorporate  the 
adsorbate/substrate  mass  ratio  into  the  frequency 
calculation.  The  frequency  correction  formula  also 
provides  a  guide  to  the  spatial  extent  of  the  adsor¬ 
bate  interactions  with  the  substrate  and  provides  a 
convergence  check  on  the  necessary  cluster  size. 

In  this  article,  we  illustrate  strategies  for  com¬ 
puting  more  physically  realistic  adsorbate  vibra¬ 
tional  frequencies  by  considering  H,  O,  and  water 
adsorbates  on  Al  surfaces  and  for  an  Al  adatom  on 
the  Si(lll)  surface.  In  the  next  section,  we  describe 
how  the  cluster  calculations  were  performed  and 
how  the  vibrational  frequencies  were  computed 
from  the  mass-weighted  energy  second-derivative 
matrix  and  the  frequency  correction  formula.  In 
the  third  section  of  the  article,  we  present  results 
and  discuss  the  adsorbate  vibrations  for  each  of 
the  four  systems.  We  first  show  that,  as  expected, 
due  to  the  H  adatoms  having  the  smallest  adsor- 
bate-to-substrate  mass  ratio,  H  adsorption  on  the 
Aid  00)  surface  is  the  most  straightforward  system 
to  theoretically  treat.  We  find  substantial  fre¬ 
quency  correction  terms,  owing  to  the  larger  0.59 
O-to-Al  mass  ratio,  to  occur  when  using  the  en¬ 
ergy  second-derivative  matrix  to  compute  the  vi¬ 
brational  frequencies  for  a  single  O  adatom  on  the 
Aid  11)  surface.  The  size  of  the  O  frequency  correc¬ 
tion  terms  becomes  much  smaller  if  a  larger  sec¬ 
ond  derivative  matrix  is  formed  using  additional 
second  derivatives  from  the  nearby  substrate 
atoms.  We  find  an  AI3O  subcluster  to  produce  O 
vibrations  consistent  with  the  corrected  O-only  fre¬ 
quency  calculation.  We  also  consider  different¬ 
sized  subclusters  in  the  frequency  calculations  for 
water  adsorbed  on  the  AKlll)  surface.  An  impor¬ 
tant  feature  about  the  adsorbed  water  frequency 
calculations  is  that  we  do  not  need  to  make  any 
assumptions  about  the  form  of  the  adsorbate  nor¬ 
mal  modes  since  they  are  given  automatically  by 
the  diagonalization  procedure.  However,  vibra¬ 
tional  calculations  at  the  SCF  level  on  molecules 
are  well  known  to  overestimate  the  frequencies 
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and  we  go  on  to  show  for  the  H20/A1(111)  sur¬ 
face,  using  energy-fitting  procedures,  how  the  nor¬ 
mal  modes  obtained  at  an  initial  level  of  theory 
can  be  used  to  obtain  estimates  of  the  vibrational 
frequencies  at  a  higher  level  of  theory.  In  the 
Al/Si(lll)  system,  we  show  that  the  frequency 
correction  formula  is  still  a  useful  tool  for  identify¬ 
ing  adsorbate  and  substrate  localized  vibrations 
even  though  the  Al-to-Si  mass  ratio  is  0.96  and  the 
adsorbate  frequency  values  are  overlapped  by  vi¬ 
brations  from  the  Si  substrate.  Concluding  remarks 
are  given  in  the  final  section  of  the  article. 


Method 

In  our  previous  work,  we  used  cluster  calcula¬ 
tions  to  simulate  the  H/A1(100)  [15],  0/Al(lll) 
[16,  17],  water/ Al(lll)  [4,  5],  and  Al/Si(lll)  [18] 
surfaces.  We  found  that  the  general  features  of  the 
adsorption  can  often  be  reliably  modeled  by  taking 
the  set  of  central  substrate  atoms  which  interact 
most  strongly  with  the  adsorbate  atoms  and  sur¬ 
rounding  them  by  a  complete  shell  of  neighboring 
substrate  atoms  having  the  same  geometry  as  in 
the  extended  surface.  For  example,  in  our  study  of 
the  H/A1(100)  surface,  we  used  the  Al9(5,4)H  and 
A1i4(8,6)H  clusters,  with  one  and  two  central  A1 
atoms,  to  simulate  H  adsorption  at  the  four-fold 
on-top  and  two-fold  bridging  sites,  respectively, 
where  the  (5,4)  and  (8,6)  notation  refers  to  the 
number  of  substrate  atoms  in  each  cluster  layer. 
We  ignore  edge  effects  for  the  A1  clusters,  while  in 
the  Si  calculations  we  follow  the  usual  practice  of 
terminating  the  Si  dangling  bonds  at  the  cluster 
edges  with  H  atoms.  The  A1  cluster  atoms  are 
constrained  to  appropriate  fee  lattice  positions  us- 

o 

ing  the  bulk  lattice  parameter  of  4.05  A  [19]  except 
in  the  water /Al(l  11)  calculations  where  we  allow 
surface  relaxation  effects  for  the  central  A1  atoms. 
In  our  simulation  of  the  Al/Si(lll)  surface,  we 
found  the  inclusion  of  substrate  relaxation  effects 
to  be  essential  in  order  to  obtain  a  stable  adsorp¬ 
tion  site  which  agrees  with  the  experimentally 
observed  surface  structure  and  we  constrained  the 
outer  Si  atoms  at  their  experimental  Si  bulk  lattice 
positions  [20]. 

All  the  cluster  calculations  were  performed  us¬ 
ing  the  GAMESS  [21]  computer  program.  The 
equilibrium  adsorbate  geometry  is  found  by  par¬ 
tially  optimizing  the  clusters'  total  energy  while 
constraining  most  of  the  substrate  atoms  to  their 


extended  surface  positions  [22].  The  adsorbate  par¬ 
tial  geometry  optimizations  and  vibrational  fre¬ 
quency  calculations  utilize  energy  gradients  and 
these  computations  are  performed  at  the  restricted 
closed  and  open  Hartree-Fock  level.  We  also 
discuss  some  single  geometry  cluster  energies 
computed  using  restricted  second-order  Moller- 
Plesset  theory  [23].  The  Hay  and  Wadt  effective 
core  potential  (ECP)  and  its  double  zeta  valence 
basis  set  [3s3p]/(2s2p)  was  used  for  A1  and  Si 
[24].  In  the  Al/Si(lll)  clusters,  d-polarization 
functions  were  also  added  to  the  basis  sets  for  the 
A1  adatom  and  its  neighboring  Si  atoms.  The  Dun¬ 
ning  and  Hay  split  valence  basis  [4s] /(2  s)  was 
used  for  H  [25].  The  Dunning  and  Hay  split  va¬ 
lence  basis  set  augmented  with  a  d-polarization 
and  a  diffuse  p-function  [9s6pld]/(3s3pld)  was 
used  for  O  [25].  More  complete  details  on  the 
calculations  are  given  in  our  earlier  work  [4,  5, 
15-18]. 

Computing  the  complete  set  of  vibrational  fre¬ 
quencies  for  larger  clusters  rapidly  becomes  a  de¬ 
manding  task  and  would  produce  a  number  of 
vibrational  modes  associated  with  cluster  edge  ef¬ 
fects  which  are  not  strictly  relevant  to  the  experi¬ 
mental  characterization  of  the  adsorbate.  For  this 
reason,  in  both  cluster  and  slab  calculations,  a 
more  economical  approach  is  to  fit  the  cluster 
energy  to  some  potential  energy  surface  for  a  se¬ 
lect  number  of  vibrational  modes  involving  the 
adsorbate  [12].  A  typical  application  of  this  proce¬ 
dure  was  used  by  Jacobsen  et  al.  [13]  in  their  slab 
calculations  to  obtain  O  vibrational  frequencies  v 
on  the  Aid  11)  surface  using  the  quadratic  fitting 
formula 


where  AE  is  the  energy  change  when  the  O  atom 
is  displaced  by  a  distance  Afo  from  the  minimum. 
Jacobsen  et  al.  took  to  be  the  mass  of  the 
adatom  and  made  the  common  approximation  that 
the  substrate  has  infinite  mass.  Another  difficulty 
with  this  fitting  approach  is  deciding  which  direc¬ 
tion  to  take  the  Ai?  displacement  when  dealing 
with  polyatomic  adsorbates  [12].  We  encountered 
this  problem  in  our  water/Al(lll)  cluster  calcula¬ 
tions  and  we  started  to  compute  the  adsorbate 
vibrational  frequencies  and  relevant  normal  modes 
by  diagonalizing  subblocks  of  the  mass-weighted 
energy  second-derivative  matrix  [4,  5]. 
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Forming  the  mass-weighted  energy  second- 
derivative  matrix  either  analytically  or  numeri¬ 
cally  as  finite  differences  of  the  energy  gradients  is 
the  routine  quantum  chemical  technique  for  com¬ 
puting  vibrational  frequencies  of  molecules  [26]. 
However,  to  keep  the  calculations  tractable  and 
since  we  are  only  interested  in  a  subset  of  the 
possible  cluster  frequencies,  we  only  form  the 
mass-weighted  second-derivative  matrix  for  a 
fragment  of  the  atoms  in  the  cluster.  In  block  form, 
the  full  mass-weighted  force  constant  matrix  is 


K  = 


K 

K 


AA 


SA 


Kas] 

Kssj' 


(2) 


where  A  labels  the  adsorbate  set  of  atoms  used  in 
the  second-derivative  matrix  formation  and  S  are 
the  remaining  substrate  atoms  composing  the  clus¬ 
ter.  The  A  subset  may  also  include  some  substrate 
atoms,  but  these  atoms  should  be  chosen  to  pre¬ 
serve  the  symmetry  of  the  adsorbate  vibrational 
modes.  The  f-th  row  of  the  force  constant  matrix  is 
initially  calculated  by  taking  the  finite  difference 
of  the  analytical  gradient  g  vector  by  displacing 
the  atom  with  the  /-th  coordinate  in  the  A  set  to 
give 


normal  modes.  However,  restricting  the  A  set  to 
just  adsorbate  atoms  means  that  the  frequency 
calculations  are  still  treating  the  substrate  as  hav¬ 
ing  infinite  mass.  Furthermore,  these  frequency 
calculations  do  not  give  any  indication  as  to 
whether  any  significant  interactions  between  the 
adsorbate  and  the  substrate  are  being  neglected. 

However,  by  calculating  energy  gradients  and 
forming  the  off-diagonal  block  enables  an 
estimate  to  the  convergence  of  the  frequency  calcu¬ 
lations  to  be  made.  Transforming  the  mass- 
weighted  second-derivative  matrix  K  so  that  the 
separate  adsorbate  and  surface  blocks  are  diago¬ 
nalized 


-  I  co^  K.s\ 

K  =  .  (4) 

^  Ksa  / 

where  ideally  the  eigenvalues  would  corre¬ 
spond  to  surface  phonons,  the  off-diagonal  force 
constant  matrix  block  K^g  should  be  small  when 
the  adsorbate  vibrational  frequencies  are  strongly 
localized  and  decoupled  from  the  substrate.  Fol¬ 
lowing  an  approach  originally  suggested  by  Black 
[14],  an  estimate  of  the  corrected  adsorbate  vibra¬ 
tional  frequencies  can  be  obtained  from 


1  d^E  1 

_ _ _ _ _ 

where  Ag  ==  giAqj)  —  g(0)  and  is  the  mass  of 
the  nucleus  associated  with  coordinate  i.  Equation 
(3)  can  also  be  written  in  terms  of  taking  ±Aqj 
displacements  so  that  a  more  symmetrical  force 
constant  matrix  is  produced.  Similarly,  when  i  and 
j  are  both  elements  of  A,  an  improved  force  con¬ 
stant  block  should  result  from  averaging  K^j  and 
Kji.  Equation  (3)  also  facilitates  making  an  esti¬ 
mate  of  the  K^g  force  constant  block  provided  that 
the  gradient  vector  g  is  computed  for  all  the  atoms 
in  the  cluster. 

Once  the  K^a  block  of  the  force  constant  matrix 
is  formed,  the  adsorbate  vibrational  frequencies  co 
are  obtained  by  diagonalizing  As  expected, 
we  find  for  a  single  atom  adsorbate  the  K^a  diag- 
onalization  and  the  curve-fitting  procedure  im¬ 
plied  by  Eq.  (1)  produce  identical  frequencies.  For 
polyatomic  adsorbates,  such  as  water  on  the 
Al(lll)  surface,  the  diagonalization  of  the  adsor¬ 
bate  force  constant  block,  produces  both  the 
adsorbate  vibrational  frequencies  and  the  relevant 


(co^f  =  +  E 


;  Cof-nj 


- 


+  ^E(ie.s),;.  (5) 


where  the  approximation  in  the  last  formula  as¬ 
sumes  that  CO  I  (1^,  the  maximum  phonon  fre¬ 
quency  of  the  substrate.  Equation  (5)  can  be  de¬ 
rived  by  applying  a  simple  perturbation  theory 
argument  to  Eq.  (4)  or  by  using  the  more  elaborate 
continued  fraction  technique  originally  used  by 
Black  [14].  Bulk  A1  has  a  maximum  phonon  fre¬ 
quency  of  approximately  300  cm"^  [27],  suggest¬ 
ing  that  only  small  corrections  should  be  found  for 
high  adsorbate  frequency  modes.  Much  higher- 
frequency  phonon  modes  exist  for  the  Al/Si(lll) 
surface  but  we  still  find  Eq.  (5)  useful  because  it 
provides  a  measure  of  how  small  the  off-diagonal 
K^s  block  is  becoming.  In  the  next  section,  we 
demonstrate  how  Eq.  (5)  generally  serves  as  a 
useful  guide  to  the  convergence  and  reliability  of 
the  frequency  values  computed  for  surface  vibra¬ 
tions. 
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Results  and  Discussion 

H/AL(100)  RESULTS 

In  Table  I,  we  present  the  vibrational  frequen¬ 
cies  for  H  adsorption  at  an  on-top  and  a  bridging 
site  on  clusters  which  simulate  the  Al(lOO)  surface. 
EELS  experiments  by  Paul  found  bands  at  1750 
and  1125  cm~^  which  were  assigned  to  H  vibra¬ 
tions  at  the  on-top  and  bridging  sites  [28],  The 
general  EELS  selection  rule  is  that  only  adsorbate 
vibrations  normal  to  the  surface  are  observed. 
However,  it  should  be  noted  the  EELS  band  at 
1750  cm“^  may  not  be  simply  associated  with  H 
adsorbed  at  an  on-top  site  since  the  intensity  band 
grows  simultaneously  with  a  peak  at  750  cm”h  In 
our  earlier  cluster  calculations  [15],  by  assuming 
the  form  of  the  H  atom  normal  modes  and  using 
energy  fits,  we  argued  the  origin  of  the  1750  and 
750  cm”^  peaks  to  be  due  to  two  H  at  neighboring 
on-top  sites.  However,  by  following  the  procedure 
outlined  in  this  article  and  obtaining  the  adsorbate 
normal  modes  by  diagonalizing  the  mass-weighted 
second-derivative  matrix  for  the  2  H  atoms,  we 
found  a  strong  coupling  between  the  surface  nor¬ 


mal  and  parallel  H  modes.  This  surface  normal 
and  parallel  modes  coupling  contradicts  our  ear¬ 
lier  assumption  and  makes  our  previous  analysis 
no  longer  valid.  At  present,  we  do  not  fully  under¬ 
stand  the  origin  of  the  two  peaks  at  1750  and  750 
cm“^  and  more  work  is  in  progress.  Nevertheless, 
as  indicated  in  Table  I,  our  computed  surface  nor¬ 
mal  vibrations  are  in  very  good  agreement  with 
the  two  high-frequency  bands  observed  in  the  EELS 
experiments.  Hjelmberg,  in  substrate  perturbed  jel- 
lium  calculations,  also  computed  1694  and  1048 
cm~^  for  the  on-top  and  bridge  sites,  respectively 
[29].  As  expected,  due  to  the  small  H-to-Al  mass 
ratio,  we  find  the  frequencies  corrected  using  Eq. 
(5)  to  differ  only  slightly  from  their  uncorrected 
values.  The  largest  corrections  occur  for  the  H  x 
and  y  surface  parallel  modes  at  the  on-top  site 
where  Eq.  (5)  is  not  strictly  valid  since  the  uncor¬ 
rected  frequency,  233  cm“\  is  less  than  the  A1 
maximum  phonon  frequency.  Our  H/A1(100)  clus¬ 
ter  calculations  demonstrate  that  vibrational  fre¬ 
quencies  for  adsorbates  with  low  masses  can  be 
reliably  obtained  using  total  energy-fitting  proce¬ 
dures  provided,  of  course,  the  correct  normal  mode 
associated  with  the  adsorbate  is  known. 


TABLE  I _ 

Vibrational  frequencies  computed  using  clusters  which  simulate  the  H/AI(100)  and  0/AI(111)  surfaces;  the 
z  direction  is  normal  to  the  surface  and  all  frequencies  are  in  cm 


Adsorption  site  —  X  distance  Vibrational  frequency 


and  cluster 

(A) 

Uncorrected 

Corrected 

Assignment 

On-top 

1.630 

1737 

1767 

H  z 

Al9(5,4)H 

233 

289 

H  X 

233 

289 

Hy 

Bridge 

1.832 

1095 

1106 

H  z 

AIi4(8,6)H 

778 

793 

H  X  in-plane 

526 

0  only 

532 

H  y  out-of-plane 

Three-fold  open 

1.788 

595 

708 

Ox 

AIi8(12,6)0 

594 

703 

Oy 

393 

AI3O 

577 

Oz 

713 

717 

Ox 

712 

716 

Oy 

607 

610 

0  z 

213 

280 

AI3O  X  frus.  tran. 

211 

281 

AI3O  y  frus.  tran. 

205 

334 

AI3O  z  frus.  tran. 
Al  —  Al  vibrations 
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0/ Aid  11)  RESULTS 

The  investigation  of  O  adsorption  on  the  Al(lll) 
surface  has  been  a  surprising  challenge  to  surface 
science  [30].  HREELS  experiments  at  low  O  cover¬ 
age  find  peaks  in  two  regions  at  545-650  and 
800-850  cm“^  [31-34].  In  agreement  with  the  re¬ 
cent  work  of  others  [35-37],  we  have  performed 
cluster  calculations  which  show  the  545-650  cm"^ 
HREELS  band  to  correlate  with  normal  vibrations 
of  O  above  the  surface  at  threefold  open  or  fee 
sites  and  the  800-850  cm“^  band  to  correlate  with 
symmetric  combinations  of  horizontal  vibrations 
from  two  or  more  O  at  neighboring  threefold  fee 
sites  which  become  HREELS-allowed  [17].  In  Table 
I,  we  list  results  for  the  frequency  calculations  for 
a  single  O  atom  on  the  Alig(12,6)  cluster.  The 
uncorrected  frequencies  are  obtained  by  diagonal¬ 
izing  the  mass-weighted  energy  second-derivative 
matrix  formed  for  O  only  and  for  the  AI3O  sub¬ 
cluster  consisting  of  O  and  its  three  nearest  A1 
neighbors.  The  uncorrected  frequency  calculations 
ignore  the  O-to-Al  mass  ratio  of  0.59  and  treat  the 
substrate  mass  as  infinite  to  produce  an  O  surface 
normal  vibration  in  poor  agreement  with  the  ex¬ 
perimental  values.  Similar  O  vibrational  results 
were  obtained  by  Jacobsen  et  al.  using  the  numeri¬ 
cal  fitting  procedure  given  by  Eq.  (1)  in  their  slab 
calculations  [13].  For  a  slab  with  a  0.25  O  mono- 
layer  coverage,  Jacobsen  et  al.  computed  411  and 
427  cm"^  for  the  surface  normal  and  parallel  O 
vibrations.  Their  surface  normal  vibration  411  cm” ^ 
value  matches  very  closely  our  O  only  result.  By 
performing  the  frequency  calculations  with  the 
mass-weighted  energy  second-derivative  matrix 
formed  for  AI3O  subcluster,  the  O  vibrations  are 
shifted  to  much  higher  frequency  and  we  obtain  a 
surface  normal  O  mode  which  is  much  more  con¬ 
sistent  with  the  HREELS  values.  The  O-only  fre¬ 
quencies  after  correction  with  Eq.  (5)  are  in  very 
close  agreement  with  the  three  high  frequencies 
computed  for  the  AI3O  subcluster.  Diagonalizing 
the  AI3O  second-derivative  matrix  results  in  12 
frequencies  altogether;  we  have  listed  only  the  six 
highest  frequencies  in  Table  I  since  the  nine  low- 
frequency  modes  are  mostly  associated  with 
Al — Al  vibrations  and  the  frequency  correction 
formula  [Eq.  (5)]  is  no  longer  valid.  Further  sup¬ 
port  that  the  AI3O  subcluster  results  in  more  real¬ 
istic  frequencies  than  the  O-only  calculations  comes 
from  Eq.  (5),  which  corrects  the  three  high- 
frequency  modes  by  4  cm”^  or  less.  These  small 
AI3O  subcluster  frequency  corrections  suggest  that 


the  most  important  surface  interactions  responsi¬ 
ble  for  O  vibrations  are  between  the  O  and  the 
three  neighboring  Al  atoms  and  that  the  adsorbate 
frequency  values  are  essentially  converged  when 
the  second-derivative  matrix  formed  for  the  AI3O 
subcluster  is  used.  We  have  tested  this  frequency 
convergence  idea  further  by  performing  frequency 
calculations  for  larger  subcluster  sizes  for  O  ad¬ 
sorbed  on  a  single  layer  Al^g  cluster  having  a 
central  Al^  subcluster  surrounded  by  all  the  neigh¬ 
boring  Al  atoms  in  the  Al(lll)  surface  layer.  We 
verified  that  the  O  frequencies  computed  with  an 
AI3O  subcluster  are  converged  by  obtaining  the 
same  O  frequencies  with  the  Al^O  subcluster  [17], 

To  summarize,  these  cluster  calculations  which 
simulate  the  0/Al(lll)  surface  demonstrate  that 
caution  is  needed  with  energy-fitting  procedures 
to  obtain  adsorbate  vibrational  frequencies  when 
the  adsorbate  to  substrate  mass  ratio  is  significant. 
The  O-only  frequency  calculation  shows  that  use¬ 
ful  estimates  of  the  adsorbate-to-substrate  mass 
effect  can  be  obtained  from  the  correction  formula 
given  by  Eq.  (5).  Equation  (5)  corrections  to  the 
adsorbate  frequencies  become  small  when  the  sub¬ 
cluster  A  block  includes  all  the  dominant  adsor¬ 
bate-to-substrate  interactions. 

H^O/Aldll)  RESULTS 

Table  II  summarizes  the  results  of  frequency 
calculations  for  water  adsorbed  at  an  on-top  site 
on  the  Al(lll)  surface  using  the  Ali5(10,5)H2O 
cluster  shown  in  Figure  1.  The  Aii5(10,5)  cluster 
consists  of  two  central  Al  atoms  surrounded  by  the 
complete  set  of  13  nearest  neighbors  from  the  first 
and  second  layers  of  the  (111)  surface  of  the  Al  fee 
lattice.  We  used  the  Al^g  cluster  in  our  earlier 
work  to  explore  water  adsorption  at  the  on-top 
and  bridging  sites  and  found  the  on-top  site  to  be 
preferred.  The  frequency  calculations  are  per- 
jformed  by  diagonalizing  the  mass-weighted  en¬ 
ergy  second-derivative  matrix  for  H2O  only  and 
the  AI2H2O  subcluster.  Not  surprisingly,  because 
of  their  weak  interaction  with  Al  surface,  the  fre¬ 
quencies  for  OH  stretches  and  H — O — H  bending 
mode  are  the  same  for  both  the  H2O  only  and 
AI2H2O  subcluster.  A  Mulliken  population  analy¬ 
sis  does  show  charge  transfer  from  the  water  H 
atoms  to  the  Al  surface,  which  is  probably  respon¬ 
sible  for  the  adsorbed  OH  stretches  being  red- 
shifted  from  the  4237  and  4102  cm”^  stretch  fre¬ 
quencies  computed  for  free  water  using  the  same 
basis  set  [4,  5].  The  free-water  bending  frequency 


832 


VOL  65,  NO.  5 


VIBRATIONAL  FREQUENCIES  FOR  ADSORBATES 


TABLE  II _ 

Vibrational  frequencies  for  H2O  adsorbed  at  an  on-top  site  on  the  Al(1 11)  surface  using  the 
AIi5(10,5)H2O  cluster. 


H2O  only  AI2H2O  MP2 


Mode 

Uncorrected 

Corrected 

Uncorrected 

Corrected 

Frequency 

5  (A) 

Assignments 

1 

4174 

4174 

4174 

4174 

4186 

0.001 

Antisym.  OH  str. 

2 

4053 

4053 

4053 

4053 

3749 

0.018 

Sym.  OH  stretch 

3 

1800 

1800 

1800 

1800 

1606 

0.001 

H  — 0  — H  bend 

4 

575 

583 

584 

584 

620 

0.099 

In-plane  H2O  rock 

5 

243 

298 

317 

319 

232 

0.021 

Al  —  0  stretch 

6 

295 

302 

In-plane  AI"-AI  str. 

7 

200 

220 

236 

352 

HgO  trust,  rot. 

8 

203 

247 

HgO  trust,  rot. 

9 

185 

250 

HgO  trust,  rot. 

10 

166 

266 

Al  mode 

11 

138 

252 

Al  — Al  trans. 

12 

119 

146 

z  trust,  trans. 

13 

91 

156 

84 

191 

HgO  trust,  rot. 

14 

84 

121 

76 

98 

X  trust,  trans. 

15 

61 

279 

57 

404 

y  trust,  trans. 

The  uncorrected  frequencies  are  computed  by  using  HgO  only  and  the  AI2H2O  subcluster  at  the  Hartree-Fock  level.  Estimates  of 
MP2  frequencies  are  obtained  by  performing  numerical  fits  along  the  normal  modes  computed  for  the  H20-only  calculations.  The 
shift  in  the  MP2  minimum  from  the  original  SCF  geometry  along  each  normal  mode  is  given  by  d.  All  frequencies  are  in  cm^L 


treated  with  the  AI2H2O  subcluster.  Mode  5  corre- 
spends  to  the  A1 — O  stretching  mode  and  Eq.  (5) 
in  the  H20-only  frequency  calculation  produces  a 
fairly  large  correction  term  since,  again,  we  are  not 
treating  the  substrate  mass  correctly.  The  corrected 
frequency  from  the  AI2H2O  subcluster  calculation 
shows  that  mode  5  is  now  probably  converged, 
even  though  Eq.  (5)  should  not  be  expected  to  be 
very  accurate  for  such  a  low-frequency  value.  In 
our  earlier  work  [5],  we  assigned  the  water  sym¬ 
metric  stretch,  the  bending  frequency,  the  in-plane 
rock,  and  the  A1 — O  stretch  to  the  HREELS  peaks 
experimentally  observed  at  3770,  1645,  775,  and 
325  cm“^  respectively,  for  the  submonolayer  cov¬ 
erage  with  5.4  X  water  molecules/cm^  [6]. 

The  frequency  correction  formula  [Eq.  (5)]  serves 
as  a  useful  check  on  the  convergence  of  the  ad¬ 
sorbed  water  frequencies  on  the  Al(lll)  surface. 
The  small  frequency  corrections  found  for  the  four 
highest-frequency  modes  could  be  used  as  an  ar¬ 
gument  that  numerical  fitting  procedures  would 
also  give  adequate  frequencies  for  the  adsorbed 
water.  However,  gradient  information  is  needed 
before  the  argument  for  simply  performing  numer¬ 
ical  fit  calculations  can  be  made.  Furthermore,  the 
numerical  fitting  procedure  would  require  an  as¬ 
sumed  form  for  the  adsorbed  water  normal  modes 


at  1802  cm”^  is  unaffected  by  the  surface  adsorp¬ 
tion.  Relatively  small  differences  between  the  un¬ 
corrected  and  corrected  frequencies  start  to  appear 
for  modes  4  and  5.  Mode  4  is  a  frustrated  water 
rotation  where  the  water  does  an  in-plane  rock 
above  the  A1  atom  and  the  small  frequency  correc¬ 
tion  from  Eq.  (5)  suggests  that  it  is  more  correctly 


FIGURE  1.  The  Ali5(10,5)H2O  cluster  with  the  water 
preferentially  adsorbed  atop  an  Al  atom.  This  cluster 
geometry  was  obtained  by  a  partial  geometry 
optimization  which  included  the  water  and  the  two  central 
Al  cluster  atoms. 
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and  the  advantage  of  our  approach  is  that  we 
avoid  making  any  assumptions  by  diagonalizing 
the  mass-weighted  energy  second-derivative  ma¬ 
trix  obtained  at  the  SCF  level.  Unfortunately,  SCF 
calculations  are  well  known  to  overestimate  vibra¬ 
tional  frequencies  of  molecules  by  10-13%  [7].  The 
trends  in  the  SCF  frequencies  computed  for  adsor¬ 
bates  on  surfaces  has  not  been  rigorously  quanti¬ 
fied,  but  one  way  to  obtain  improved  frequency 
values  is  to  use  the  SCF  normal  modes  with  small 
Eq.  (5)  corrections  as  displacement  directions  for 
numerical  fits  to  the  cluster  energies  calculated  at 
a  higher  level  of  theory.  In  Table  II,  we  list  the 
adsorbed  water  frequencies  obtained  by  fitting  to 
the  MP2  energy  variation  along  the  normal  modes 
calculated  in  the  Fl20  only  SCF  calculations.  We 
do  find  the  MP2  symmetric  OH  stretch  and  H — O 
— H  bending  frequencies  to  be  in  much  closer 
agreement  with  the  HREELS  vibrations  observed 
at  3770  and  1650  cm“k  The  parameter  8  in  Table 
II  gives  the  displacement  along  each  normal  mode 
between  the  SCF  and  MP2  minimum  energies  and 
this  gives  some  indication  as  to  whether  the  opti¬ 
mized  MP2  adsorbate  geometry  is  significantly 
different  from  the  SCF  minimum.  We  find  that  the 
largest  8  displacement  occurs  for  the  in-plane  H2O 
rock  mode.  Hence,  for  the  modes  having  small  8 
values,  the  relatively  straightforward  SCF  vibra¬ 
tional  calculation  combined  with  Eq.  (5)  provides 
some  justification  for  obtaining  the  water  on 
Al(lll)  vibrational  frequencies  using  numerical  fits 
at  the  MP2  level  without  having  to  perform  the 
more  computationally  demanding  MP2  energy- 
gradient  calculation. 

Al/Sidll)  RESULTS 

The  interaction  of  A1  with  the  Si  surface  has 
been  extensively  investigated  both  experimentally 
and  theoretically,  and  for  a  more  extensive  review 
of  the  literature,  we  refer  the  reader  to  our  recent 
cluster  calculations  for  this  surface  [18].  On  sur¬ 
faces  with  up  to  I  ML  A1  coverage,  scanning 
tunneling  microscopy  experiments  [38]  and  slab 
calculations  [39]  indicate  that  the  adatom  preferen¬ 
tially  adsorbs  at  a  site  where  the  A1  bonds  to 
three  first-layer  Si  atoms  and  is  directly  above  a  Si 
atom  from  the  second  layer  rather  than  the  H3  site 
where  the  second  layer  Si  atom  is  absent.  Surpris¬ 
ingly,  cluster  calculations  at  the  Hartree-Fock  level 
found  A1  adsorption  at  the  site,  where  the  A1 
atom  is  adsorbed  atop  a  single  Si  atom,  to  be  more 
stable  than  is  adsorption  at  either  of  the  T4  or  H3 


sites.  The  A1  adsorption  induces  a  Si(lll)  surface 
relaxation  and  our  work  shows  these  relaxation 
effects  can  be  simulated  providing  that  the  cluster 
is  large  enough  and  the  energy  is  computed  at 
beyond  the  Hartree-Fock  level.  We  were  able  to 
show  that  the  largest  relaxation  effects  occur  at  the 
T4  site,  resulting  in  an  A1  adsorption  energy  5 
kcal/mol  more  stable  than  adsorption  at  the  H3 
site  and  considerably  more  stable  than  adsorption 
at  the  T4  site.  However,  we  only  obtained  this 
result  for  the  site  after  the  partial  geometry 
optimization  of  the  AlSis  subcluster  in  the 
AlSi26H24  cluster  and  computing  MP2  energy  cor¬ 
rections  [18].  Some  of  the  previous  cluster  calcula¬ 
tions  have  also  attempted  to  calculate  A1  adsorp¬ 
tion  frequencies  even  though  the  Al-to-Si  mass 
ratio  is  0.96  [40],  Kelly  et  al.  in  HREELS  experi¬ 
ments  on  the  Al/Si(lll)  surface  found  peaks  at 
525  and  450  cm“^  [41].  Kelly  et  al.  noted  that  their 
interpretation  of  the  two  peaks  is  complicated  by 
the  closeness  of  a  peak  due  to  the  clean  Si(lll)2  X  1 
surface  and  by  the  similarity  of  the  Al  and  Si 
masses.  To  clarify  the  origin  of  the  525  and  450 
cm"^  peaks,  Kelly  et  al.  also  investigated  In  ad¬ 
sorption  on  the  Si  surface  and  again  found  peaks 
at  525  and  450  cm“^  suggesting  that  the  Al  ad¬ 
sorption  results  in  two  new  peaks  due  to  localized 
substrate  phonons. 

To  explore  the  limits  of  calculating  adsorbate 
vibrational  frequencies,  we  investigated  comput¬ 
ing  the  Al — Si  vibrations  using  the  partially  opti¬ 
mized  Si22H2iAl  and  Si26H24Al  clusters  described 
previously  [18]  which  simulate  Al  adsorption 
atop  the  Tj  and  threefold  T4  sites,  respectively. 
Since  previous  cluster  calculations  have  suggested 
that  the  preferred  Al  adsorption  is  at  either  the 
or  the  T4  sites,  we  did  not  perform  vibrational 
calculations  on  the  H3  site  [18].  The  partially  opti¬ 
mized  Tj  Si4Al  and  T4  Si5Al  subcluster  fragments 
from  the  larger  clusters  are  shown  in  Figure  2.  In 
Tables  III  and  IV,  we  present  the  Hartree-Fock 
vibrational  frequencies  obtained  by  diagonalizing 
the  mass-weighted  energy  second-derivative  ma¬ 
trix  for  differently  sized  subclusters  at  the  two 
sites.  Tables  III  and  IV  show  that  the  Si  phonon 
frequencies  overlap  the  Al — Si  vibrations  and  the 
approximate  form  of  Eq.  (5)  is  no  longer  valid,  and 
for  this  reason,  we  list  a  A  value  equaling  the 
amount  of  the  correction  which  would  be  obtained 
if  Eq.  (5)  was  still  correct.  Our  Al-only  frequency 
calculations  give  essentially  the  same  values  as 
obtained  by  Illas  et  al.  who  obtained  237  cm~^  at 
the  and  313  cm“^  at  T4  sites  [40].  In  the  fre- 
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FIGURE  2.  The  Si4AI  and  SigAl  subclusters  used  in  the  frequency  calculations  for  the  T,  and  sites.  The  atom  labels 
S\^,  Sib,  and  Slg  correspond  to  the  central,  bulklike,  and  surface  Si  atoms  referenced  in  Tables  ill  and  IV. 


quency  calculation  for  the  site  using  the  Si4Al 
subcluster,  the  surface  normal  A1 — Si  stretching 
frequency  shifts  to  335  cm"\  as  predicted  by  Eq. 
(5).  However,  this  shift  agreement  has  to  be  fortu¬ 
itous  since  A  is  still  77  cm"^  for  the  new  stretch 


frequency  value.  Although  we  do  not  expect  Eq. 
(5)  to  be  very  reliable  for  the  Al/Si(lll)  surface, 
the  correction  factors  A  do  still  serve  as  a  useful 
guide  to  whether  the  frequencies  are  converged. 
The  three  highest  frequencies  have  the  smallest  A 


TABLE  III _ 

Vibrational  frequencies  for  A!  adsorption  at  the  T,  site  on  the  Si(111)  surface  using  the  Si22H2iAf  cluster. 


Al  only 

Si4AI 

Mode 

Frequency 

A 

Frequency 

A 

Assignments 

1 

509 

26 

2 

508 

26 

y  ^Icen 

3 

466 

31 

4 

425 

69 

Sibulk 

5 

425 

69 

^•bulk 

6 

404 

75 

S^bulk 

7 

400 

81 

^ibulk 

8 

368 

85 

^•bulk 

9 

368 

85 

S'bulk 

10 

239 

96 

335 

77 

Al  —  Si  z  str. 

11 

233 

174 

12 

233 

174 

13 

148 

100 

Al  —  Si  z  trans. 

14 

51 

49 

50 

59 

Al  X 

15 

51 

49 

50 

59 

Aly 

The  uncorrected  frequencies  are  computed  by  using  Al  only  and  the  Si4AI  subcluster.  All  frequencies  are  in  cm  ''  and  A  is 
frequency  correction  from  Eq.  (5). 
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TABLE  IV _ 

Vibrational  frequencies  for  Al  adsorption  at  the  site  on  the  Si(111)  surface  using  the  8126^24 Al  cluster. 


Mode 

Al  only 

Si4AI 

SisAI 

Assignments 

Frequency 

A 

Frequency 

A 

Frequency 

A 

1 

585 

39 

641 

7 

^Len  ^*bulk  ^1'^* 

2 

484 

21 

485 

23 

Sicen— Sisurf  ^  Str. 

3 

484 

21 

485 

23 

^Len  ^Lurf  Y 

4 

454 

99 

S'bulk  ^ 

5 

453 

98 

sibuik  y 

6 

426 

42 

444 

49 

^Lurf 

7 

416 

64 

416 

64 

S'surf 

8 

416 

64 

416 

64 

Sisurf 

9 

371 

80 

Sibulk  ^ 

10 

369 

81 

369 

80 

Sisurf 

11 

337 

23 

336 

24 

Sisurf 

12 

337 

23 

336 

24 

Sisurf 

13 

317 

85 

308 

26 

306 

21 

Al— Sicen  ^  Str. 

14 

212 

144 

210 

146 

Sicen 

15 

212 

144 

210 

146 

Sicen 

16 

260 

95 

196 

76 

196 

76 

X  Al 

17 

259 

95 

196 

76 

196 

76 

yAl 

18 

231 

128 

184 

109 

Al— Sicen  z  trans. 

The  uncorrected  frequencies  are  computed  by  using  Al  only  and  the  Si4AI  and  SigAI  subclusters.  All  frequencies  are  in  cm  ^  and 
A  is  frequency  correction  from  Eq.  (5). 


correction  factors  in  Table  III  and  correspond  to 
vibrational  modes  for  the  central  Si  atom  directly 
underneath  the  Al  adatom  which  is  also  bonded  to 
the  three  bulklike  Si  atoms  in  the  Si4  Al  subcluster. 
The  three  bulk  Si  atoms  in  the  Si4Al  subcluster 
each  have  three  nearest  neighbors  not  included  in 
the  Si4Al  subcluster,  and  as  a  result,  their  A  values 
are  larger.  It  is  worth  noting  that  the  surface  nor¬ 
mal  vibration  for  the  central  Si  atom  at  466  cm“^ 
agrees  remarkably  well  with  the  450  cm”^  peak 
observed  in  the  HREELS  experiments  by  Kelly 
et  al.  [41]. 

Similar  trends  in  the  frequency-correction  term 
A  are  found  in  Table  IV  for  the  T4  site.  The  Si4Al 
subcluster  does  not  include  the  Si^uik  atom  from 
the  third  layer  of  the  Si(lll)  surface.  From  both  the 
Si4  Al  and  Si5Al  subclusters,  the  Al  adatom  surface 
normal  vibration  appears  to  be  converging  to  a 
frequency  of  306  cm“^  and  is  not  that  much  differ¬ 
ent  from  the  frequency  calculated  in  the  Al-only 
calculation.  The  convergence  of  the  highest- 
frequency  mode,  which  is  associated  with  the  cen¬ 
tral  Si  directly  underneath  Al  and  above  the  Si^uik 
atom,  is  more  sensitive  to  the  subcluster  size  used 
and  improves  tremendously  on  going  to  the  Si5Al 
subcluster.  We  suspect  that  this  surface  normal 


highest-frequency  mode  could  be  responsible  for 
the  525  cm“^  peak  observed  in  HREELS,  although 
our  SCF  frequency  value  is  much  too  large.  The 
small  A  value  computed  for  this  mode  suggests 
that  it  might  be  valid  to  use  the  Si5Al  subcluster 
normal  modes  to  perform  a  numerical  fit  to  the 
MP2  energy  and  this  work  is  in  progress.  Overall, 
these  Al/Si(lll)  cluster  calculations  show  that 
while  Eq.  (5)  is  not  valid  for  computing  adsorbate 
frequency  correction  terms,  Eq.  (5)  can  still  serve 
as  a  useful  guide  to  the  convergence  of  the  fre¬ 
quency  of  surface  vibrations. 


Conclusions 

We  have  presented  a  general  strategy  for  com¬ 
puting  vibrational  frequencies  for  adsorbates  on 
surfaces.  Rather  than  use  numerical  fitting  meth¬ 
ods,  we  advocate  using  a  scheme  which  involves 
energy-gradient  calculations.  The  initial  vibra¬ 
tional  frequencies  are  obtained  by  diagonalizing  a 
mass-weighted  energy  second-derivative  matrix 
formed  for  a  subcluster  of  atoms  from  the  surface. 
The  diagonalization  avoids  having  to  make  as¬ 
sumptions  on  the  form  of  the  adsorbate  normal 
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modes.  Furthermore,  the  energy  gradients  facili¬ 
tate  making  estimates  of  the  influence  of  the 
adsorbate/substrate  mass  ratio  through  the 
frequency-correction  formula  given  by  Eq.  (5). 
Equation  (5)  produces  only  small  frequency- 
correction  terms  for  the  H/A1(100)  surface  since 
the  H-to-Al  substrate  mass  ratio  is  small.  For  the 
O/ Aid  11)  surface,  we  find  larger  frequency  cor¬ 
rections,  and  these  are  accurate  providing  that  the 
frequency  value  of  interest  is  greater  than  the 
maximum  phonon  frequency  of  the  substrate. 
However,  as  we  show  for  H20/A1(111)  and 
Al/Si(lll)  surfaces,  even  when  the  surface  vibra¬ 
tion  frequency  does  not  exceed  the  maximum  sub¬ 
strate  phonon  frequency  value,  Eq.  (5)  still  serves 
as  a  useful  guide  for  checking  if  the  frequency  of  a 
surface  normal  mode  is  converged.  Once  the  fre¬ 
quency  of  surface  normal  mode  has  been  shown  to 
converge,  one  can  then  resort  to  numerical  fitting 
methods  and  obtain  the  adsorbate  vibrational  fre¬ 
quency  at  a  higher  level  of  theory  where  the  gradi¬ 
ent  calculation  is  more  prohibitively  expensive. 
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ABSTRACT:  Systematic  theoretical  studies  for  the  mechanisms  of  the  epoxidation  and 
complete  oxidation  of  ethylene  and  propylene  over  silver  surface  as  well  as  the  reactivity 
and  the  stability  of  oxygen  species  on  Cu,  Ag,  and  Au  surfaces  have  been  presented.  The 
dipped  adcluster  model  (DAM)  combined  with  the  ab  initio  Hartree-Fock  (HF),  second- 
order  Moller-Plesset  (MP2),  and  SAC /SAC -Cl  (symmetry-adapted  cluster /configuration 
interaction)  methods  are  used.  These  studies  clarify  the  origin  of  silver  as  a  unique 
effective  catalyst  for  the  epoxidation  of  ethylene  and  the  different  mechanisms  for  the 
oxidation  of  olefins  over  silver  surface.  For  the  epoxidation  of  ethylene,  the  superoxide 
O/,  which  is  molecularly  adsorbed  in  bent  end-on  geometry  on  the  silver  surface,  is  the 
active  species.  The  origin  of  the  unique  catalytic  activity  of  silver  for  the  epoxidation  of 
ethylene  is  due  to  its  ability  to  adsorb  oxygen  as  the  superoxide  species.  Such  an 
adsorbed  species  cannot  be  stable  or  exist  on  Cu  and  Au  surfaces.  For  the  oxidation  of 
propylene,  both  reaction  mechanisms  initiated  by  the  activation  of  olefinic  carbon  and  by 
the  activation  of  the  allyl  hydrogen  exist.  The  activation  of  the  allyl  hydrogen  is  the 
origin  of  the  complete  oxidation  of  some  olefins  over  silver  surface.  The  present  results 
not  only  let  us  have  a  better  understanding  of  the  reaction  mechanisms  of  olefins  over 
silver  surface  but  also  supply  a  basic  idea  for  the  new  catalyst  design  of  the  epoxidation 
reaction.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  839-855,  1997 
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Introduction 


The  heterogeneous  selective  oxidation  of 
olefins  to  epoxides  on  silver 


:c=c: 

olefin 


_j - Q 


Ag 


^c-c^ 
\  / 
o 

epoxide 


(1) 


is  an  exceedingly  important  industrial  catalytic 
reaction,  and  as  such  has  received  long  and  exten¬ 
sive  study  [1-7].  One  particularly  interesting  as¬ 
pect  of  the  epoxidation  reaction  is  its  uniqueness: 
Not  only  is  silver  a  uniquely  effective  catalyst  for 
heterogeneous  epoxidation  but  also  ethylene  is  the 
only  hydrocarbon  which  may  be  epoxidized  with 
high  selectivity.  Actually,  silver  catalysts  are  used 
in  industry  to  produce  several  million  tons  of 
ethylene  oxide  yearly  [1-4].  Recent  experimental 
studies  show  that,  except  for  ethylene,  only  a  few 
olefins  such  as  styrene  [8,  9],  3,3-dimethylbutene 
[10],  norbornene  [11],  and  butadiene  may  be  epoxi¬ 
dized  over  silver  with  high  selectivity.  In  contrast, 
other  olefins  such  as  propylene,  butenes,  and 
pentenes  are  epoxidized  over  silver  but  with  ex¬ 
tremely  low  selectivity  [12-16]. 

During  the  past  20  years,  many  studies  [12-24] 
have  been  devoted  to  getting  a  better  understand¬ 
ing  of  the  epoxidation  reaction  mechanism.  How¬ 
ever,  the  origin  of  this  unique  catalytic  activity  of 
silver  has  not  yet  been  clarified,  and  also  the 
mechanistic  details  of  the  epoxidation  and  that  of 
the  competing  combustion  to  carbon  dioxygen  and 
water  are  far  from  understood.  One  of  the  main 
questions  about  the  mechanism  of  ethylene  oxida¬ 
tion  is  the  roles  of  the  adsorbed  molecular  and 
atomic  oxygen.  Both  mechanisms  involving  molec¬ 
ular  oxygen  [12-18]  and  atomic  oxygen  [19-23]  as 
the  active  species  have  been  suggested.  It  has  been 
suggested  that  the  uniqueness  of  silver  is  associ¬ 
ated  with  its  ability  to  adsorb  oxygen  as  the  super¬ 


oxide  species  [24].  More  direct  evidence  for  molec¬ 
ular  oxygen  as  the  active  species  has  been  ob¬ 
tained  by  several  studies:  e.g.,  only  complete  oxi¬ 
dation  occurs  when  N2O  pulse  which  gives  atomi¬ 
cally  adsorbed  oxygen  is  used,  but  the  epoxidation 
reaction  occurs  when  O2  pulse  which  gives  molec- 
ularly  adsorbed  oxygen,  at  least  initially,  is  used 
[25].  The  mechanism  assuming  atomic  oxygen  as 
an  active  species  was  suggested  from  the  experi¬ 
mental  data  that  only  atomically  adsorbed  oxygen 
exist  at  high  temperature  in  an  industrial  condi¬ 
tion,  since  molecular  oxygen  desorbs  or  dissoci¬ 
ates.  The  assumption  that  molecular  and  atomic 
oxygens  cause  the  partial  and  total  oxidation  reac¬ 
tions,  respectively,  gives  an  upper  limit  for  the 
selectivity  of  6 /7.  Recent  experiments  giving 
higher  selectivity  than  6/7  [26]  imply  that  the 
atomic  oxygen  may  at  least  pay  an  important  role 
for  the  epoxidation  of  ethylene. 

Theoretical  studies  have  done  much  less  for  the 
epoxidation  reaction  of  olefins.  Though  a  few 
works  [27-30]  studying  the  epoxidation  mecha¬ 
nism  of  ethylene  have  ever  been  published,  less 
discussions  were  presented  [27,  30]  for  the  oxida¬ 
tion  of  propylene,  and  the  calculations  did  not 
explicitly  include  the  surface  contribution.  The 
main  difficult  problem  still  remaining  in  the  theo¬ 
retical  studies  is  how  to  describe  chemisorption 
species  and  calculate  adsorption  energy  correctly 
based  on  a  selected  small  cluster.  This  may  be  why 
no  ab  initio  theoretical  study  for  the  full  reaction 
pathway  and  the  associated  energy  diagrams  of 
the  epoxidation  reaction  of  olefins  has  been  re¬ 
ported. 

In  a  series  of  our  previous  studies  [31-37],  a 
theoretical  model,  named  the  dipped  adcluster 
model  (DAM)  [31-33],  has  been  proposed  to  study 
chemisorptions  and  surface  reactions  by  involving 
the  interactions  between  bulk  metal  and  ad- 
molecules  with  consideration  of  the  electron  trans¬ 
fer  between  them  and  the  image  force  correction. 
Such  an  interaction  is  found  to  be  very  important 
for  studying  the  chemisorptions  and  the  reactions 
of  the  oxygen  on  a  metal  surface,  and  even  a  small 
DAM  could  be  used  to  give  correct  electronic 
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structures  and  reliable  adsorption  energies  in  con¬ 
trast  to  the  cluster  model.  Previous  studies  [34-36] 
for  the  chemisorption  of  oxygen  on  a  silver  surface 
showed  that  the  molecularly  adsorbed  oxygen  ex¬ 
ists  in  two  stable  geometries;  i.e.,  an  end-on  (bent) 
structure  in  an  on-top  site  and  a  side-on  structure 
in  a  bridge  site.  The  ground  state  in  the  former 
geometry  is  superoxide,  which  is  expected  to  be 
more  electronically  favorable  for  the  reaction  with 
ethylene  [36]. 

In  this  work,  we  present  a  brief  account  of  our 
recent  systematic  studies  on  the  catalytic  activity 
of  silver  surface  for  the  oxidation  of  olefins.  We 
will  try  to  address  the  following  three  questions: 
(1)  What  species  of  surface  oxygen  is  responsible 
for  the  ethylene  epoxidation?  (2)  Why  is  silver  a 
unique  catalyst  for  the  epoxidation?  (3)  Why  do 
higher  olefins  mainly  process  complete  oxidation 
on  the  same  silver  catalyst?  We  performed  the  ab 
initio  Hartree-Fock  (HF),  second-order  Moller- 
Plesset  (MP2),  and  the  SAC  (symmetry-adapted 
cluster)/SAC-CI  (configuration  interaction)  calcu¬ 
lations  with  the  use  of  the  DAM.  More  details  are 
seen  in  the  literature  [38-40]. 


Epoxidation  Mechanism  of  Ethyiene  on 
Silver  Surface 

In  this  section  we  give  an  account  of  our  re¬ 
search  on  the  mechanism  of  the  epoxidation  of 
ethylene  on  a  silver  surface  [38].  The  energy  dia¬ 
grams  of  the  reactions  of  ethylene  with  both 
molecularly  adsorbed  and  atomically  adsorbed 
oxygens  on  a  Ag  surface  are  studied.  The  Eley- 
Rideal  mode,  namely  the  reaction  between  the 
adsorbed  oxygen  species  and  a  gaseous  ethylene  is 
assumed.  The  DAM  was  used  to  include  the  ef¬ 
fects  of  the  bulk  metal  such  as  the  electron  transfer 
between  adcluster  and  surface  and  the  image  force. 
Geometry  optimizations  for  calculating  the  reac¬ 
tion  path  were  performed  using  the  ab  initio  unre¬ 
stricted  Hartree-Fock  (UHF)  method  and  the  ener¬ 
gies  at  the  optimized  geometries  were  calculated 
by  the  MP2  methods  including  the  image  force 
correction.  The  Gaussian  basis  set  used  is  of  dou- 
ble-zeta  accuracy,  and  some  additional  functions 
were  added  for  improvement.  They  are  shown  to 
reproduce  well  the  known  geometries  and  the  en¬ 
ergetics  of  the  reactions  [38].  The  model  adclusters 
used  in  the  present  study  are  the  molecularly 
adsorbed  end-on  (bent)  form  in  an  on-top  site  and 
the  atomically  adsorbed  on-top  structure. 


REACTIONS  WITH  MOLECULARLY 
ADSORBED  OXYGEN 

Epoxide  Formation 

We  first  study  the  epoxidation  reaction  of  eth¬ 
ylene  with  the  molecularly  adsorbed  superoxide 
on  a  silver  surface.  For  the  formation  of  ethylene 
oxide,  the  reaction  takes  place  by  an  initial  interac¬ 
tion  of  the  oxygen  with  one  of  the  carbon  in 
ethylene  [5,  6].  Figure  1  shows  the  obtained  favor¬ 
able  initial  attack  of  ethylene  to  superoxide  species 
adsorbed  on  the  silver  surface.  Starting  from  this 
geometry,  we  optimized  the  geometries  of  the  in¬ 
termediates  and  the  transition  states  (TSs)  along 
the  reaction  pathway.  The  optimized  geometries 
and  energy  diagrams  are  shown  in  Figure  2.  The 
reaction  proceeds  from  the  left  to  the  right,  and  the 
energy  is  in  kcal/mol  relative  to  the  free  system, 
i.e.,  Ag2  +  O2  +  C2H4. 

Figure  2  shows  a  two-step  mechanism  of  the 
reactions.  The  first  step  of  this  reaction  is  the 
adsorption  of  molecular  O2  on  Ag  surface  in  which 
one  electron  is  transferred  from  the  bulk  metal  to 
the  adcluster,  i.e.,  n  =  1,  The  transferred  electron 
mainly  occupies  the  out-of-plane  p*  orbital  of  O2/ 
and  the  in-plane  p*  molecular  orbital  (MO)  of  O2 
remains  singly  occupied.  The  ground  state  for  the 
end-on  adsorption  of  O2  is  the  superoxide.  The 
adsorption  energy  of  superoxide  is  calculated  to  be 


FIGURE  1.  Model  adcluster  used.  Geometries  were 
optimized  in  the  Cs  symmetry  except  for  the  Ag  —  Ag 
distance  and  the  Og  — Ag  —  Ag  angle  fixed  at  2.8894  A 
and  the  90.0°,  respectively. 
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FIGURE  2.  Energy  diagram  for  the  reaction  between  ethylene  and  molecularly  adsorbed  superoxide  on  Ag  surface. 
The  route  leading  to  ethylene  oxide  (EO)  and  acetaldehyde  (AA)  are  shown  by  the  solid  and  broken  lines,  respectively. 


7.4  kcal/mol  at  the  UHF  level  and  26.9  kcal/mol 
at  the  MP2  level.  The  experimental  molecular  ad¬ 
sorption  energies  are  9.2,  9.3,  and  24.1  kcal/mol 
for  the  Ag(lll)  [18],  Ag(llO)  [18],  and  electrolytic 
Ag  [41]  surfaces,  respectively.  The  next  step  is  the 
attack  of  ethylene  onto  the  terminal  oxygen  atom 
which  is  more  reactive  than  the  inside  one  O^, 
leading  to  the  TS  and  the  intermediate  with  the 
energy  barrier  of  11.3  kcal/mol.  The  energy  level 
of  the  intermediate  is  similar  to  that  of  initial  step. 
In  the  TS  the  a  bond  between  and  is 
formed  and  the  in-plane  p  bond  of  O2  and  the  p 
bond  of  C2  are  broken.  The  geometry  parameters 
at  every  step  clearly  reflect  the  reaction  processing 
along  the  reaction  path  [38]. 

When  the  intermediate  is  produced,  the  forma¬ 
tion  of  ethylene  oxide  is  an  easy  path  as  shown  by 
the  real  line,  leaving  an  atomic  oxygen  on  the  Ag 
surface.  The  calculated  heat  of  reaction  is  37.1 
kcal/mol.  The  energy  diagram  given  in  Figure  2 
shows  that  the  reaction  leading  to  ethylene  oxide 
proceeds  very  smoothly:  The  reaction  is  exother¬ 
mic,  there  are  no  very  high  barriers,  and  there  are 
no  too  stable  intermediates.  We  therefore  conclude 
that  the  epoxide  formation  reaction  from  the  su¬ 


peroxide  species  on  a  silver  surface  proceeds  very 
smoothly.  We  note  that  the  exothermicity  of  the 
overall  reaction  is  partially  due  to  the  formation  of 
the  atomically  adsorbed  oxygen  on  the  silver  sur¬ 
face. 

Acetaldehyde  Formation 

We  next  study  the  formation  process  of  ac¬ 
etaldehyde  from  ethylene  and  molecularly  ad¬ 
sorbed  superoxide  on  the  Ag  surface.  In  Figure  2, 
the  path  is  given  by  the  broken  line.  When  ac¬ 
etaldehyde  is  produced,  it  is  further  oxidized  to 
CO2  and  FI  2  O  by  combustion  in  the  presence  of 
adsorbed  oxygen,  and  therefore  it  is  an  intermedi¬ 
ate  in  the  complete  oxidation  process  [42]. 

The  formation  of  acetaldehyde  starts  from  the 
intermediate  shown  in  the  center  of  Figure  2:  up  to 
it  the  reaction  path  is  common  to  the  epoxidation 
reaction.  FF^  bound  to  in  the  intermediate  mi¬ 
grates  to  C^,  giving  a  stable  conformation  denoted 
as  acetaldehyde  intermediate  in  Figure  2.  Fiow- 
ever,  the  energy  barrier  for  this  Fi^  migration  is  as 
high  as  71.5  kcal/mol.  This  step  is  therefore  ener¬ 
getically  forbidden.  This  is  a  reason  for  the  high 
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selectivity  of  the  epoxide  formation  process  start¬ 
ing  from  the  superoxide  species  summarized  in 
the  preceding  section. 

Effect  of  Silver  Surface 

The  effect  of  the  silver  surface  may  be  twofold: 
One  is  to  provide  a  reactive  species  adsorbed  on 
the  surface,  and  the  other  is  to  provide  electrons  to 
the  reaction  site  as  considered  in  the  DAM.  Figure 
3  shows  the  energy  diagram  for  the  reaction  be¬ 
tween  ethylene  and  gaseous  oxygen  without  the 
silver  surface.  Two  energy  diagrams  correspond  to 
the  calculations  with  n  ==  1  and  n  =  0.  The  former 
involves  an  excess  electron  but  the  latter  is  neutral. 
For  n  =  1,  we  used  the  optimized  geometries 
shown  in  Figure  2,  and,  for  n  =  0,  we  performed 
the  geometry  optimization  for  the  present  purpose. 
The  geometries  illustrated  in  the  upper  and  lower 
sides  correspond  to  those  for  n  =  0  and  n  =  1, 
respectively.  For  0,  the  O — O  distance  is  cal¬ 
culated  to  be  1.37  A  for  the  intermediate,  which  is 

o 

smaller  than  the  corresponding  value  1.49  A  for 
n  1.  The  same  is  true  for  all  the  intermediates 
and  TSs. 


Comparing  Figure  3  with  Figure  2,  we  clearly 
see  the  catalytic  activity  of  the  silver  surface  for 
the  epoxidation  reaction.  Without  the  surface,  the 
reaction  is  endothermic  and  the  pathway  has  a 
large  barrier.  Electron  transfer  (n  ~  1)  certainly 
reduces  the  barrier,  but  the  direct  interaction  with 
the  actual  silver  atoms  on  the  surface  is  quite 
important.  Clearly,  the  exothermicity  of  the  reac¬ 
tion  shown  in  Figure  2  is  due  to  the  formation  of 
the  atomically  adsorbed  oxygen  on  the  surface. 

REACTIONS  WITH  ATOMICALLY  ADSORBED 

OXYGEN 

When  ethylene  oxide  is  ejected  out  of  the  Ag 
surface,  the  atomically  adsorbed  oxygen  O”  is  left 
on  the  surface.  It  is  also  produced  by  the  dissocia¬ 
tive  adsorption  of  O2  on  the  surface,  and  the 
dissociative  state  is  more  stable  than  the  molecular 
adsorption  state.  In  our  calculation,  the  dissociated 
oxygen  has  its  singly  occupied  orbital  in  the  2p 
orbital  parallel  to  the  surface. 

Figure  4  shows  the  energy  diagram  for  the  reac¬ 
tion  of  ethylene  with  the  atomically  adsorbed  oxy¬ 
gen  on  the  Ag  surface.  When  ethylene  attacks 


FIGURE  3.  Energy  diagram  for  the  reactions  between  ethylene  and  gaseous  oxygens  [neutral  Og  in  =  0)  and  O2 
anion  (n  =  1)].  The  upper  geometries  are  for  n  =  0,  and  the  lower  geometries  for  r?  =  1. 
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FIGURE  4.  Energy  diagram  for  the  reaction  between  ethylene  and  atomically  adsorbed  oxygen  on  Ag  surface. 


atomic  oxygen  on  the  surface,  the  most  favorable 
approach  is  to  form  the  bond  as  illustrated 

in  Figure  4.  It  is  similar  to  the  approach  of  ethylene 
to  the  superoxide  species  shown  in  Figure  2.  The 
calculated  energy  barrier  from  the  adsorbed  oxy¬ 
gen  to  the  intermediate  is,  however,  as  small  as  1.5 
kcal/mol,  reflecting  the  reactivity  of  the  atomic 
oxygen. 

The  next  step  from  this  intermediate  is  very 
important,  since  it  determines  the  selectivity  of  the 
atomically  adsorbed  oxygen.  If  Cj,  attacks  the  oxy¬ 
gen  to  form  the  bond,  ethylene  oxide  is 

formed.  If  attacks  another  C — H  bond,  causing 
the  hydrogen  migration,  acetaldehyde  is  a  prod¬ 
uct,  which  is  an  intermediate  for  the  complete 
oxidation  product. 

For  the  hydrogen  migration  reaction,  the  barrier 
was  calculated  to  be  39.1  kcal/mol,  which  is  high 
but  lower  than  the  barrier,  71.5  kcal/mol,  of  the 
reaction  in  which  the  superoxide  species  was  in¬ 
volved  (see  Fig.  2).  This  process  were  calculated 
using  the  adcluster  with  n  =  1.  Unfortunately,  the 
structure  of  the  TS  to  form  the  bond  could 

not  be  determined  by  the  geometry  optimization 
procedure  using  both  of  the  n  =  1  and  n  ^  0  ad- 
clusters.  If  the  pathway  from  the  intermediate  to 


ethylene  oxide  is  one  step,  it  is  a  quite  interesting 
step  since  it  involves  both  geometrical  changes 
and  one  electron  back-transfer  from  the  admolecule 
to  the  bulk  metal.  In  the  DAM  picture,  the  inter¬ 
mediate  in  Figure  4  is  neutrally  described  with 
n  =  1,  i.e.,  with  one  additional  electron  supplied 
from  the  bulk  metal  to  the  adcluster,  but  the  prod¬ 
uct,  ethylene  oxide  plus  Ag2  should  be  neutral, 
i.e.,  n  =  0.  Therefore,  we  calculated  two  potential 
curves,  one  starting  from  the  intermediate  and  the 
other  from  ethylene  oxide  plus  Ag2  using  n  =  1 
and  n  =  0,  respectively.  All  the  geometrical  pa¬ 
rameters  were  assumed  to  change  linearly,  except 
for  the  Ag — O  distance,  which  was  fixed  to 
2.180  A. 

The  potential  curve  for  n  =  1  increases  mono- 
tonically  from  the  intermediate  to  the  product, 
while  the  curve  for  n  =  0  also  increases  monotoni- 
cally  from  ethylene  oxide  and  Ag2  to  the  interme¬ 
diate,  and  the  two  energy  curves  cross.  The  energy 
difference  between  the  crossing  point  and  the  in¬ 
termediate  is  about  36  kcal/mol,  which  may  be 
considered  as  the  energy  barrier  of  this  process. 
Namely  at  this  TS,  one  electron  is  transferred  back 
to  the  metal. 
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The  selectivity  giving  either  ethylene  oxide  or 
acetaldehyde  would  be  dependent  on  the  heights 
of  the  barriers  of  the  two  processes  and  on  the 
stabilities  of  the  two  products.  The  calculated  en¬ 
ergy  barriers  for  the  two  processes  are  similar,  39.1 
and  about  36  kcal/mol,  respectively.  On  the  other 
hand,  the  energy  difference  between  the  two  prod¬ 
ucts  and  the  intermediate  are  +15.6  and  -13.7 
kcal/mol,  respectively,  which  means  endothermic- 
ity  and  exothermicity  from  the  intermediate,  re¬ 
spectively.  From  the  above  data  alone,  it  is  diffi¬ 
cult  to  decide  which  is  a  preferential  process, 
though  the  acetaldehyde  formation  may  be  favor¬ 
able  for  the  exothermicity.  However,  it  may  be 
said  that  both  ethylene  oxide  and  acetaldehyde  are 
formed  from  ethylene  and  atomically  adsorbed 
oxygen  on  the  Ag  surface.  Furthermore,  this  fact  is 
very  important  since  it  would  be  a  reason  for  the 
experimental  maximum  selectivity  more  than  6/7 
[5-7]. 

OVERALL  MECHANISM  OF  EPOXIDATION  OF 

ETHYLENE 

We  summarize  here  our  results  on  the  mecha¬ 
nism  of  the  epoxidation  of  ethylene  on  a  silver 
surface.  The  calculated  energy  barriers  and  the 
heats  of  the  reactions  are  summarized  in  Table  I. 
The  primarily  important  species  for  the  epoxida¬ 
tion  of  ethylene  on  a  silver  surface  is  the  superox¬ 
ide  O/  which  is  molecularly  adsorbed  on  the 
surface  in  the  bent  end-on  geometry.  Ethylene  at¬ 
tacks  the  terminal  oxygen  atom  of  the  superoxide, 
as  shown  in  Figure  1,  with  the  barrier  of  about  11 
kcal/mol,  and  then  the  reaction  proceeds  quite 
smoothly  leading  to  ethylene  oxide  as  shown  in 


Figure  2.  The  overall  reaction  is  exothermic  by  37 
kcal/mol  from  C2H4  +  O2  +  Ag  surface  or  by  10 
kcal/mol  from  C2H4  +  superoxide  on  Ag  surface. 
The  second  exothermicity  is  due  to  the  larger  ad¬ 
sorption  energy  of  the  atomic  oxygen  than  that  of 
the  superoxide. 

On  the  contrary,  the  complete  oxidation  of  ethy¬ 
lene  from  the  superoxide  species  is  forbidden  due 
to  the  existence  of  the  large  barrier  (72  kcal/mol) 
in  the  hydrogen  migration  step,  though  this  reac¬ 
tion  is  largely  exothermic  by  66  kcal/mol  from  the 
initial  compounds  or  by  40  kcal/mol  from  the 
superoxide  on  Ag  and  C2H4.  When  silver  surface 
does  not  exist,  the  epoxidation  reaction  is  a  very 
unfavorable  reaction  having  a  high-energy  barrier 
and  endothermicity.  Thus,  the  epoxidation  of  eth¬ 
ylene  by  the  superoxide  on  the  Ag  surface  is  highly 
efficient  and  selective. 

The  atomically  adsorbed  oxygen,  which  is  left 
on  the  surface  after  completion  of  the  epoxidation 
reaction  by  the  superoxide  or  which  may  exist  by 
the  dissociative  adsorption  of  O2  on  the  surface, 
has  two  reaction  channels  leading  to  ethylene  ox¬ 
ide  and  to  complete  oxidation.  The  selectivity  here 
seems  to  be  small  and  both  products  would  be 
obtained.  This  would  be  the  reason  why  some 
experiments  reach  the  selectivity  larger  than  6 /7. 

In  the  process  leading  to  ethylene  oxide,  the 
electron  back-transfer  from  the  reaction  adsorbate 
complex  to  the  metal  surface  should  be  important 
and  related  to  the  barrier.  We  note  that  both  elec¬ 
tron  acceptor  and  donor  may  work  to  reduce  the 
barrier  of  the  electron  transfer  step:  They  work  to 
stabilize  the  n  =  1  and  n  =  0  curves,  respectively. 
The  effects  of  Ce  and  halogen  as  promoters  of 
catalyst  are  well  known  experimentally  [5-7].  We 


TABLE  I 


Comparison  of  the  energy  barrier  and  the  heat  of  reaction  for  the  epoxidation  and  acetaidehyde  formation  by 
the  molecuiarly  and  atomicaiiy  adsorbed  oxygens  (kcai  /  moi). 


Reaction 

Energy  barrier 

Heat  of  reaction® 

Molecularly  adsorbed  oxygen 

02(a)  /  Ag  +  C2H4  ^  C2H4O  +  0(a)  /  Ag 

11.3 

10.2 

02(a)  /Ag  +  C2H4  ^  CH3CHO  +  0(a)  /Ag 

Atomically  adsorbed  oxygen 

71.5 

39.5 

0(a)  /  Ag  +  C2H4  — >  C2H4O  +  Ag 

-36 

21.5  (24.7) 

0(a)  /  Ag  +  C2H4  CH3CHO  +  Ag 

Without  Ag  surface 

39.1 

50.8  (52.3) 

O2  +  U2H4  ^  0  +  C2H4O 

31.2 

-13.2  (-11.3) 

O2  +  C2H4  — >  0  +  C2H4O 

52.9 

-38.0  (-35.4) 

^Values  in  parentheses  show  the  experimental  values. 
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speculate  that  these  promoters  act  in  this  electron- 
transfer  step.  Design  of  the  promoter  or  co-cata- 
lyst,  which  is  effective  to  this  electron-transfer  step, 
is  of  crucial  importance  since  it  would  be  a  key  for 
raising  up  the  selectivity  over  6/7. 


Activation  of  O  2  on  Cu,  Ag,  and  An 
Surfaces 

The  experimental  results  show  that  silver  is  a 
unique  catalyst  for  the  epoxidation  of  ethylene. 
However,  it  is  not  yet  well  understood  why  only 
silver  is  so  exceptional.  In  the  former  section,  we 
showed  that  the  superoxide  species  adsorbed  in 
the  end-on  form  on  a  Ag  surface  is  very  active  and 
selective  for  the  epoxidation  reaction.  Aiming  to 
clarify  the  reason  why  only  silver  is  an  effective 
catalyst  for  the  epoxidation  of  ethylene,  in  this 
section,  we  perform  a  comparable  study  for  the 
chemisorption  and  activation  of  oxygen  on  Cu,  Ag, 
and  Au  surfaces. 

We  take  M2O2  (M  =  Cu,  Ag,  Au)  as  adclusters. 
Both  electronic  structures  and  the  reactivity  of  the 
molecular  end-on  and  side-on  adsorptions  are 
studied.  The  geometries  of  the  molecular  end-on 
and  side-on  species  are  optimized  at  the  UHF 
level.  The  relative  stabilities  of  the  adsorbed  species 
is  studied  by  SAC/SAC-CI  method  [43-45],  since 
the  roles  of  lower  ground  and  excited  states  are 
quite  important  in  this  kind  of  surface  electronic 
processes  [34-36].  The  DAM  [31-33]  is  again 
adopted  to  investigate  the  electron  transferability 
of  these  surfaces. 

REACTIVITY  AND  STABILITY  OF 

SUPEROXIDE  SPECIES 

Figure  5  shows  an  illustration  of  the  geometries 
of  the  end-on  and  side-on  forms.  The  calculations 
are  performed  by  assuming  one-electron  transfer 


FIGURE  5.  Illustration  of  the  geometrical  conversion 
from  end-on  (#1)  to  side-on  (#4)  forms.  The  M  —  M 
(M  =  Cu,  Ag,  and  Au)  distances  were  fixed  at  the  lattice 
constants. 

from  the  bulk  to  the  adcluster  occurs.  The  opti¬ 
mized  O — O  distances  are  1.36-1.38  A  in  the  end- 
on  form,  which  are  close  to  or  slightly  greater  than 
that  of  a  free  O2  anion  (1.35  A)  [46],  and  that  in  the 
side-on  form  are  all  about  1.51  A,  The  calculated 
Cu — O  distance  are  shorter  than  the  Au — O  and 
Ag — O  distances,  reflecting  a  large  adsorption  en¬ 
ergy  and  a  short  lattice  constant  of  Cu. 

The  calculated  electronic  structure  of  O2  in  the 
end-on  and  side-on  adsorption  forms  is  superoxide 
O2  and  peroxide  Of",  respectively.  Table  II  shows 
the  net  charge  and  frontier  density  (or  spin  popu¬ 
lation),  which  are  related  to  the  reactivity  of  the 
superoxide  species.  In  all  three  species,  the  net 
charges  on  the  inside  (O^)  and  outside  (O^)  oxygen 
atoms  are  calculated  to  be  about  —0.6  and  —0,15, 
respectively,  while  the  outside  oxygen  atoms  have 
a  frontier  density  of  about  0.86,  which  is  much 
larger  than  that  of  the  inside  oxygen  atoms  (about 
0.12).  This  shows  that  the  outside  oxygen  is  much 
more  reactive  than  the  inside  one  and  that  the 
former  is  not  so  negative  as  the  latter  one,  the 
same  result  as  that  reported  before  [36]  for  a  linear 
Ag — O — O  system.  Actually,  in  the  epoxidation 
reaction  with  ethylene,  the  oxygen  that  reacts  with 


TABLE  II 


1  II  _ _ _ _ _ — — - 

Net  charge  and  frontier  density  of  the  end-on  superoxide  species  on  Cu,  Ag,  and  Au  surfaces. 

System^ 

Net  charge 

Frontier  density 

Ma 

Oa 

Ot, 

Ma 

Mb 

Oa 

Ob 

CU2O2 

-H0.060 

-0.271 

-0.643 

-0.146 

0.007 

0.002 

0.133 

0.858 

-hO.058 

-0.295 

-0.605 

-0.158 

0.014 

0.003 

0.119 

0.864 

AU2O2 

-h  0.095 

-0.285 

-0.644 

-0.166 

0.007 

0.002 

0.116 

0.876 

®The  geometry  of  the  end-on  form  is  shown  in  Fig.  1 . 
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ethylene  is  the  outside  one  as  shown  in  the  above 
section.  Based  on  the  analyses  of  the  net  charge 
and  the  frontier  density,  similar  reactivity  is  ex¬ 
pected  for  the  end-on  superoxide  species  on  Cu, 
Ag,  and  Au  surfaces. 

The  reactivities  of  the  superoxide  species  on  Cu, 
Ag,  and  Au  surface  for  ethylene  are  also  very 
similar  [39].  The  differences  among  Cu,  Ag,  and 
Au  surfaces  are  small,  if  the  superoxide  species 
exists  on  the  metal  surfaces.  Large  observed  differ¬ 
ences  in  the  catalytic  activity  of  these  metal  sur¬ 
faces  are  therefore  attributed  to  the  different  stabil¬ 
ities  of  the  adsorbed  oxygen  species,  particularly 
the  stability  of  the  superoxide  species. 

Next,  we  examine  a  conversion  from  the  end-on 
to  the  side-on  geometries.  This  process  accompa¬ 
nies  a  conversion  from  the  superoxide  to  the  per¬ 
oxide,  since  the  ground  state  of  the  end-on  and 
side-on  species  are  the  superoxide  and  peroxide, 
respectively,  as  shown  in  some  details  in  this  sec¬ 
tion.  Thus,  the  method  to  be  used  here  should  be 
able  to  describe  both  ground  and  excited  states  in 
a  good  accuracy.  For  this  purpose,  the  HF  method 
is  inadequate,  we  use  the  SAC/SAC-CI  method 
[43-45].  The  geometries  optimized  at  the  HF  level 
are  used  for  the  end-on  (#1)  and  side-on  (#4) 
forms,  respectively.  The  intermediate  geometries 
#2  and  #3,  which  are  illustrated  in  Figure  5,  are 
approximated  as  a  linear  function  of  the  OMM 
angle. 

Figure  6  shows  the  potential  energy  curves 
(PECs)  calculated  for  the  conversion  process  be¬ 
tween  the  end-on  and  side-on  forms  on  the  Cu, 
Ag,  and  Au  surfaces.  In  the  end-on  geometry,  the 
superoxide  CB2)  is  always  the  ground  state,  but  in 
the  side-on  geometry,  the  ground  state  is  the  per¬ 
oxide  species  The  superoxide  species  with 

the  ^A2  symmetry  is  less  important  since  it  never 
becomes  the  ground  state  in  this  conversion  pro¬ 
cess.  In  the  ^82  superoxide,  the  in-plane  tt*  orbital 
of  oxygen  is  singly  occupied  while  the  out-of-plane 
77*  orbital  is  doubly  occupied.  These  occupations 
are  reversed  in  the  ^^2  state.  The  crossing  of  the 
PECs  of  the  superoxide  and  peroxide  occurs  be¬ 
tween  the  end-on  and  side-on  geometries. 

Relative  stabilities  of  the  superoxide  and  perox¬ 
ide  in  the  end-on  and  side-on  geometries  and  the 
barrier  of  the  conversion  are  quite  important  for 
understanding  the  catalytic  activities  of  metals  for 
the  epoxidation  of  ethylene.  Such  information  is 
given  in  Figure  6.  In  the  end-on  form,  the  superox¬ 
ide  species  is  more  stable  than  the  peroxide  species: 
the  energy  difference  is  about  20  kcal /mol  for  Ag 


End-on  Side-on 

FIGURE  6.  Potential  energy  curves  calculated  by  the 
SAC/SAC-CI  method  for  the  conversion  between  the 
end-on  (#1)  and  side-on  (#4)  adsorption  states  in  the 
lowest  three  states  of  the  CU2O2,  Ag202,  and  AU2O2 
adclusters  with  n  =  ^. 


and  Au  and  5  kcal/mol  for  Cu.  In  the  side-on 
form,  the  peroxide  species  is  more  stable  than  the 
superoxide  species:  The  energy  difference  is  50.1, 
27.6,  21.2  kcal/mol  for  Cu,  Ag,  and  Au,  respec¬ 
tively.  The  energy  difference  between  the  end-on 
superoxide  and  the  side-on  peroxide  is  50.8,  23.1, 
and  —1.2  kcal/mol  for  Cu,  Ag,  and  Au,  respec¬ 
tively.  On  the  Cu  surface,  the  superoxide  species 
in  the  end-on  form  easily  converts  into  the  perox¬ 
ide  species  in  the  side-on  form  because  of  a  large 
stabilization  energy  and  for  a  lack  of  the  barrier: 
The  lifetime  of  the  end-on  superoxide  species 
should  be  very  short  on  the  Cu  surface.  On  the 
other  hand,  an  energy  barrier  of  about  20  kcal/mol 
is  calculated  for  the  conversion  on  Au.  On  the  Ag 
surface,  the  end-on  superoxide  should  have  a  con¬ 
siderable  lifetime  as  seen  from  the  PECs  shown  in 
Figure  6.  Based  on  the  above  results,  we  can  con¬ 
clude  that  the  reason  for  the  nonselectivity  of  Cu 
for  the  epoxidation  of  ethylene  is  due  to  the  ab¬ 
sence  of  the  stable  adsorbed  superoxide  species. 
However,  for  Au,  the  results  obtained  from  Figure 
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6  does  not  give  a  clear  explanation  for  its  inability 
of  this  reaction.  As  far  as  one-electron  transfer 
from  the  bulk  metal  into  the  adcluster  occurs  for 
Cu,  Ag,  and  Au,  the  reactivity  of  the  superoxide 
on  these  metals  should  be  similar. 

ELECTRON  TRANSFERABILITY  OF  Cu,  Ag, 
AND  Au 

The  results  listed  in  the  preceding  section  show 
that  the  oxygen  superoxide  species  adsorbed  on 
Au  surface  has  essentially  the  same  reactivity  as 
those  on  Cu  and  Ag  surfaces.  The  stabilities  of  the 
molecularly  adsorbed  oxygen  species  are  mainly 
due  to  the  electron-transfer  effect  from  the  bulk 
metal  as  already  shown  previously  for  silver 
[34-36]  and  palladium  [31,  32].  The  chemical  po¬ 
tentials  (work  functions)  of  the  Cu,  Ag,  and  Au 
surfaces  are  4.48,  4.52,  and  5.37  eV  for  the  (110) 
surfaces,  4.94,  4.74,  and  5.31  eV  for  the  (111)  sur¬ 
faces,  and  4.59,  4,64,  and  5.47  eV  for  the  (100) 
surfaces  [47].  The  Cu  surface  is  expected  to  have 
similar  properties  to  the  Ag  surface.  However,  it  is 
not  clear  whether  the  electron  transfer  also  occurs 
for  the  Au  surface.  We  therefore  investigate  here 
the  E(n)  curves  of  the  M2 — O2  systems  in  the 
DAM  in  order  to  clarify  why  Au  cannot  be  a  good 
catalyst  for  the  epoxidation  of  ethylene. 

Figure  7  is  a  display  of  the  E(n)  curves,  namely 
the  energy  of  the  adcluster  calculated  as  a  function 
of  n.  It  was  calculated  with  the  use  of  the  highest 
spin  coupling  model  [39].  The  optimized  end-on 
geometries  of  M2O2  adclusters  are  adopted  since 
they  are  the  most  stable  geometries  of  the  superox¬ 
ide  adsorbed  species.  The  E(n)  curves  are  upper 
convex,  and  at  n  =  1.0,  the  systems  are  more  sta¬ 
ble  than  at  n  =  0.0,  and  the  order  of  stability  is 
CU2O2  >  Ag202  >  AU2O2.  According  to  DAM 
[31],  the  electron  transfer  would  occur  between  the 
adcluster  and  the  bulk  metal  when  the  chemical 
potential  of  the  adcluster  becomes  equal  to  the 
chemical  potential  of  the  solid  surface,  i.e., 
dE{n)/dn  =  "/X.  For  CU2O2  and  Ag202  systems, 
the  tangents  of  the  E{n)  curves  become  equal  with 
the  experimental  chemical  potentials  at  about  n  = 
0.9.  Therefore,  from  the  concept  of  DAM  [31],  one 
electron  should  flow  from  the  Cu  and  Ag  bulk 
metals  into  the  adcluster  after  some  barrier.  This  is 
the  case  we  used  to  study  the  activation  of  O2  on 
the  surfaces  shown  in  the  preceding  section.  How¬ 
ever,  for  the  AU2O2  system,  though  the  energy  at 


FIGURE  7.  E{n)  curves  for  the  CU2O2,  Ag202,  and 
AU2O2  adclusters  in  the  highest  spin  coupling  model  at 
the  optimized  end-on  geometries. 


n  =  1.0  is  lower  than  that  at  n  =  0.0,  the  tangent  of 
the  E{n)  curve  is  still  smaller  than  the  experimen¬ 
tal  chemical  potential.  Therefore,  different  from  Cu 
and  Ag  metals,  one-electron  flow  does  not  occur 
from  the  Au  metal  into  the  adcluster.  This  point  of 
the  Au  surface  is  essentially  different  from  the 
other  metals  and  should  be  the  reason  why  the 
molecularly  adsorbed  oxygen  species  cannot  exist 
on  the  clean  gold  surface.  Thus,  the  inactivity  of 
the  gold  surface  for  the  oxygen  is  explained  by  the 
DAM  as  being  due  to  its  poor  donating  property. 

The  chemical  potential  of  Au  is  much  larger 
than  those  of  Cu  and  Ag.  We  note  that  this  differ¬ 
ence  is  attributed  actually  to  the  relativistic  effect 
in  gold  [48,  49].  The  importance  of  the  relativistic 
effects  to  the  valence-electron  properties  such  as 
chemical  shift  has  been  recently  clarified  in  this 
laboratory  [50-52]. 

We  further  note  that  the  electron  flow  is  diffi¬ 
cult  to  occur  between  the  clean  gold  metal  and  the 
adcluster  because  it  does  not  satisfy  the  condition 
clarified  by  the  DAM  [31].  However,  as  the  energy 
at  n  =  1,0  is  lower  than  that  at  n  =  0.0,  the  elec¬ 
tron  transfer  can  be  realized  if  the  chemical  poten¬ 
tials  of  the  gold  metal  can  be  lowered  by  an 
addition  of  promoters,  etc.  Experimentally,  the  re¬ 
cent  works  performed  by  Haruta  et  al.  [53,  54] 
show  that  gold  is  remarkably  active  for  the  oxida¬ 
tion  of  CO  and  hydrocarbons  when  it  is  supported 
on  suitable  metal  oxides.  This  may  be  explained  by 
such  effect  of  the  metal-oxide  support. 
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BRIEF  SUMMARY 


small  substitution  of  hydrogen  with  methyl  group, 


The  electron  transfer  from  metal  to  O2  is  a  key 
factor  for  the  chemisorption  of  oxygen  on  a  metal 
surface,  as  shown  previously  [32-37] .  The  differ¬ 
ences  in  the  relative  stabilities  of  different  oxygen 
species  on  different  metal  surfaces  are  attributed 
to  the  differences  in  the  electron-donating  ability. 
The  present  DAM  calculations  show  that,  for  Cu 
and  Ag  surfaces,  one-electron  transfer  from  the 
bulk  metal  into  the  adcluster  occurs  after  some 
barriers.  On  the  other  hand,  electron  transfer  is 
difficult  to  occur  for  the  Au  surface  since  the  gold 
surface  has  the  higher  work  function.  This  is  why 
molecular  adsorption  of  O2  does  not  occur  on  the 
Au  surface. 

On  Cu  surface,  peroxide  is  much  more  stable 
than  superoxide,  and  there  is  essentially  no  barrier 
for  the  conversion  from  superoxide  to  peroxide,  so 
that  the  lifetime  of  the  superoxide  species  should 
be  too  short  to  react  with  ethylene.  On  Ag  surface, 
however,  the  superoxide  species  would  have  con¬ 
siderable  lifetime  on  the  surface.  On  Au  surface, 
the  superoxide  species  does  not  exist  because  the 
molecular  adsorption  of  O2  does  not  occur  due  to 
the  difficulty  of  the  electron  transfer  from  the  clean 
bulk  metal.  We  think  that  this  difference  in  the 
stabilities  of  the  end-on  superoxide  species  on  Cu, 
Ag,  and  Au  surfaces  is  a  reason  of  the  observed 
difference  in  the  catalytic  activity  of  these  metals 
for  the  epoxidation  of  ethylene. 

The  above  conclusion  for  the  origin  of  the  dif¬ 
ferent  catalytic  activity  of  Cu,  Ag,  and  Au  surfaces 
for  the  epoxidation  of  ethylene  suggests  a  possibil¬ 
ity  of  making  a  new  catalyst  for  this  reaction.  The 
key  is  .an  appropriate  lifetime  of  the  end-on  super¬ 
oxide  species  on  the  metal.  Silver  realizes  it  with¬ 
out  any  special  modification.  However,  if  one  can 
realize  such  superoxide  species  on  some  metals 
using  some  modification,  we  think  that  the  essen¬ 
tial  factor  is  realized. 


H 

H 


:c=c; 


ethylene 


H 

H 


c=c: 


propylene 


H 

H 


causes  such  a  drastic  change  of  the  reaction.  It  is 
interesting  to  understand  why.  To  know  the  differ¬ 
ence  of  the  oxidation  mechanism  between  ethylene 
and  propylene  will  lead  to  a  better  understanding 
of  the  general  oxidation  mechanism  of  other  olefins 
on  silver  surface.  Further,  in  the  chemical  industry, 
propylene  oxide  is  an  important  material,  so  that 
to  know  why  silver  is  so  poor  for  the  conversion  of 
propylene  to  propylene  oxide  is  a  starting  point  for 
a  design  of  a  new  effective  catalyst.  This  section  is 
a  summary  of  our  recent  study  on  the  oxidation 
mechanism  of  propylene  [40]. 

Two  possible  reasons  may  be  considered  for  the 
difference  in  the  reaction  of  ethylene  and  propyl¬ 
ene  on  a  silver  surface.  One  possibility  is  that  the 
reaction  routes  are  the  same  but  the  methyl  sub¬ 
stituent  causes  a  large  change  in  the  barrier  height 
and/or  in  the  stability  of  the  intermediates  in  the 
course  of  the  reaction.  Another  possibility  is  that 
an  entirely  different  reaction  route  comes  out  for 
the  existence  of  the  methyl  group.  We  have  inves¬ 
tigated  these  two  possibilities.  As  in  the  case  of 
ethylene,  we  study  the  reactions  of  the  two  active 
oxygen  species  on  a  silver  surface:  the  molecularly 
adsorbed  superoxide  species  and  the  dissociatively 
adsorbed  atomic  oxygen  species. 

The  calculational  methods  are  the  same  as  those 
discussed  in  the  second  section.  The  DAM  [31-33] 
was  used  to  include  the  effect  of  the  bulk  metal 
such  as  the  electron  transfer  and  the  image  force. 
The  basis  sets  are  the  same  as  before,  and  the 
geometries  of  the  reactants,  intermediates,  transi¬ 
tion  states,  and  the  products  were  optimized  by 
the  Hartree-Fock  method  and  the  energies  were 
calculated  by  the  MP2  method  in  order  to  include 
the  effects  of  electron  correlations. 


Oxidation  Mechanism  of  Propylene  on 
Silver  Surface 

Though  silver  is  an  effective  catalyst  for  the 
epoxidation  of  ethylene  (85-87%  conversion)  [5-7], 
it  is  a  very  poor  catalyst  for  the  epoxidation  of 
propylene  (2-5%  conversion)  [12-16].  A  relatively 


ATTACK  ON  OLEFINIC  CARBON 

We  first  study  the  reaction  route,  which  is  es¬ 
sentially  the  same  as  that  of  ethylene  [38]:  The 
reaction  starts  from  the  interaction  between  the 
adsorbed  oxygen  and  the  doubly  bonded  carbon 
atom  of  propylene.  Figure  8  is  an  illustration  of  the 
adcluster  for  the  initial  interaction  of  propylene 
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(a)  (b) 


FIGURE  8.  Model  adclusters  for  the  olefinic  carbon 
attack  mechanism:  (a)  on  the  terminal  olefinic  carbon, 
(b)  on  the  central  olefinic  carbon. 


with  the  superoxide  species.  Since  two  carbons  in 
propylene  are  different,  two  different  attacks  were 
considered.  The  interaction  with  atomic  oxygen 
was  similarly  considered.  The  reaction  may  pro¬ 
cess  as  follows: 


CH3  H  (2) 

\  / 

O 

propylene  oxide 


C=C 


H 

+  O2  or  O 
H 


H-migration 

- 5 

Ag 


ICH3— CH2— CH=0  aldehyde 
|(CH3)2C=^0  ketone  ^  ^ 


In  (3),  aldehyde  is  a  product  of  the  terminal  C 
attack  and  ketone  is  a  product  of  the  central  C 
attack,  and  they  are  further  oxidized  to  CO2  and 
H2O. 

Figure  9  shows  the  optimized  geometries  and 
the  energy  diagrams  for  the  reaction  of  the  molec- 
ularly  adsorbed  oxygen  with  the  terminal  C  of 
propylene.  It  is  similar  to  Figure  2  for  the  reaction 
with  ethylene:  The  substitution  of  one  hydrogen 
with  methyl  group  does  not  cause  a  large  change 
for  the  reaction  route  and  the  energetics.  In  this 
reaction  mechanism,  the  route  going  to  propylene 
oxide  should  be  quite  smooth  without  any  high- 
energy  barrier  and  without  any  too  stable  interme¬ 
diate,  while  the  route  leading  to  the  aldehyde 
should  be  essentially  blocked  up  by  the  existence 
of  a  high-energy  barrier  in  the  hydrogen  migration 
step.  The  diagram  for  the  reaction  of  the  superox¬ 
ide  with  the  central  carbon  atom  of  propylene  [Fig. 
8(b)]  was  similar  to  Figure  9  [40].  Thus,  if  the 
reaction  proceeds  along  this  reaction  route,  pro¬ 
pylene  oxide  should  be  produced  in  high  selectiv¬ 
ity  due  to  the  catalytic  activity  of  the  superoxide 


FIGURE  9.  Energy  diagram  for  the  reaction  between  the  terminal  carbon  atom  of  propylene  and  the  molecularly 
adsorbed  superoxide  on  a  Ag  surface.  The  route  leading  to  propylene  oxide  and  aldehyde  are  shown  by  the  solid  and 
broken  lines,  respectively. 
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species  adsorbed  on  silver.  Also,  for  the  reaction 
with  atomically  adsorbed  oxygen,  a  similar  conclu¬ 
sion  as  that  of  ethylene  may  be  obtained  as  the 
reaction  route  and  the  energy  diagrams  of  propy¬ 
lene  are  similar  to  ethylene  [38]. 

The  results  shown  above  mean  that  if  the  reac¬ 
tion  proceeds  through  the  oxygen  attack  on  the 
olefinic  carbon  of  propylene,  silver  should  be  as 
good  a  catalyst  as  for  ethylene  for  the  epoxidation. 
However,  this  result  contradicts  with  the  experi¬ 
mental  observation  that  the  conversion  of  propyl¬ 
ene  into  propylene  oxide  is  only  2-5%  on  the 
silver  catalyst  [12-17].  We  therefore  examine  an¬ 
other  possibility  that  a  reaction  route  entirely  dif¬ 
ferent  from  those  of  ethylene  may  exist.  Since 
propylene  has  a  methyl  group,  but  ethylene  does 
not,  we  next  examine  the  reaction  route  involving 
the  reaction  of  this  methyl  group  with  the  reactive 
oxygen  on  a  silver  surface. 

ATTACK  ON  ALLYLIC  HYDROGEN 

Propylene  is  different  from  ethylene  by  an  exis¬ 
tence  of  a  methyl  group.  We  examine  here  the 
reaction  route  involving  the  reaction  of  this  methyl 
group  with  the  superoxide  and  the  atomic  oxygen 
on  a  silver  surface.  Figure  10  shows  the  initial 
attack  of  the  superoxide  [Fig.  10(a)]  and  atomic 
oxygen  [Fig.  10(b)]  on  the  methyl  hydrogen,  and 
the  reactions  process  as  follows: 

CH3^  /H 

C=c  +  Oj/Ag 

H  H 

^  CH2— CH— CH2/Ag  +  H— O— O/Ag 

(4) 

CH2— CH— CH2/Ag  +  O/Ag  +  HO/Ag 

(5) 

CH3  H 

;;:c=cc;  +  O/Ag 

H  H 

CHj— CH— CHj/Ag  +  HO/Ag  (6) 

These  reactions  produce  the  allyl  intermediate  ad¬ 
sorbed  on  the  silver  surface,  which  is  oxidized 
further  to  CO2  and  H2O.  Since  allyl  radical  or 
anion  is  a  rather  stable  intermediate,  this  reaction 
route  is  feasible. 

Figure  11  shows  the  optimized  reaction  route 
for  the  attack  of  superoxide  on  the  methyl  hydro¬ 
gen  of  propylene  as  given  by  (4)  and  (5).  The 
present  result  was  added  to  the  previous  one  for 


H,,,,  ^ 

H 


-Ob 


(a)  (b) 

FIGURE  10.  Model  adclusters  for  the  allylic  H  attack 
mechanism:  (a)  reaction  with  molecularly  adsorbed 
superoxide  species,  (b)  reaction  with  atomically 
adsorbed  oxygen. 


the  carbon  attack  given  in  Figure  9.  The  optimized 
geometries  of  the  intermediates  and  the  transition 
states  for  the  present  H  attack  are  given  in  the 
lower  side,  while  those  for  the  previous  C  attack 
are  given  on  the  upper  side.  The  energy  levels  of 
the  intermediates  and  the  transition  states  for  the 
present  reaction  route  are  lower  than  those  of  the 
previous  one.  The  barrier  for  the  attack  on  the 
methyl  hydrogen  was  calculated  to  be  10.8 
kcal/mol,  which  is  lower  than  the  barrier  for  the 
carbon  attack,  13.7  kcal/mol,  and  then  the  system 
becomes  allyl  intermediate  and  the  hydroperoxy 
group  adsorbed  on  silver  [Eq.  (4)].  This  intermedi¬ 
ate  is  also  more  stable  than  the  previous  one,  but 
the  hydroperoxy  group  is  unstable  and  further 
converted  into  OH  and  O  as  given  by  Eq.  (5):  The 
barrier  in  this  step  is  only  2.8  kcal/mol  and  the 
stabilization  energy  is  as  large  as  42.6  kcal/mol. 
When  two-site  interaction  between  allyl  and  silver 
surface  is  further  allowed,  the  system  is  further 
stabilized  by  18.1  kcal/mol.  Thus,  the  reaction 
route  starting  from  the  allylic  H  attack  is  always 
lower  than  that  starting  from  the  olefinic  C  attack, 
so  that  the  reactions  represented  by  Eqs.  (4)  and  (5) 
should  proceed  more  easily  than  the  reactions 
given  by  Eqs.  (2)  and  (3). 

We  next  examine  the  similar  reaction  given  by 
Eq.  (6)  involving  the  atomic  oxygen  adsorbed  on 
silver.  Figure  12  shows  the  optimized  reaction 
route  and  its  energetics.  The  lower  figures  give  the 
optimized  geometries  for  the  allylic  H  attack  and 
the  upper  ones  for  the  olefinic  C  attack.  The  barrier 
for  the  allylic  H  attack  is  essentially  zero:  only  0.1 
kcal/mol.  The  allyl  intermediate  is  more  stable 
than  the  intermediate  of  the  previous  C  attack: 
when  two-site  interaction  between  allyl  and  silver 
surface  is  allowed,  it  is  further  stabilized  by  18.1 
kcal/mol.  Thus,  again,  for  the  atomic  oxygen  on 
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FIGURE  1 1 .  Comparison  of  the  energy  diagrams  for  the  reactions  between  propylene  and  the  molecularly  adsorbed 
oxygen.  The  route  for  the  allylic  H  attack  Is  shown  by  the  solid  line,  and  those  for  the  carbon  attack  are  shown  by  the 
broken  and  waved  lines. 
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FIGURE  12.  Comparison  of  the  energy  diagrams  for  the  two  reactions  between  propylene  and  the  atomically 
adsorbed  oxygen  on  a  Ag  surface. 
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the  silver  surface,  the  allylic  H  attack  is  an  easier 
reaction  than  the  olefinic  C  attack.  The  allyl  inter¬ 
mediate  thus  produced  is  stable  on  the  silver  sur¬ 
face  so  that  it  will  be  further  attacked  by  the 
oxygen  species  on  the  surface  and  would  finally  be 
converted  into  CO2  and  H2O. 

Thus,  for  both  superoxide  and  atomic  oxygen 
on  a  silver  surface,  the  reaction  starting  from  the 
allylic  H  attack  proceeds  more  easily  than  the  one 
starting  from  the  olefinic  C  attack.  Therefore,  the 
conversion  to  propylene  oxide  does  not  effectively 
occur  on  a  silver  surface. 

The  origin  of  the  activation  of  this  allyl  hydro¬ 
gen  is  due  to  the  stability  of  the  preceding  ad¬ 
sorbed  allyl  intermediate.  We  have  investigated 
the  nature  of  the  allyl  intermediate  adsorbed  on  a 
silver  surface.  Figure  13  shows  the  optimized  ge¬ 
ometry  of  the  allyl  intermediate  on  Ag2  calculated 
with  n  0  and  n  =  1.  We  see  that  the  adsorption 
energy  is  larger  with  n  =  1  than  with  n  =  0,  indi¬ 
cating  that  the  allyl  species  on  silver  is  actually  an 
allyl  anion  species.  Actually,  the  charge  on  the 
allyl  group  was  -0.52  for  =  1,  while  it  was 
—  0.27  in  the  case  of  n  =  0.  The  existence  of  the 
anion  allyl  species  on  silver  surface  had  been  sug¬ 
gested  experimentally  by  Madix  et  al.  [23]  to  be 
the  intermediate  leading  to  the  complete  oxidation. 

BRIEF  SUMMARY 

Figure  14  is  a  summary  of  the  reaction  mecha¬ 
nism  of  propylene  on  a  silver  surface.  The  allylic 
hydrogen  is  more  reactive  than  the  olefinic  carbon. 


due  to  the  stability  of  the  allyl  anion  species  on  the 
silver  surface,  so  that  propylene  is  converted  to 
allyl  intermediate  and  further  converted  to  CO2 
and  H2O.  If  an  olefin  does  not  have  an  allylic 
hydrogen,  like  ethylene,  styrene,  etc.,  it  is  con¬ 
verted  to  epoxide  in  high  selectivity.  Even  if  it  has 
an  allylic  hydrogen  but  the  allyl  intermediate  is 
not  stable  like  in  norbornene,  the  route  leading  to 
epoxide  is  more  preferable.  This  explains  the  selec¬ 
tivity  of  the  silver  surface  for  the  epoxidation  reac¬ 
tions  of  olefins. 

We  note  that  the  present  results  clearly  show 
that  the  route  leading  to  propylene  oxide  also 
exists  on  a  silver  surface.  However,  it  is  not  the 
most  favorable  reaction  path.  The  selectivity  is  just 
between  2  and  5%  experimentally.  Since  propylene 
oxide  is  an  important  material  in  chemical  indus¬ 
try,  it  is  interesting  whether  we  can  block  up  the 
route  leading  to  the  allylic  intermediate. 


Concluding  Remarks 

We  reported,  in  this  study,  the  mechanisms  of 
the  epoxidation  and  complete  oxidation  reaction  of 
propylene  on  a  silver  surface  and  the  stabilities 
and  activities  of  the  oxygen  species  adsorbed  on 
Cu,  Ag,  and  Au  surfaces.  We  performed  the  (U)HF, 
MP2,  and  SAC/SAC-CI  calculations  with  the  use 
of  the  DAM  which  involved  the  interaction  be¬ 
tween  bulk  metal  and  admolecules  with  considera¬ 
tion  of  electron  transfer  and  image  force  correction. 


FIGURE  13.  Stability  of  the  allyl  species  adsorbed  on  Ag2  cluster  in  the  neutral  {n  ~  0)  and  anion  (n  =  1)  adcluster 
states. 
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FIGURE  14.  Mechanism  of  the  complete  oxidation  of  propylene  over  a  silver  surface;  (a)  with  molecularly  adsorbed 
oxygen,  (b)  with  atomically  adsorbed  oxygen. 


For  the  epoxidation  of  ethylene,  the  active 
species  is  the  superoxide,  which  is  molecularly 
adsorbed  on  the  surface  in  the  bent  end-on  geome¬ 
try.  Ethylene  attacks  the  terminal  oxygen  atom  of 
the  superoxide  leading  to  ethylene  oxide  quite 
smoothly.  The  importance  of  the  silver  surface  is 
quite  evident.  On  the  other  hand,  the  atomically 
adsorbed  oxygen  species  is  not  selective:  Both 
epoxidation  and  complete  oxidation  products 
would  be  obtained.  However,  to  promote  the  se¬ 
lectivity  of  the  atomically  adsorbed  species  will 
certainly  promote  the  total  selectivity  to  epoxida¬ 
tion  product.  In  the  process  leading  to  ethylene 
oxide,  the  electron  back-transfer  from  the  reaction 
adsorbate  complex  to  the  metal  surface  should  be 
important. 

The  origin  of  the  catalytic  activity  of  the  silver 
surface  is  due  to  its  ability  to  stabilize  the  superox¬ 
ide  species.  On  Cu  surfaces,  peroxide  is  much 
more  stable  than  superoxide,  so  that  the  lifetime  of 
the  superoxide  species  should  be  too  short  to  react 
with  ethylene.  On  Au  surfaces,  the  superoxide 
species  does  not  exist  because  the  molecular  ad¬ 
sorption  of  O2  does  not  occur  due  to  the  difficulty 
of  the  electron  transfer  from  the  clean  bulk  metal. 

For  the  oxidation  of  propylene  on  silver  surface, 
two  competitive  mechanisms  exist.  The  dominant 
mechanism  for  the  epoxidation  of  olefins  is  initi¬ 
ated  by  the  activation  of  the  olefinic  carbon.  The 
dominant  mechanism  for  the  complete  oxidation 
of  propylene  is  due  to  the  activation  of  the  allylic 
C — H  bond.  The  latter  is  more  favorable  due  to 
the  formation  of  the  stable  adsorbed  allyl  interme¬ 


diate,  which  is  the  motivation  force  for  the  com¬ 
plete  oxidation  of  propylene. 

We  note  that  the  proposed  two  competitive 
mechanisms  not  only  reflect  the  difference  be¬ 
tween  ethylene  and  propylene  but  also  explain 
rather  well  the  available  experimental  facts  for  the 
epoxidation  and  complete  oxidation  of  olefins. 
Therefore,  it  reflects  a  general  mechanism  to  un¬ 
derstand  the  oxidation  of  olefins  over  silver  sur¬ 
face. 
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ABSTRACT:  We  have  developed  procedures  for  determining  the  potential,  electron 
density,  and  current  at  a  planar  metal  surface  with  an  applied  static  field  in  vacuum. 
These  calculations  are  made  using  density-functional  theory  within  the  local  density 
approximation  (LDA)  for  the  Kohn-Sham  exchange  and  correlation.  Several  different 
techniques  were  compared  including  the  use  of  different  expressions  for  the  exchange 
and  correlation  energies.  The  steady-state  response  of  field  emission  to  a  laser  is  found 
using  Floquet  methods  to  solve  the  dipole  approximation  of  the  time-dependent 
Schrodinger  equation  with  the  static  potential  obtained  from  density-functional  theory. 
These  simulations  show  that  there  is  a  resonance  in  the  response  that  causes  a  significant 
increase  in  the  tunneling  current.  This  resonance  occurs  for  electrons  that  are  promoted 
above  the  barrier  by  absorbing  quanta  from  the  laser  when  the  line  integral  of  the 
momentum  between  the  turning  points  is  equal  to  h /2.  ©  1997  John  Wiley  &  Sons,  Inc.  Int 

J  Quant  Chem  65:  857-865,  1997 


Introduction 

Photofield  emission  [1,  2],  the  study  of  the 
energy  distribution  of  electrons  emitted  from 
a  metal  surface  subjected  to  an  intense  static  elec¬ 
tric  field  plus  a  weaker  radiation  field,  is  used  to 
characterize  the  surface  density  of  states  in  metals. 

Lee  and  Robbins  [3]  measured  the  time  depen¬ 
dence  of  the  total  current  from  a  field  emitter 
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when  the  laser  beam  was  chopped  with  an  elec¬ 
tromechanical  shutter.  They  found  that  under  con¬ 
ditions  that  are  typical  of  the  photofield  studies, 
the  rise  time  of  the  current  is  approximately  1  ms. 
They  showed  by  analysis  that  the  thermal  relax¬ 
ation  time  is  within  one  order  of  magnitude  of  the 
rise  time,  which  they  measured.  Thus,  Lee  and 
Robbins  concluded  that  the  laser-induced  increase 
in  the  current  is  primarily  the  result  of  tip  heating, 
and  this  raises  doubt  regarding  much  of  the  pub¬ 
lished  photofield  data.  However,  few  references 
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have  been  made  to  this  study  by  Lee  and  Robbins, 
and  thermal  effects  are  not  mentioned  in  many  of 
the  subsequent  photofield  studies. 

It  is  the  purpose  of  this  study  to  use  numerical 
simulations  based  on  the  local  density  approxima¬ 
tion  of  Kohn-Sham  density-functional  theory  to 
examine  the  nonthermal  effects  of  laser-assisted 
field  emission  in  order  to  clarify  the  mechanism 
for  photofield  emission.  A  number  of  different 
methods  have  been  used  to  model  laser-assisted 
field  emission,  but  most  of  these  are  one-particle 
solutions  [4“6]  in  which  the  time-dependent 
Schrodinger  equation  is  solved  for  a  specified  ap¬ 
proximation  of  the  potential,  and  all  of  the  effects 
of  electron-electron  interactions  are  neglected.  A 
many-particle  approach  [7,  8]  that  is  based  on 
nonequilibrium  Green's  functions  [9]  shows  con¬ 
siderable  promise,  but  we  chose  to  use  density- 
functional  theory  [10,  11]  in  the  present  study 
because  we  consider  this  procedure  to  be  better 
understood  due  to  a  longer  and  more  extensive 
history  of  application. 


Analysis 

The  density-functional  formalism,  which  was 
introduced  by  Hohenberg,  Kohn,  and  Sham  [12, 
13],  is  a  many-particle  approach  in  which  the  prop¬ 
erties  of  a  system  (such  as  a  metal  surface)  are 
determined  as  functions  of  only  the  electron  den¬ 
sity  n{x).  We  use  a  jellium  model  in  which  the 
positive  ions  within  the  metal  half-space  (x  <  0) 
are  replaced  by  a  uniform  positive  charge  back¬ 
ground  with  density  n  +  (x).  The  many-particle 
problem  of  determining  the  potential  and  electron 
density  at  a  metal  surface  may  be  solved  through 
density-functional  theory  by  finding  a  consistent 
solution  for  the  following  set  of  four  equations: 
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Here,  the  one-particle  Schrodinger  equation  (1) 
is  solved  to  determine  the  wave  functions  for  a 
series  of  energies  representing  the  distribution 
of  normal  energies  in  the  metal.  The  electron  den¬ 
sity  n(x)  is  determined  by  normalizing  the  sum¬ 
mation  of  the  probability  densities  in  Eq.  (2).  The 
Coulomb  portion  of  the  effective  potential  re¬ 
quired  in  Eq.  (1)  is  determined  by  integration  of 
Poisson's  equation  as  shown  in  Eq.  (3X  in  which  it 
is  assumed  that  the  Coulomb  potential  and  electric 
field  are  zero  deep  within  the  metal.  A  number  of 
different  methods  have  been  used  to  approximate 
the  exchange-correlation  contribution  to  the  effec¬ 
tive  potential  [14-16].  In  most  calculations  we  have 
used  the  Wigner  interpolation  form  as  shown  in 
Eq.  (4),  which  was  derived  using  the  local  density 
approximation  (LDA),  but  we  have  also  tested  an 
approximation  by  Gell-Mann  and  Brueckner  [16] 
which  is  more  appropriate  for  high  electron  densi¬ 
ties.  More  exact  calculations  have  shown  [17]  that, 
for  an  electron  at  a  jellium  edge,  the  Fermi  hole 
calculated  within  the  LDA  approximates  the  exact 
hole.  Thus,  the  corresponding  LDA  force  field  at 
the  surface  is  fairly  accurate;  thus  it  is  reasonable 
to  use  this  approximation  in  simulating  field  emis¬ 
sion.  Conventional  mks  units  are  usedl  in  place  of 
atomic  units  throughout  this  work,  so  that  Eq.  (4) 
contains  the  symbol  Aq  denoting  the  Bohr  radius, 
which  is  approximately  0.052917  nm. 


Solutions  for  Static  Problems 

In  static  problems  we  solve  the  system  of  Eqs. 
(l)-(4)  by  means  of  iteration.  We  begin  by  approx¬ 
imating  the  effective  potential  that  is  required  in 
Eq.  (1)  by  using  either  the  Fowler-Nordheim 
model  [18]  or  a  closed-form  approximation  of  a 
variational  solution  [19].  The  numerically  efficient 
method  of  back-propagation  [20]  is  used  to  solve 
the  Schrodinger  equation  without  forming  a  ma¬ 
trix.  The  solutions  are  obtained  for  a  large  number 
of  energies  Ej  (for  J  =  1  to  N)  to  simulate  the 
distribution  of  normal  energies  which  was  derived 
by  Modinos  [21]. 

We  have  used  two  different  methods  to  intro¬ 
duce  an  applied  static  field  into  the  formulation  of 
Eqs.  (l)-(4)  in  order  to  simulate  field  emission 
from  a  metal  surface,  and  we  have  found  that 
these  two  methods  give  identical  results.  In  the 
first  approach  the  normalization  of  the  electron 
density  in  Eq.  (2)  is  adjusted  so  that  there  is  a  net 
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charge  within  the  descretized  region,  and  the  mag¬ 
nitude  of  this  charge  will  produce  the  specified 
value  of  the  applied  static  field.  In  this  method  the 
applied  static  field  does  not  appear  explicitly  in 
the  calculations.  In  the  second  approach  the  nor¬ 
malization  of  the  electron  density  in  Eq.  (2)  is 
adjusted  so  that  the  descretized  region  is  neutral, 
and  the  applied  static  field  is  specified  throughout 
the  entire  space  including  the  metal.  All  of  the 
examples  in  this  article  are  for  tungsten  metal.  We 
have  assumed  that  tungsten  has  a  density  of  19.30 
g/cm^  and  that  there  are  6  free  electrons  per  atom. 
Thus,  we  assume  a  Fermi  energy  of  19.1  eV,  a  bulk 
electronic  density  of  3.79  X  10^^/mtr^  (0.0562  in 
atomic  units),  and  a  Wigner-Seitz  radius  of  0.0857 
nim  (1.62  bohrs). 

Figures  1  and  2  show  the  potential  barrier  and 
the  electron  density  at  a  tungsten  surface  with 
applied  static  fields  of  5  and  2  V /nm,  respectively. 
In  both  of  these  figures  the  dashed  curves  repre¬ 
sent  the  pototial  barrier  for  the  Fowler-Nordheim 
model  [18],  which  requires  specifying  the  work 
function  (4.5  eV)  and  includes  a  correction  from 
image  theory.  The  results  of  our  iterative  calcula¬ 
tions  are  shown  as  solid  curves  for  the  potential 
barrier,  and  dotted  curves  for  the  electron  density. 
The  iterative  solutions  for  these  two  figures  re¬ 


quired  under  relaxation  for  convergence,  so  that  at 
each  step  in  the  iterations  the  potential  was  up¬ 
dated  by  setting  =  aVj(x)  +  (1  - 

a)Vj_i(x).  A  relaxation  factor  (a)  of  0.001  was 
used  in  the  iterations  with  a  total  of  200  energies. 
A  total  of  1000  intervals  was  used  on  the  x  axis  to 
descretize  a  length  of  2  nm  of  metal  and  6.5  nm  of 
vacuum  with  an  increment  of  A  x  =  0.0085  nm. 

The  iterative  static  density-functional  solutions 
for  the  potential  that  are  shown  in  Figures  1  and  2 
converge  with  consistent  results  as  the  depth  of 
the  metal  that  is  discretized  is  varied  from  2  to  8 
nm,  the  number  of  electron  energies  is  changed 
from  50  to  1000,  and  the  number  of  intervals  is 
varied  from  200  to  1500.  The  upper  values  given 
for  these  parameters  are  practical  limits  for  calcu¬ 
lations  with  a  personal  computer.  The  length  of 
vacuum  that  is  descretized  must  be  increased  for 
very  low  values  of  the  applied  static  field  so  that 
the  potential  at  the  far  end  of  the  grid  is  well 
below  the  Fermi  level.  There  was  no  significant 
change  when  the  calculations  for  Figures  1  and  2 
were  repeated  using  the  approximation  by  Gell- 
Mann  and  Brueckner  [16]  in  place  of  the  Wigner 
interpolation  form  for  the  exchange-correlation 
contribution  to  the  effective  potential  as  shown  in 
Eq.  (4).  Figures  1  and  2  show  that  both  the  height 
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FIGURE  1 .  Potential  barrier  and  electron  density  at  a  tungsten  surface  with  a  static  field  of  5  V/nm.  The  potential  and 
electron  density  are  represented  by  a  solid  curve  and  a  dotted  curve,  respectively.  The  dashed  curve  is  the  potential  for 
the  Fowler-Nordheim  model. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


859 


HAGMANN 


CO 

I 

* 

CL 


(D 

Z 

LU 

Q 

Z 

o 

CL 


O 

LU 

-i 

LU 


(3 

O 


FIGURE  2.  Potential  barrier  and  electron  density  at  a  tungsten  surface  with  a  static  field  of  2  V/nm.  The  potential  and 
electron  density  are  represented  by  a  solid  curve  and  a  dotted  curve,  respectively.  The  dashed  curve  is  the  potential  for 
the  Fowler-Nordheim  model. 


of  the  barrier  and  the  length  of  the  tunneling 
region  are  decreased,  thus  increasing  the  current, 
as  the  applied  static  field  is  increased. 

Figure  3  shows  the  maximum  height  of  the 
potential  barrier  at  a  tungsten  surface  as  a  function 
of  the  applied  static  field.  Values  obtained  using 


the  Fowler-Nordheim  model  are  shown  with  a 
solid  curve,  and  the  results  of  our  iterative  calcula¬ 
tions  are  shown  as  solid  squares  connected  with  a 
dashed  curve.  As  in  the  first  two  figures,  the 
iterative  calculations  were  made  using  a  relaxation 
factor  of  0.001,  a  total  of  200  energies,  and  a  depth 


FIGURE  3.  Barrier  height  at  a  tungsten  surface  as  a  function  of  the  applied  static  field.  Iterative  solutions  are  shown  as 
solid  squares  connected  by  a  dashed  curve,  and  the  solid  curve  is  for  the  Fowler-Nordheim  model. 
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of  2  nm  of  metal.  The  results  in  Figure  3  appear  to 
show  that  there  is  fairly  good  agreement  between 
density-functional  theory  and  the  Fowler-Nord- 
heim  model.  However,  it  may  be  seen  in  Figures  1 
and  2  that  while  the  maximum  height  of  the  bar¬ 
rier  may  be  in  good  agreement  for  the  two  models, 
the  peak  is  moved  closer  to  the  surface  and  the 
length  of  the  tunneling  region  is  increased  with  the 
Fowler-Nordheim  model.  Thus,  we  conclude  that 
the  value  of  the  tunneling  current  would  be  under¬ 
estimated  with  the  Fowler-Nordheim  model. 


Steady-State  Response  of  Field 
Emission  to  a  Laser 

We  begin  by  replacing  Eq.  (1)  with  the  dipole 
approximation  [22]  for  the  one-dimensional  time- 
dependent  Schrodinger  equation  for  a  static  poten¬ 
tial  plus  a  radiation  field: 

d'^%  d% 

— - ^  +  [1/0  +  exEnCOs((ot)]%  =  ih - , 

2m  dx^  ^  ^  ^  dt 

(5) 

where  the  radiation  is  polarized  in  the  axial  direc¬ 
tion,  and  the  static  potential  Vq  and  the  magnitude 
of  the  electric  vector  of  the  optical  field  Eq  may  be 
functions  of  x.  Floquet's  theorem  [23]  may  be  used 
to  write  the  solution  of  Eq.  (5)  in  the  following 
form: 

00 

E  </';,Nexp[ -t(Ej  +  (6) 

—  00 

where  the  are  functions  of  x  that  must  be 
determined.  We  have  shown  previously  [6]  that 
consistency  of  Eqs.  (5)  and  (6)  for  arbitrary  values 
of  time  requires  that  the  following  ordinary  differ¬ 
ential  equation  is  satisfied: 

N-1  N+l]^^0  “ 

We  have  used  several  different  methods  [24]  to 
solve  Eq.  (5).  In  the  first  procedure,  which  is  the 
one  used  for  the  examples  in  this  study,  shooting 
methods  [25]  are  used  to  solve  the  system  of  si¬ 
multaneous  ordinary  differential  equations  result¬ 
ing  from  Eq,  (7).  We  require  that  each  wave  func¬ 
tion  is  consistent  with  a  single  plane  wave 


having  the  appropriate  energy  incident  upon  the 
barrier,  but  a  full  spectrum  of  energies  is  required 
for  the  reflected  and  transmitted  waves.  Recursion 
is  used  with  finite  difference  approximations  to 
march  through  the  barrier,  thus  determining  the 
wave  function  at  the  far  side  of  the  barrier  as 
functions  of  the  known  incident  wave  and  the 
unknown  reflected  waves  at  the  near  side.  Consis¬ 
tency  of  the  transmitted  wave  function  with  only 
outgoing  transmitted  waves  is  enforced  which  re¬ 
sults  in  a  numerical  solution  for  the  wave  func¬ 
tions  at  all  points  in  space.  This  is  an  extension  of 
the  numerically  efficient  method  of  back-propa¬ 
gation  [20],  which  we  use  to  solve  the  Schrodinger 
equation  for  static  problems. 

It  is  possible  to  replace  Eq.  (2)  with  a  double 
summation  over  the  indices  I  and  N  to  determine 
the  electron  density  as  a  function  of  position  and 
time,  and  to  replace  Eq.  (3)  with  an  expression  for 
the  time-dependent  scalar  potential.  However,  it  is 
not  clear  as  to  what  should  be  used  in  place  of  Eq. 
(4)  to  determine  the  time-dependent  exchange-cor¬ 
relation  contribution  to  the  effective  potential.  Sev¬ 
eral  different  formulations  have  been  presented  to 
extend  the  static  density-functional  theory  to  a 
time-dependent  formulation  [26-30],  but  we  have 
not  yet  implemented  any  of  them.  In  particular, 
we  are  concerned  that  the  random  phase  approxi¬ 
mation  (RPA)  may  not  be  valid  at  optical  frequen¬ 
cies.  Others  have  used  an  adiabatic  approximation 
[31,  32]  in  which  they  assume  that  the  dynamic 
solution  is  equivalent  to  a  sequence  of  static  solu¬ 
tions  that  are  obtained  for  various  times  during  a 
cycle  of  the  optical  signal.  The  adiabatic  approxi¬ 
mation  has  been  used  to  determine  optical  second 
harmonic  generation  with  metals.  However,  we 
feel  that  it  is  difficult  to  justify  this  approach, 
except  in  the  far  infrared  spectrum,  because  it 
neglects  all  effects  of  the  exchange  of  photons  with 
the  optical  source,  a  mechanism  which  is  evident 
in  Eqs.  (6)  and  (7). 

As  a  first  approach  to  the  time-dependent  prob¬ 
lem,  we  have  simulated  the  steady-state  response 
of  field  emission  to  a  laser  by  assuming  that  the 
time-averaged  response  by  the  electrons  is  the 
same  as  for  the  static  problem,  and  the  effects  of 
the  time-dependent  response  are  included  in  the 
motion  of  the  plasma  which  causes  screening  of 
the  radiation  field.  Dominance  of  the  static  re¬ 
sponse  may  be  seen  in  that  even  with  a  laser 
power  flux  density  of  10^®  W/m^,  with  a  typical 
applied  static  field  of  5  V /nm,  the  optical  electric 
field  is  0.055  percent  of  the  static  field.  Thus,  we 
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use  density^functional  theory  to  determine  the  ef¬ 
fective  potential  considering  only  the  static  field, 
and  then  solve  Eq.  (5)  allowing  for  the  exponential 
decay  of  the  optical  field  caused  by  screening 
within  the  metal.  Others  have  used  hydrodynamic 
models  to  simulate  the  interaction  of  an  intense 
laser  pulse  with  a  metal,  and  they  have  used  the 
Drude  approximation  to  determine  the  frequency 
dependence  of  the  complex  conductivity  in  order 
to  estimate  the  penetration  of  optical  fields  into  the 
metal  [33].  We  also  use  the  Drude  model  [34],  and 
thus  we  assume  that  the  depth  into  the  metal  at 
which  the  optical  fields  decay  by  1  /e  is  given  by 


1 


where  the  plasma  frequency,  corresponds  to  a 
wavelength  of  54.3  nm  for  tungsten.  For  example, 
the  depth  of  penetration  in  tungsten  is  8.65,  8.66, 
and  10.3  nm  at  wavelengths  of  10  /xm,  1/xm,  and 
100  nm,  respectively.  We  have  found  that  the  re¬ 
sults  of  our  calculations  are  independent  of  the 
depth  of  metal  that  is  descretized  when  this  depth 
is  80  nm  or  greater,  so  a  depth  of  100  nm  was  used 
in  each  of  the  following  examples  which  simulate 
the  steady-state  response  of  field  emission  to  a 
laser. 

Figure  4  shows  the  ratio  of  the  tunneling  cur¬ 
rent  with  a  laser  to  that  with  no  laser  as  a  function 


of  the  optical  wavelength  for  electrons  with  nor¬ 
mal  energies  of  0.95,  0.96,  0.97,  0.98,  0.99,  and  1.00 
(from  top  to  base  of  the  figure)  of  the  Fermi  level. 
The  calculations  were  made  for  a  tungsten  surface 
with  an  applied  static  field  of  5  V /nm  and  a  laser 
power  flux  density  of  10^®  W/m^. 

A  resonance  in  which  the  current  is  significantly 
increased  by  the  laser  may  be  seen  in  each  curve  of 
Figure  4.  The  mechanism  for  this  resonance  is 
reinforcement  of  the  wave  function  by  reflections 
at  the  turning  points.  Thus,  the  resonance  occurs 
for  electrons  that  are  promoted  above  the  barrier 
by  absorbing  quanta  from  the  optical  field  when 
the  line  integral  of  the  momentum  between  the 
turning  points  is  equal  to  h /2,  where  h,  is  Planck's 
constant.  It  may  be  seen  in  Figure  3  that  with  an 
applied  static  field  of  5  V /nm  a  single  photon  at 
730  nm  has  sufficient  energy  to  promote  an  elec¬ 
tron  at  the  Fermi  level  to  an  energy  above  the 
potential  barrier.  For  the  special  case  of  a  square 
barrier  the  resonance  requires  that  the  barrier 
length  is  an  integral  multiple  of  one-half  the  De 
Broglie  wavelength  for  electrons  that  are  promoted 
above  the  barrier  by  absorbing  single  photons  [35]. 
This  relation  of  the  barrier  length  to  the  De  Broglie 
wavelength  is  also  the  criterion  for  the  transmis¬ 
sion  coefficient  to  be  maximum  for  the  static  prob¬ 
lem  in  which  the  energy  of  a  particle  is  above  a 
square  barrier  [36].  For  square  barriers  we  find 


OPTICAL  WAVELENGTH,  MICRONS 

FIGURE  4.  Current  ratio  with  /  without  a  laser  vs.  optical  wavelength  for  electrons  with  normal  energies  of  0.95,  0.96, 
0.97,  0.98,  0.99,  and  1.00  (top  to  base)  in  tungsten  with  a  static  field  of  5  V/nm  and  an  optical  flux  of  10^°  W/m®. 
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that  the  resonance  has  multiple  sharp  peaks  [35] 
instead  of  the  single  broad  peak  that  is  seen  in 
each  curve  of  Figure  5.  These  differences  are  con¬ 
sistent  with  results  for  the  static  problem  [36]  and 
are  attributed  to  the  decreased  definition  of  the 
turning  points  when  there  are  no  discontinuities  in 
the  potential  [37]. 

Figure  5  shows  the  ratio  of  the  tunneling  cur¬ 
rent  with  a  laser  to  that  with  no  laser  as  a  function 
of  the  optical  wavelength  allowing  for  the  full 
distribution  of  energies  in  the  metal  at  tempera¬ 
tures  of  -  273,  200,  1000,  1500,  and  2000°C,  for  the 
curves  from  the  top  to  the  base  of  the  figure, 
respectively.  The  calculations  were  made  for  a 
tungsten  surface  with  an  applied  static  field  of  5 
V/nm  and  a  laser  power  flux  density  of  10^° 
W/m^.  The  resonance  is  more  prominent  at  lower 
temperatures,  and  this  may  be  understood  because 
of  the  more  narrow  distribution  of  energies.  We 
have  repeated  the  calculation  for  200°C  using  the 
Fowler-Nordheim  model  for  the  potential  barrier. 
The  results  are  similar,  but  the  static  current  den¬ 
sity  is  reduced  by  22%,  and  the  resonance  has  a 
current  enhancement  of  0.227%  at  a  wavelength  of 
337  nm  instead  of  0.242%  at  416  run.  Each  of  these 
changes  may  be  attributed  to  the  somewhat  greater 
length  and  height  of  the  barrier  with  the  Fowler- 
Nordheim  model. 

Figure  6  shows  the  ratio  of  the  tunneling  cur¬ 
rent  with  a  laser  to  that  with  no  laser  as  a  function 


of  the  optical  wavelength  for  a  tungsten  surface  at 
200°C  with  a  laser  power  flux  density  of  10^® 
W/m^.  The  value  of  the  applied  static  field  is  4.1, 
4.3,  4.5,  4.7,  5.0,  and  5.5  V /nm,  for  the  curves  from 
the  top  to  the  base  of  the  figure,  respectively.  This 
figure  shows  that  as  the  applied  static  field  is 
reduced,  the  resonance  shifts  to  a  shorter  wave¬ 
length,  but  there  is  also  an  increased  response  at 
longer  wavelengths.  We  find  that  in  the  limit  as 
the  applied  static  field  is  reduced  to  values  much 
less  than  4.0  V/nm  the  behavior  approaches  that 
of  the  photoelectric  effect,  that  is,  a  sharp  all-to- 
nothing  transition  in  the  transmission  occurs  at  a 
single  wavelength.  This  may  be  understood  be¬ 
cause,  for  weak  applied  static  fields,  the  height 
and  length  of  the  barrier  are  so  large  that  the 
probability  of  tunneling  is  negligible  so  that  trans¬ 
mission  requires  promotion  above  the  barrier  by 
the  sustained  absorption  of  a  photon. 


Discussion 

Our  results  agree  with  experimental  data  from 
photofield  studies  in  that  the  increase  in  the  emit¬ 
ted  current  that  is  caused  by  a  laser  is  proportional 
to  the  laser  power  flux  density,  maximum  for 
polarization  normal  to  the  emitting  metal  surface, 
and  from  the  electrons  that  absorb  a  single  photon. 
However,  it  is  generally  assumed  that  the  spectral 


FIGURE  5.  Current  ratio  with /without  a  laser  vs.  optical  wavelength  for  tungsten  at  -273,  200,  1000,  1500,  and 
2000°C  (top  to  base)  with  a  static  field  of  5  V/nm  and  an  optical  flux  of  10“'°  W/m^. 
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FIGURE  6.  Current  ratio  with  /  without  a  laser  vs.  optical  wavelength  for  tungsten  at  200°C  with  static  fields  of  4.1 , 4.3, 
4.5,  4.7,  5.0,  and  5.5  V/nm  (top  to  base)  and  an  optical  flux  of  10^°  W/m^. 


response  of  photofield  emission  is  rather  simple; 
strongest  for  illumination  near  the  blue  end  of  the 
visible  spectrum  and  virtually  undetectable  at 
longer  wavelengths.  By  contrast,  our  simulations 
suggest  that  the  spectral  response  is  more  com¬ 
plex,  and  that  it  is  quite  sensitive  to  the  value  of 
the  applied  static  field.  We  believe  that  this  differ¬ 
ence  may  be  caused  by  (1)  the  focus  of  photofield 
studies  being  on  electron  energy  distributions 
rather  than  the  spectral  response,  and  (2)  the  pres¬ 
ence  of  thermal  effects  in  the  data.  Our  results 
suggest  that  the  nonthermal  effects  which  we  have 
simulated  occur  on  a  time  scale  on  the  order  of  10 
fs  so  that  these  effects  could  be  separated  from 
thermal  effects  by  using  lasers  that  are  amplitude 
modulated  at  radio  frequencies  in  photofield  stud¬ 
ies. 
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ABSTRACT:  Structural  parameters  and  energy  have  been  calculated  for  Ci_3  and  C9 
clusters  using  density  functional  theory  through  the  deMon  program.  The  Ci_3  clusters 
were  fully  optimized  using  deMon;  different  basis  sets  were  tested  in  order  to  choose  the 
suitable  one  to  be  used  in  the  C9  clusters.  In  the  case  of  C2  the  results  were  compared 
with  experimental  values.  DZVP2  basis  was  selected  because  it  always  gave  the  closest 
value  to  the  experimental  data.  The  C9  carbon  clusters  were  designed  in  relation  to 
C(IOO)  diamond  surface.  The  energies  of  the  nonrelaxed  and  relaxed  surfaces  with  and 
without  hydrogen  were  calculated.  In  the  same  way  CH3-relaxed  and  CH2-relaxed 
species  were  calculated.  The  diamond  growth  mechanism  proposed  in  the  literature  was 
evaluated  by  an  energy  analysis.  The  C — CH3  distance  is  reported.  ©  1997  John  Wiley  & 
Sons,  Inc.  Int  J  Quant  Chem  65:  867-875,  1997 


Introduction 

Diamond  surface  has  attracted  a  lot  of  atten¬ 
tion  because  of  the  technological  applica¬ 
tions  in  thin  films  grown  by  chemical  vapor  depo¬ 
sition  (CVD)  techniques  [1].  Many  experimental 
and  theoretical  studies  have  been  made  over  the 
last  decades.  Nowadays  several  ideas  have  been 
suggested  about  the  characteristics  of  the  diamond 
surface,  the  mechanism  of  diamond  growth  and 
the  precursor  agent  of  this  chemisorption  process. 
Correspondence  to:  E.  Agacino  Valdes. 
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Although  the  first  theoretical  studies  of  dia¬ 
mond  surfaces  focussed  in  (111)  surface,  at  the 
present,  it  is  known  that  the  (100)  face  is  prevalent 
in  diamond  growth  under  conventional  CVD  con¬ 
ditions  [2].  This  orientation  has  the  lowest  density 
defect,  making  it  more  appropriate  for  electronic 
applications. 

Density  functional  theory  (DFT)  will  be  applied 
to  the  study  of  the  energies  involved  in  the  relax¬ 
ation  and  chemisorption  process  of  C(IOO)  dia¬ 
mond  surface  to  grow  thin  film;  to  do  this,  C9 
carbon  clusters  resembling  the  C(IOO)  diamond 
surface  will  be  used.  Before,  small  carbon  clusters 
C1.3  were  calculated  using  different  basis  sets  in 
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order  to  select  an  adequate  one  to  be  used  in  C9 
carbon  clusters. 


Method  and  Methodology 

In  this  work,  the  code  deMon  based  in  density 
functional  theory  was  used.  This  program  imple¬ 
ments  the  linear  combinations  of  Gaussian-type 
orbitals  (LCGTO-DF)  [3].  These  methods  have  been 
described  extensively.  The  calculations  were  made 
in  local  spin  density  approximation  (LSD A). 

The  geometries  of  the  C1.3  clusters  and  C9 
(non-relaxed  hydrogenated,  relaxed,  and  relaxed 
hydrogenated)  were  fully  optimized  by  deMon, 
but  in  other  C  9-carbon  clusters  only  the  adsorbed 
species  (CH3  and  CH2)  were  optimized.  The  start¬ 
ing  geometry  for  the  surface  cluster  was  taken 
from  other  calculations  involving  surface  models 
[4]  and  therefore  it  was  kept  frozen. 

For  the  small  carbon  clusters  (C1.2),  four  basis 
sets  were  tested:  STO-3G(33/3),  DZVP(621/ 
41/1*),  DZVP2(721/51/1*),  and  TZVP(7111/ 
411/1*).  The  results  of  the  C2  cluster  were  com¬ 
pared  with  experimental  and  theoretical  data.  The 
basis  set  which  produced  the  best  compromise 
between  bond  energy  and  bond  distance  was  cho¬ 
sen  as  suitable  to  be  used  in  C9  carbon  clusters. 

C  ATOM 

It  is  known  that  a  carbon  atom  has  a  (2s^2p^) 
^P  ground  state  with  two  unpaired  electrons.  The 
excited  state  (2s^2p^)  is  ^S,  with  four  unpaired 
electrons  which  explain  the  tetravalency  of  this 
element  and  all  the  chemistry  of  the  hydrocarbons. 

Table  I  shows  the  calculated  total  energy  of  the 
singlet  and  triplet  states.  Notice  that,  in  agreement 
with  the  experimental  evidence,  the  triplet  state  is 
the  most  stable  species,  but  the  computed  differ¬ 
ence  between  singlet  and  triplet  states  is  only 
about  5  X  10"^  for  all  the  basis  considered.  It  is  a 
very  low  value  compared  with  experimental  one. 
The  DZVP2  basis  together  with  TZVP  and  DZVP 
gave  the  closest  energy  value  in  relation  to  the 
experiments. 

C2  MOLECULE 

Dicarbon  molecule  has  eight  valence  electrons, 
and  the  molecular  orbital  theory  establishes  a 
KK(cr^2s)^(a-/2s)^(7r^2p)^  electronic  configuration 


TABLE  I _ 

Total  energies  for  a  carbon  atom. 


Basis  set 

£7-  (hartrees) 

Singlet 

Triplet 

STO-3G 

-36.89954 

-36.95530 

DZVP 

-37.41906 

-37.46365 

DZVP2 

-37.42278 

-37.46727 

TZVP 

— 

-37.46729 

[5] 

— 

-37.68 

[6] 

— 

-37.6886 

which  corresponds  to  a  ground  state.  A 

(7r^^2p)\a^2py  external  electronic  configuration  is 
the  ^II„  excited  state.  The  spectrum  of  this 
molecule  has  been  extensively  studied  and  at  the 
beginning  it  was  found  that  the  Mulliken  bands 
(which  arise  from  the  ^  lower  state)  appeared  at 
comparable  intensity  with  the  Swan  bands  (which 
arise  from  the  lower  state).  These  results  raise 
the  question  about  which  state  is  the  ground  state 
[7],  Today,  Ballik  and  Ramsay's  [8]  observations 
have  fixed  ^2^  at  0.075  eV  below  ^11^^,  and  ^2^  is 
the  ground  state  for  only  a  small  margin. 

Fougere  and  Nesbet  [9]  have  studied  the  elec¬ 
tronic  structure  of  this  molecule  over  the  range 
Rg  =  1.5^0  to  S.O^Q  using  different  Slater-type  or¬ 
bitals  basis  sets  and  two  different  levels  of  config¬ 
uration  interaction.  The  best  calculations  obtained 
by  Fougere  and  Nesbet  were  1.2855  A  and  6.76  eV 
for  the  ^  2^  state;  for  the  triplet  state  the  best  bond 
energy  result  (5.45  eV)  was  not  the  best  equilib¬ 
rium  distance  (1.3937  A)  because  another  calcula¬ 
tion  gave  4.00  eV  and  1.3261  A,  respectively.  As  it 
can  be  seen,  the  state  is  predicted  as  the 
ground  state.  At  the  present  6.25  ±  0.2  eV  [10]  and 
6.12  eV  [11]  are  recognized  as  the  best  experimen¬ 
tal  values  of  C2  bond  energies. 

Tables  II  and  III  show  the  total  energies  and  the 
bond  lengths  computed  for  the  triplet  and  the 
singlet  state  and  the  experimental  results  cited 
above.  Other  calculations  are  also  tabulated. 

It  is  important  to  notice  that  the  total  energies 
calculated  in  both  two  states  ^X^  and  are 
very  close;  this  is  the  reason  for  which  the  determi¬ 
nation  of  the  ground  state  has  been  controversial 
[12]. 

The  DZVP2  basis  gives  the  best  bond  energy  in 
comparison  to  experimental  and  theoretical  re¬ 
sults.  For  the  bond  length  the  DZVP2  basis  is  not 
the  best,  in  this  case  TZVP  basis  is  better,  but  since 
the  differences  in  bond  lengths  are  of  the  order  of 
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TABLE  II 


Total  energies  for  a  ^Xg  state  in  dicarbon  molecule. 

Basis  set 

Ej  (hartrees) 

To  (A) 

Do  (eV) 

STO-3G 

-74.21381 

1 .2637 

8.25 

DZVP 

-75.18474 

1 .2734 

7.01 

DZVP2 

75.18873 

1 .2786 

6.92 

TZVP 

75.19739 

1.2518 

7.15 

[9] 

— 

1 .2855 

6.76 

[10] 

— 

1.243 

6.25  ±  0.2 

[11] 

— 

1.242 

6.12 

[13] 

— 

1 .2422 

6.36 

[14] 

75.531 

1.252 

6.24 

1%,  this  parameter  is  not  as  important  as  the 
energy,  and  DZVP2  basis  is  the  most  suitable  to  be 
used.  These  results  are  in  agreement  with  consid¬ 
erations  about  a  best  description  C2  ground  state 
must  use  a  large  configuration  mixing  in  its  ground 
state,  therefore,  the  results  can  be  improved  using 
an  extended  basis  [15]. 

C3  MOLECULE 

The  C3  molecule  is  just  barely  linear  as  is  in¬ 
dicated  by  its  very  small  bending  frequency 
(63  cm"^).  Although  Douglas  [16]  has  established 
that  C3  is  linear,  others  say  that  this  molecule 
might  be  slightly  nonlinear  [17].  Liskow  et  al.  [18] 
have  made  self-consistent  field  (SCF)  and  configu¬ 
ration  mixing  calculations  and  found  that  a  reason¬ 
ably  large  basis  set,  including  d  functions,  is  nec¬ 
essary  to  account  the  small  bending  frequency,  but 
the  configuration  mixing  procedure  is  not  needed; 
they  computed  the  orbital  energies  as  a  function  of 
the  angle  at  ~  1.27  A  of  distance  and  found  that 
the  larger  basis  predicts  C3  to  be  linear,  in  agree¬ 
ment  the  experiment.  Peric  Radic  [19]  also  studied 
the  ground  state  and  the  first  two  excited  states  of 
C3  which  are  experimentally  known;  they  used  a 


TABLE  III 


Total  energies  for  a  state  in  dicarbon  molecule. 

Basis  set 

Ej  (hartrees) 

/■o(A) 

Do  (eV) 

STO-3G 

-74.21701 

1 .3371 

8.34 

DZVP 

-75.20247 

1.3197 

7.49 

DZVP2 

75.20554 

1 .3235 

7.37 

TZVP 

75.21277 

1.3081 

7.57 

[9] 

1.3937 

5.45 

[10] 

— 

— 

6.17 

[14] 

75.551 

1.318 

6.16 

TABLE  IV _ 

Total  energies  for  a  lineal  tricarbon  molecule. 


Basis  set 

Ej  (hartrees) 

/•o(A) 

Do  (eV) 

STO-3G 

-111.51367 

1.334 

17.67 

DZVP 

-112.98469 

1.309 

16.19 

DZVP2 

-112.98888 

1.3132 

16.01 

TZVP 

-113.00586 

1 .2962 

16.47 

[18] 

-113.38082 

1.328 

— 

[20] 

— 

— 

13.97 

multiconfiguration  SCF,  plus  extensive  configura¬ 
tion  mixing  calculation  and  their  results  were  in 
agreement  with  experiment.  Both  authors  found 
the  state  is  the  ground  state. 

Different  geometrical  possibilities  were  calcu¬ 
lated  for  the  singlet  and  triplet  state:  linear,  non¬ 
linear,  and  a  triangle  (cycle).  These  last  two  ge¬ 
ometries,  when  fully  optimized  transform  to  the 
linear  geometry.  Table  IV  presents  these  results. 
The  singlet  was  always  the  predicted  ground  state. 
The  Xg  state  is  the  ground  state,  in  agreement 
with  the  experimental  evidences.  As  in  the  case  of 
the  dicarbon  molecule,  DZVP2  was  the  best  in 
bond  energies,  but  for  the  bond  length  TZVP  gave 
the  best  result. 

Since  the  bond  distance  differences  are  small, 
but  not  the  energy  differences,  the  DZVP2  basis, 
which  gave  the  bond  energy  closest  to  the  experi¬ 
mental  data,  was  used  to  model  the  C9  carbon 
clusters. 

C9  CARBON  CLUSTERS 

Figure  1(a)  shows  a  bcc  unit  cell  (a  =  3.567  A) 
of  diamond;  notice  that  the  C(IOO)  surface  has 
successive  parallel  chains  in  (110)  direction.  When 
the  surface  is  formed,  the  lack  of  saturation  of  the 
surface  carbons  induce  a  relaxation  process.  Hamza 
et  al.  [21]  found  that  raising  the  temperature  to 
1300  K,  C(IOO)  surface  produces  a  LEED  pattern 
made  up  of  two  domains  with  2x1  symmetry; 
forming  a  77  symmetric  dimer  bond  between  two 
surface  carbon  atoms  of  two  consecutive  chains 
[Figure  1(b)]  [22].  Atomic  force  microscopy  (AFM) 
[23]  and  scanning  tunneling  microscopy  [4]  has 
also  evidenced  the  C(IOO)  2X1  surface  recon¬ 
struction.  It  is  the  most  accepted  geometry,  at  least 
in  experimental  conditions  of  diamond  growth. 

The  structure  of  a  hydrogenated  surface  has 
been  under  discussion  for  many  years;  the  mono- 
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(c) 


FIGURE  1.  (a)  Unitary  cell  of  diamond  (Ref.  [1]); 

(b)  relaxed  surface  C(100)  2  x  1;  (c)  monohydrogenated 
surface  diamond  C(100)  2x1. 


hydrogenated,  dihydrogenated,  and  alternating 
structures  have  been  evaluated  to  establish  the 
most  stable  one.  Thomas  et  al.  [24]  report  that 
atomic  hydrogen  doses  do  not  remove  the  2x1 
LEED  pattern  even  after  doses  equivalent  to  100 
monolayers  of  hydrogen.  Using  HREELS  and  LEED 
techniques,  Thomas  et  al.  [24]  reported  the  mono¬ 
hydrogenated  dimer  row  surface  as  the  most  sta¬ 
ble.  No  evidence  was  observed  for  other  species 
even  after  a  large  atomic  hydrogen  exposure. 

The  symmetric  dimer  with  monohydride  termi¬ 
nation  [Figure  1(c)]  is  the  most  likely  structure. 
Therefore,  the  dihydrogen  and  alternating  configu¬ 
rations  were  not  considered.  This  is  consistent  with 
theoretical  calculations  [25]. 

Several  authors  have  used  the  cluster  approach 
to  study  the  chemisorption  process  involved  in 
C(IOO)  diamond  growth.  It  is  based  in  the  gener¬ 


ally  accepted  idea  that  the  chemisorption  process 
is  a  localized  phenomenon  of  orbital  interaction. 
C9  is  the  best  carbon  cluster  for  studying  the 
reconstructed  C(IOO)  diamond  surface. 

One  of  the  first  C9  carbon  cluster  calculations 
was  made  by  Verwoerd  [26].  He  studied  the  recon¬ 
struction  of  both  the  clean  Si(lOO)  and  C(IOO)  sur¬ 
faces  and  the  process  of  hydrogen  chemisorption 
onto  these  two  surfaces  by  a  semiempirical  method 
and  using  a  C9  carbon  cluster  [27].  Starting  from 
the  ideal  parameters,  Verwoerd  made  a  whole 
optimization  using  a  semiempirical  MNDO 
method.  For  clean  and  monohydrogenated  recon¬ 
structed  C(IOO)  surfaces,  he  predicted  a  symmetric 
dimer;  the  C-C  distances  calculated  for  these 
dimers  were  1.43  and  1.67  A,  respectively.  The 
dihydride  structure  was  predicted  as  unstable 
(C“C  distance  in  the  dimer  of  0.74  A),  because  of 
the  repulsive  interaction  between  the  electron  den¬ 
sities  of  the  two  hydrogens.  Taking  a  C9  carbon 
cluster  model  similar  to  Verwoerd's,  we  calculated 
the  relaxed  and  nonrelaxed  surfaces,  with  and 
without  chemisorbed  hydrogens  and  with  CH2 
and  CH3  adsorbed.  Verwoerd  reported  3.09  eV  for 
relaxation  energy  and  4.39  eV  for  binding  energy 
of  hydrogen.  Mehandru  and  Anderson  [25b]  ob¬ 
tained  2.52  and  4.39  eV,  respectively.  They  use 
semiempirical  methods,  but  Mehandru  and  An¬ 
derson  use  a  cluster  larger  than  Verwoerd. 

An  interesting  work  that  combined  STM  images 
and  theoretical  calculations  of  molecular  dynamics 
in  a  semiempirical  DFT  was  made  by  Frauenheim 
et  al.  [4].  They  used  a  supercell  of  126  carbon 
atoms  for  C(OOl)  surface  and  reported  a  symmetric 
dimer  with  a  distance  of  1.41  A  for  a  clean  surface 
and  1.50  A  for  a  monohydrogenated  surface.  Other 
details  of  the  structural  parameters  of  Frauenheim 
et  al.  [4]  are  presented  in  Table  V.  We  took  their 
relaxed  structural  parameters. 


TABLE  V _ 

Relaxed  structural  parameters  of  Cg  carbon 
clusters  (Ref.  [4]). 


Surf,  struct. 

Bond  lengths  (A) 

Bond  angles  (deg) 

cdoo)  2  X 1 

(1-1)  1.41 

1-1-2  =  109 

(1-2)  1.50 

2-1-2  =  113.7 

C(100)2  X  1:H 

(1-2)  1.07 

1-2-2  =  112 

(2-2)  1 .58 

1- 2-3  =  113 

2- 2-3  =  104 

870 


VOL  65,  NO.  5 


THEORETICAL  CALCULATIONS  OF  CARBON  CLUSTERS 


The  C9  carbon  cluster  was  constructed  consider¬ 
ing  two  consecutive  diagonal  chains  [see  the  bcc 
unitary  cell  in  Figure  1(a)],  with  two  more  layers 
below.  The  dimensions  of  the  cluster  correspond  to 
the  diagonal  (2.5223  A)  in  the  bcc  unit  cell.  As  in 
the  unit  cell,  the  cluster  has  four  layers  with  an 
interlayer  spacing  which  is  1  /4  of  the  lattice  con¬ 
stant  of  the  bcc  cell.  Table  VI  presents  the  Carte¬ 
sian  coordinates  of  the  nonrelaxed  C9  carbon  clus¬ 
ter  taken  from  the  bcc  cell  parameters  [Fig.  2(a)]; 
Figures  2(b)  and  2(c)  present  the  corresponding 
relaxed  and  relaxed  hydrogenated  structures  using 
Table  V  parameters  as  the  starting  point. 

In  order  to  saturate  the  valencies  of  the  nonsur¬ 
face  carbons,  12  virtual  hydrogens  have  been  used; 

NON-RELAXED  CLUSTER  C(100)  1x1 
WITH  HYDROGEN  CHEMISORBED 


TABLE  VI _ _ 

Cartesian  coordinates  of  the  nonrelaxed 
Cg-carbon  clusters. 


Atom 

X 

Y 

Z 

Cl 

0.0000 

0.0000 

0.0000 

C2 

1.2612 

0.0000 

0.8918 

C3 

2.5223 

0.0000 

0.0000 

C4 

0.0000 

2.5223 

0.0000 

C5 

1.2612 

2.5223 

0.8918 

Ce 

2.5223 

2.5223 

0.0000 

C7 

0.0000 

1.2612 

-0.8918 

Cs 

1.2612 

1.2612 

-1.7835 

C9 

2.5223 

1.2612 

-0.8918 

RELAXED  CLUSTER  C(100)  2x1 
WITHOUT  HYDROGEN  CHEMISORBED 


C 


o 

c 

(a) 


O 


0 


c 


€ 


(b) 


FIGURE  2.  Geometrical  structure  of  carbon  clusters:  (a)  hydrogen  nonrelaxed;  (b)  reiaxedi  and  (c)  relaxed 
monohydrogenated. 
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relaxation  energy  is  smaller  than  that  calculated  by 
Verwoerd  [26]  and  Mehandru-Andersen  [25b]. 


O 


O  t 


FIGURE  2.  (continued) 


Table  VII  shows  the  total  energies  computed  for 
nonrelaxed  hydrogen,  relaxed,  and  relaxed  mono- 
hydrogenated  C9  carbon  cluster  and  the  energies 
involved  in  the  relaxation  and  hydrogen 
chemisorption  process. 

The  experimental  results  are  qualitatively  repro¬ 
duced.  The  relaxation  process  produces  a  stabiliza¬ 
tion  of  the  cluster,  but  notice  that  the  presence  of 
hydrogen  stabilizes  it  too  much.  The  computed 


Methyl  Adsorption  and  Growth  of  the 
First  Layer  of  Diamond 

Because  of  the  unique  properties  of  diamond, 
great  efforts  have  been  put  into  the  synthesis  of 
this  material  to  be  used  in  advanced  materials 
applications.  The  first  attempt  to  obtain  diamond 
was  in  1955,  but  the  process  required  high  pres¬ 
sures  and  temperatures.  After  1955  a  great  number 
of  investigators  began  to  explore  the  possibility  of 
synthesizing  diamond  at  low  pressure  and  temper¬ 
ature,  that  is,  in  the  region  of  its  metastability.  The 
chemical  vapor  deposition  (CVD)  technique  of  dia¬ 
mond  films  was  a  reality  in  1958,  but  the  growth 
rates  were  very  slow  and  graphitic  carbon  was 
deposited  simultaneously;  hydrogen  was  neces¬ 
sary  to  remove  the  graphitic  phase. 

In  1975,  Deryagin's  group  [28]  reported  the  use 
of  a  gas  activation  technique  which  resulted  in 
dramatic  increases  in  diamond  growth  rates,  while 
eliminating  much  of  the  graphitic  deposition. 
Starting  in  the  1980s  many  reports  were  published 
of  successful  diamond  growth  in  low  pressure 
using  new  gas  activation  techniques. 

It  is  now  well  established  that  diamond  can  be 
deposited  at  low  pressures  (from  1  to  10“^  atm.) 
and  temperatures  between  700  and  1000°  C  using 
the  CVD  technique.  In  this  technique,  a  mixture  of 
methane  and  ~  98%  of  hydrogen  is  activated  by 
passing  it  through  a  plasma  or  a  hot  filament;  this 
process  produces  atomic  hydrogen  and  carbon 
radicals,  and  the  deposition  of  these  species  over 
the  diamond  surface  produces  the  growth  of  dia¬ 
mond  and  graphitic  phases.  A  highly  preferential 
etching  of  the  graphitic  phases  is  produced  by 
atomic  hydrogen  [1]. 

Although  different  small  hydrocarbon  frag¬ 
ments  have  been  analyzed  as  possible  precursor 
species  in  diamond  growth,  at  present  it  is  ac- 


TABLE  VII _ _ _ 

Total  energies  for  the  nonrelaxed  and  the  relaxed  cluster,  with  and  without  hydrogen. 

Cg  cluster  Nonrelaxed  +  2H  Relaxed 

Etotai  (hartrees)  -348.10161  -346.95654 

^relax.  —  ^(relaxed  +  2H)  (^nonrelaxed +  2H) 

^quims.  (H)  ~  ^  /  2^(relaxed  + 2H)  ~  ^(relaxed) 


Relaxed  +  2H 
-348.23939 


872 


VOL.  65,  NO.  5 


THEORETICAL  CALCULATIONS  OF  CARBON  CLUSTERS 


cepted  that  CH3  is  the  dominant  growth  species 
[29]. 

Mehandru  and  Anderson  [25b]  calculated  CH2 
and  CH3  energy  barriers  in  the  presence  of  the 
hydrogen  vacancies  on  the  monohydrogenated 
surface  and  reported  for  CH2  and  CH3,  a  C-CH2 
and  C-CH3  of  1.48  and  1.59,  respectively;  the 
corresponding  binding  energies  are  4.38  and  3.38 
eV  in  the  most  stable  sites.  We  have  calculated  the 
energy  of  C9-H,  CH3,  C9-H,  CH*,  and  C9-2H, 
CH2  using  the  deMon  program. 

Five  steps  in  the  growth  mechanism  were  com 
sidered  as  suggested  by  Garrison  et  al.  [30]:  (1)  A 
hydrogen  leaves  the  surface  and  a  radical  site  is 
formed.  (2)  The  methyl  radical  is  adsorbed  over  a 
radical  site.  (3)  A  second  hydrogen  belonging  to 
methyl  radical  leaves  the  surface  and  a  methylen 
radical  is  formed.  (4)  The  dimer  bond  is  broken  by 
a  methylene  group;  a  methyliden  group  and  a 
radical  site  are  formed.  (5)  A  methyliden  group 
attacks  the  radical  site  and  a  bridge-methylene 
structure  is  formed.  A  new  layer  of  diamond  has 
been  produced  and  the  hydrogens  of  the  methy¬ 
lene  group  can  be  substituted  for  new  methyl 
radicals  that  arrive  to  the  surface. 

The  calculated  energies  confirm  that,  in  the 
mechanism  suggested  by  Garrison  et  al.  [30],  step 


TABLE  VIII _ 

Total  energies  for  the  relaxed  cluster  with  methyl 
and  methylene  species  adsorbed  and  energy 
changes  involved  in  the  steps  of  the  mechanism. 


Cg-H.CHg 

Cg-H,  CH2 

£,otai  (hartrees) 

-386.51 

-385.54 

<^^surf  ~  *^ads  (A) 

1.53 

— 

Steps 

AEleV 

Step  1 

+0.01 

step  2 

+0.5 

steps 

-0.69 

Step  4 

-0.002 

steps 

+0.89 

3  is  an  activated  process  that  produces  an  interme¬ 
diate  radical  state.  Table  VIII  presents  the  results 
and  Figure  3  presents  a  scheme  of  this  process. 
Another  interesting  point  in  the  mechanism  is  step 
4.  The  problem  is  whether  the  methylen  group  will 
attack  the  dimer  bond  or  will  displace  the  top 
hydrogen.  In  order  to  choose  between  these  two 
possibilities,  an  electrostatic  potential  calculation 
was  made  for  the  C9-H,  CH2  structure.  Figure  4 
presents  the  electrostatic  potential  curves  in  the 
plane  bisecting  the  dimer  for  this  structure. 


FIGURE  3.  Steps  1  and  2  of  the  reaction  mechanism. 
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FIGURE  4.  Electrostatic  pdtetitial  curves  in  the  plane 
bisecting  the  Cg-H,  CHg  cluster. 


The  electrostatic  potential  cutrves  predict  that 
the  methylen  group  look  fOIr  interaction  with  the 
dimer  rather  than  for  the  tOp  hydrogen.  Then,  step 
4  is  potentially  possible:  The  methylen  group 
breaks  the  dimer  bond  and  it  does  not  displace  the 
top  hydrogen. 


Conclusions 

A  full  optimization  of  C9  cluster  predicts  the 
relaxation  of  the  C(IOO)  surface  as  the  most  stable 
and  confirm  the  important  role  of  Hydrogen  in  the 
stabilization  of  this  surface. 

The  energy  involved  in  the  step  2  is  negative 
and  the  intermediate  Cg-H,  CH2  has  a  barrier 
energy  of  34.08  eV.  The  electrostatic  potential 
shown  the  step  4  is  potentially  possible.  In  order 
to  improve  the  results  quantitatively,  a  partial 
optimization  of  clusters  larger  than  Cg  are  being 
calculated. 
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ABSTRACT:  A  spin-averaged  Hartree-Fock  (SAHF)  procedure  is  examined  within 
the  framework  of  the  configuration-averaged  methodology  proposed  earlier.  The  SAHF 
method  produces  reasonable  total  energies  and  can  be  successfully  used  for  the  calculation 
of  electronic  spectra,  especially  in  the  cases  of  marked  energy  separation  between  the 
spin  states.  An  example  is  given  with  the  spectroscopy  of  Mn^"^  ions  in  the  ZnS  crystal 
where  the  SAHF  procedure  provides  a  more  accurate  interpretation  of  the  electronic 
spectrum  than  that  obtained  by  other  self-consistent-field  methods.  ©  1997  John  Wiley  & 
Sons,  Inc.  Int  J  Quant  Chem  65:  877-884,  1997 


Introduction 

The  applications  of  quantum  chemical  meth¬ 
odologies  to  investigations  of  open-shell 
transition-metal  (TM)  systems  are  complicated  by 
the  large  number  of  spin  states  often  associated 
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with  such  systems.  The  number  of  spin  states  in  an 
open-shell  system  depends  on  the  number  of 
open-shell  orbitals  and  electrons.  In  general,  the 
number  of  microstates  available  to  an  open-shell 
system  is  given  by  Eq.  (1),  with  N  and  m  the 
numbers  of  open-shell  electrons  and  spatial  or¬ 
bitals,  respectively; 


/  2m  \  ^ 

I  N  j  “  Nl(2m-N)l 


(1) 


and  the  number  of  states  for  a  given  multiplicity 


International  Journal  of  Quantum  Chemistry,  Vol.  65,  877-884  (1997) 
©  1997  John  Wiley  &  Sons,  Inc. 


CCC  0020-7608  /  97  /  050877-08 


STAVREV  AND  ZERNER 


by[l] 

K,n(S)  =  |(2S  +  N)/2)((N-  25)72) 
“((2S  +  N  +  2)/2) 

^((N  -  25  -  2)72) 

N„,w(5)  =  M„,j^(5) -M„,;,(5  +  1),  (3) 

with  =  number  of  a  spin  electrons,  n^>n^, 
and  S  =  \{n^  ~~  N  ^  +  and  it  is  under¬ 

stood  that  the  second  term  in  Eq.  (2)  is  zero  should 
(N  -h  2S  +  2)  >  2m,  Note  that  (N  +  2S)/2  = 
and  (N“2S)/2  =  n^.  The  first  term  in  Eq.  (3)  is 
the  number  of  times  electrons  can  be  arranged 
in  m  a-spin  orbitals  multiplied  by  the  number  of 
times  that  electrons  can  be  arranged  in  m 
j8-spin  orbitals.  The  second  term  subtracts  from  the 
first  those  microstates  that  have  the  value  5^  =  5 
but  that  belong  to  multiplicities  greater  than  S. 

In  the  case  of  a  single  TM  center,  several  hun¬ 
dreds  of  possible  states  can  arise  and  this  number 
grows  quickly  to  10^-10^  for  TM  dimers,  10^^ -10^^ 
for  systems  with  four  metals,  etc.  This  large  num¬ 
ber  of  possible  states  can  create  a  large  number  of 
states  close  in  energy  to  the  lowest-energy  state 
and  often  prevents  the  self-consistent-field  (SCF) 
convergence,  leading  also  to  spatial  or  spin  insta¬ 
bilities  and  thus  invalidating  the  pure  spin  de¬ 
scription  of  the  system.  The  calculation  of  the 
ground  state  is,  therefore,  a  difficult  task  and  re¬ 
quires  a  special  approach  to  the  treatment  of 
molecules  with  a  large  number  of  open  shells. 

Previous  work  on  TM  systems  has  demon¬ 
strated  that  an  iterative  configuration  averaged 
Hartree-Fock  (CAHF)  procedure  [1]  is  suitable  for 
the  description  of  electronic  properties  and  geo¬ 
metric  characteristics  of  metal  clusters  [2-7].  The 
CAHF  procedure  has  been  compared  with  other 
methods  which  also  average  over  multiplets  and 
effective  degeneracies,  and  the  CAHF  reliability 
has  been  demonstrated  especially  for  molecular 
systems  with  extensive  open  shells  [2].  The  CAHF 
operator  was  developed  within  the  restricted 
open-shell  Hartree-Fock  (ROHF)  approximation 
and  generalized  for  systems  with  an  arbitrary 


number  of  electrons  and  open-shell  orbitals  [2,  3]. 
For  this  purpose,  a  Fock  operator  for  the  averaged 
energy  of  a  configuration  is  considered.  The  ener¬ 
gies  and  the  orbitals  from  such  an  operator  can  be 
used  to  form  a  reference  for  a  subsequent  spin 
projection  to  obtain  the  pure  spin  states  using,  e.g., 
Rumer  diagrams  [1,  8,  9]  or  unitary  group  tech¬ 
niques  [10,  11].  Subsequent  configuration  interac¬ 
tion  (Cl)  can  also  be  performed  with  such  conve¬ 
niently  obtained  orbitals,  allowing  for  a  proper 
check  of  the  ground  state  and  a  refinement  of  the 
total  wave  function  and  its  properties. 

It  was  the  aim  of  this  work  to  present  a  further 
development  of  the  CAHF  theory  that  allows  for  a 
better  approach  to  the  spin-state  problem  through 
a  selection  of  the  configuration-averaged  states  by 
a  spin  eigenvalue.  This  leads  to  a  smaller  and 
significantly  simplified  pool  of  possible  states  and 
can  present  a  better  reference  for  the  subsequent 
CL  An  example  is  given  with  the  spectroscopy  of 
the  Mn^"^  ion  in  the  ZnS  lattice  where  the  new 
approach  is  compared  with  the  ROHF  calculations 
performed  previously  on  the  same  system  [12] 
using  the  ZINDO  program  and  the  INDO/S 
Hamiltonian  [13]. 


Theory 

In  general,  we  are  trying  to  calculate  the  aver¬ 
age  energy  of  a  system  with  N  electrons  in  tn 
spatial  orbitals.  The  number  of  possible  mi¬ 
crostates  is  given  by  Eq.  (1).  We  assume  that  the 
states  are  described  by  spin  and  spatially  adapted 
linear  combinations  of  Slater  determinants, 

(^>2 . . .  I  constructed  from  a  set  of  orthonormal 
spin  orbitals  =  5,^. 

The  number  of  determinants  of  N  electrons  in 
m  spatial  orbitals,  when  one  of  the  orbitals  is 
doubly  occupied,  given  by 


Since  each  such  determinant  yields  the  Coulomb 
integrals  in  the  expression  for  its  energy,  this  is 
also  the  number  of  integrals  that  appear  in  all 
possible  determinants  [Eq.  (1)]. 

Accordingly,  for  i  ^  ;,  the  number  of  Jij  inte¬ 
grals  that  appears  in  all  possible  determinants  is 
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given  by 


2m  -  2\ 
N-2  j' 


(5) 


as  the  electrons  in  (/>,  and  can  be  of  a  and  /3 
spin  independently.  The  general  Coulomb  integral, 
Jjj,  is  defined  as 

=  If  dril)  dT(2)(^f  (l)(^;(2)(l/ri2)</>,.(l)<^/2). 

(6) 


In  a  simple  fashion,  the  number  of  exchange 
integrals,  when  considering  all  possible  deter- 
minants  of  Eq.  (1),  is  given  by 


2m  -  2^ 
N-2  j' 


(7) 


with 


bitals,  the  next  two  are  those  for  open-shell  contri¬ 
butions  to  the  energy,  and  the  last  term  gives  the 
coupling  between  the  closed  and  the  open  shells. 
Here,  i  and  j  stand  for  the  doubly  occupied 
closed-shell  orbitals,  and  p  and  n  label  the  open- 
shell  orbitals  with  fractional  occupations  and 
belonging  to  open  shells  jul  and  v,  respectively.  In 
the  general  case,  p  and  n  refer  to  individual  or¬ 
bitals,  and  each  /jl  and  v  define  a  separate  open- 
shell  Fock  operator.  Most  often,  sets  of  orbitals 
having  the  same  vector  coupling  coefficients,  a 
and  b  in  Eq.  (9),  are  grouped  together  into  shells 
described  by  the  same  open-shell  operator  [1,  14]. 
Comparing  Eq.  (9)  with  (10)  gives  the  vector  cou¬ 
pling  coefficients  for  a  given  open  shell  r,  with 
fractional  occupation  =  N^/m/. 


Urr  b^ 


2m,(N,  -  1) 

N,(2m,  -  1)  ■ 


(11) 


=  11  dril)  dTi2)<f>*a)<pf(2)aM<l>i(2)(t>ja). 

(8) 


The  vector  coupling  coefficients  between  the  dif¬ 
ferent  open  shells  are  equal  to  1: 


In  this  case,  both  4>^  and  (jyj  need  to  be  of  a  or  jS 
spin,  respectively,  for  the  factor  2  rather  than  the  4 
of  Eq.  (5).  The  average  total  energy  of  the  open-shell 
is  then  obtained  through  averaging  over  the  total 
number  of  integrals  for  all  possible  states.  This 
leads  to 


N 

E„^,(open)  =  —2^h„  + 
m  ^ 


N(N-  1) 
2m(2m  —  1) 


X 


E/„„  +  2E(2/„, 

n  n<p 


.  (9) 


To  use  the  ROHE  operator  [14],  the  energy  must  be 
cast  in  the  form  of 


E  =  2Zhi+ 

i  ^  / 


m 


Ur,  =  fc,,  ==  1.  (12) 

The  configuration  average  thus  obtained  is  a 
mix  of  various  states  with  different  total  spin.  This 
spin  contamination  is  generic  for  the  CAHF  and 
typically  presents  a  difficulty  in  the  determination 
of  the  pure  spin  state  specified  in  the  ROHE  calcu¬ 
lations.  When  a  fraction  of  the  total  CAHF  states  is 
selected  by  the  total  spin,  one  obtains  a  signifi¬ 
cantly  smaller  range  of  configurations;  we  further 
refer  to  them  as  spin-averaged  states  [see  Eq.  (2)]. 
This  spin-averaged  Hartree-Fock  (SAHF)  proce¬ 
dure,  similar  to  the  CAHF  methodology,  operates 
in  the  reduced  manifold  of  spin-eigenstates  within 
a  given  configuration,  leaving  unsampled  (in  the 
average)  a  large  number  of  unwanted  configura¬ 
tions. 

For  the  SAHF,  the  vector  coupling  coefficients 
have  the  following  form: 


+  7  E  E  E  E  %n^i2a^J^„  - 

^  fjL  V  n&v 

+  ^E«/2/,,,-K,.p).  (10) 

L  V 


a(S,  m,N) 


m 

(m^  -  1)N^ 


X 


mNiN-  1)  + 


N(N-  4) 
2 


The  first  two  terms  in  Eq.  (10)  account  for  con¬ 
tributions  from  electrons  in  the  closed  shell  or- 


+  2S(S  -L  1) 


(13) 
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b(S,m,N)  = 


m 


(m^  -  1)N^ 
2N 


X 


m 


-(N-l)  +  N(N-4) 


advantages  of  the  SAHF  approach  are  illustrated 
in  the  next  section. 


Application 


+  4S(S  +  1) 


(14) 


To  derive  Eq.  (14)  (perhaps  there  is  a  simpler 
way!),  we  consider  four  cases  and  the  genealogical 
construction  of  spin  states  [1].  Each  case  gives  rise 
to  the  occurrence  of  at  least  one  Coulomb  integral. 
In  case  one,  there  is  one  up-spin  electron  in  both 
orbitals  and  Such  a  triplet-coupled  pair 
gives  rise  to  and  occurs  multiplied  by  the 

number  of  times  the  remaining  determinant  can 
couple  to  give  an  eigenstate  of  spin  eigenvalue 
S  [more  correctly,  eigenvalue  of  S(S  +  1)].  If 
S  >  1,  this  is  (N^_2,n-2(S  -  1)  +  N^_-2,n-2(S)  + 
^m-2,N-2(S  +  D)  times  [see  Eq.  (2)].  If  S  =  1/2, 
the  multiplicative  factor  is  (N^-2,  n-2(^)  + 
^m~2,N-2(S  +  l)),andif  S  =  0,  N„^_2,^_2(S)(from 
the  addition  rule  of  angular  momentum,  S  =  |S  + 
1|,  IS  +  1|  -  1, . . .  IS  -  1|).  A  second  case  has  an 
up-spin  electron  in  and  a  down-spin  in  (j)j, 
forming  an  (aj8  -  /3a)/ \/2  singlet-coupled  pair. 
Such  a  pair  contributes  Jij  +  K^j  and  occurs 
N„^„2,n-2(5)  times.  The  third  case  contains  two 
electrons  in  </>,  and  one  in  or  visa  versa.  Each 
such  situation  contributes  2(2  and  occurs 

(K-2,n^3(S  +  1/2)  +  N,-2,n-2(S  -  1/2))  times 
if  S  >  1/2  or  N„,_2,n-2(^)  if  S  =  0.  Finally,  the 
case  of  two  electrons  in  each  of  these  orbitals 
contributes  4/^^  — 2K,y  and  occurs  N^-2, 
times.  Combining  these  terms  and  dividing  by 
^(S),  which  is  very  tedious  and  best  done 
symbolically,  yields  the  average  number  of  J's  and 
K's.  Interestingly,  the  special  cases  delineated 
above  take  care  of  themselves  providing  that  the 
binomial  terms  in  Eq.  (2)  with  an  ill-defined  argu¬ 
ment  are  set  equal  to  zero.  The  comparison  is  then 
made  with  Eq.  (10)  with  all  open-shell  fractional 
occupancy  n„  ==  n.  =  N/m  to  obtain  the  and  b;: 
of  Eqs.  (13)  and  (14). 

J-i  occurs  N,„„ijv-2(^)  times,  whereas,  of 
course,  two  electrons  in  one  orbital  contribute  no 
Kji.  In  comparing  with  Eq.  (10),  however,  7,^  occurs 
n^rii  {la^i  -  i?j/)/4  times  (since  Jn  =  allowing 
the  same  expression  to  be  used  for  as  for  a^j 
and  for  b^^  as  for  b^,  i  ^  ;,  and,  thus,  the  SAHF 
procedure  is  characterized  by  a  single  open-shell 
operator  as  it  was  for  the  CAHF  procedure.  The 


We  illustrate  the  SAHF  approach  for  the  case  of 
two  electrons  in  two  orbitals,  which  is  represented 
by  the  six  microstates  shown  in  Figure  1.  The 
average  energy  obtained  by  the  CAHF  method 
over  all  the  six  states  is  then  split  within  the  SAHF 
framework  into  two  pieces,  corresponding  to  the 
three  possible  singlets  (Fig.  2)  and  the  triplet  states 
that  form  this  configuration.  Figure  1  shows  two 
possible  ways  to  obtain  the  averaged  energies, 
through  the  averaging  over  all  the  individual  mi¬ 
crostates  (left  block)  or  through  the  pure  spin  states 
(on  the  right).  The  results  are  the  same  as  shown  in 
the  figure. 


- 


4-1- 


^2=  '  '1^4-  ■  = 

l<j)2a  (1>2P|  l(j)ia  l<t)ip  4)2^1  I4>ia 


^6“ 
i(l)lp  <j>2a| 


El=2hl  +J1,1 
E2  =  2h2  +  J2,2 
E3  =  hl  +h2+Jl.2'Kl,2 
E4  =  hi  +h2  +  Jl,2-Ki,2 
E5  =  hl  +h2  +  Jl,2 
E6  =  hl  +h2  +  Ji,2 


or  =  ^3  ,  «  HI4  and 

3e=  hi  +h2+Jl.2-Ki,2 
1xirO=.(^5-^^6)/V2 
1E=  hi  +h2+Jl,2  +  Ki,2 

1xpo  =  (Wl  +  ^2)/V2 
1E  =  hi  -Fh2  +  (J1,1  +J2,2)/2-Ki.2 
1E  =  hl  +h2  +  (Jl.1  +J2,2)/2-}-  Ki,2 


Eave  =  Ecahf  =  hi  +  h2  +  (J 1 , 1 + J2,2  +  4J1 . 2  -2  Ki ,  2)/6 


EQCMf  =  hi  h2  +  (J1 .1+ J2,2  +  4J1,2  -2  Ki  ,2)/4 


Ehhf=hi+h2  +  (Jl,2-Ki,2/2) 


FIGURE  1.  Configuration  averaging  illustrated  for  the 
simplest  case  of  two  electrons  in  two  spatial  orbitals,  h 
stands  for  the  one-electron  terms  in  the  energy 
expressions;  J  and  K  are  the  Coulomb  and  the  exchange 
integrals,  respectively.  At  the  bottom  of  the  table,  the 
CAHF  energy  is  compound  with  other  averaging 
procedures,  such  as  “grand  canonical  Hartree-Fock” 
(gchf)  and  “hyper-Hartree-Fock”  (hhf)  ([15]  and  [16], 
respectively). 
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The  relative  energies  obtained  by  various  meth¬ 
ods  for  the  similar  case  of  a  TM  ion  in  an 
octahedral  ligand  field  (two  electrons  in  the  upper 
doubly  degenerate  orbitals,  d^i  and  d^2_^2)  are 
depicted  in  Figure  3.  On  the  right  side  of  Figure  3, 
the  Cl  energies  of  the  ground-state  triplet  and  the 
first  two  excited-state  singlets  that  originate  from 
the  three  dominant  configurations  are  shown.  We 
see  that  the  open-shell  triplet  and  singlet  SCF 
calculations,  ROFIF-3  and  ROHF-1,  respectively, 
represent  a  good  reference  for  the  corresponding 
lower-lying  spin  states.  SAHF-1  gives  closer  en¬ 
ergy  for  the  higher-lying  singlet  than  the  other 
methods.  In  all  the  cases,  RHF  is  a  poor  reference 
for  either  singlet  or  triplet  states.  Upon  projection, 
the  CAFIF  produces  higher  energies  for  the 
ground-state  triplet  and  singlets,  but  is  a  good 
reference  for  higher-lying  triplets,  as,  e.g.,  those 
originating  from  mixed  e^)  configurations,  not 
included  in  any  of  the  SCF  schemes  shown  here. 
Note  that  the  ROHF-3  and  SAHF-3  correspond  to 
the  same  triplet  state  which  is  unique,  (see  also 
Fig.  1). 

We  conclude  from  these  simple  considerations 
that  configuration-averaging  models  provide  a 
good  reference  for  the  spectroscopic  Cl  calcula¬ 
tions.  The  advantages  offered  by  the  SAHF  ap- 


Energy  (eV) 

I  —  RHF 


5 


3 


2h 


1 


-H-  —  S2 

—  SAHF-1 

-ROHF.1  ^  S1 

—  CAHF 


0 


-ROHF-3  4.  ^  TR 


FIGURE  3.  Relative  energies  of  the  states  produced  by 
various  SCF  methods  (left)  compared  with  the  Cl 
energies  for  the  three  lowest  states  originating  from 
configuration  (right). 


li 

il-\-L  X/_ 

{1Ei  +  lEs+e  +  ''E2}A3 

SAHF 


—  /-i-  i-L  JL  /_i_  -i-\  _i_ 

ii  \-L*-L/  _  _l\-L’-L/  _l 

{IEi  +  '•E5+6  +  lEa  +  3E3  +  ^Es-e  +  3E4}/6 


CAHF 

FIGURE  2.  Comparison  between  the  SAHF  and  CAHF  in  the  case  of  two  electrons  in  two  orbitals.  The  numbering 
corresponds  to  the  pictures  on  top  of  Figure  1. 
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proach  can  be  demonstrated  with  relative  ease  in 
the  cases  where  there  is  a  distinct  separation  be¬ 
tween  the  ground  and  the  excited  spin  states.  We 
illustrate  this  idea  with  the  examination  of  the 
electronic  spectrum  of  a  ion  in  a  cubic  crystal 
field  (Fig.  4),  which  has  an  unique  sextet  ground- 
state  originating  from  the  high-spin  con¬ 
figuration  in  a  tetrahedral  field. 

The  electronic  spectrum  of  ZnSiMn  has  been 
thoroughly  investigated  in  the  last  several  decades 
[17-24],  Five  distinct  bands  have  been  observed  in 
the  visible  region  corresponding  to  d-d  transitions 
from  the  ground  state  to  the  manifold  of 
quartet  states  originating  from  the  configurations 
with  y  =  1-4.  The  configuration  con¬ 
tributes  to  to  the  doublet  excited  states  and  re¬ 
quires  excitation  of  two  electrons  from  the  ground 


tl  configuration.  This  requires  a  double  spin-flip 
during  the  electronic  transition  and  imposes  a  strict 
spin  prohibition  on  such  bands.  Doublet  states  are, 
therefore,  considered  unlikely  to  contribute  mea¬ 
surably  to  the  d-d  absorption  spectrum  of  ZnSrMn. 

In  a  previous  work  [12],  we  calculated  the  d-d 
bands  of  the  ZnS:Mn  spectrum  using  the  ROHF 
ground  state  as  a  reference  for  the  subsequent  Cl 
that  forms  the  quartet  states  using  the  spectro¬ 
scopic  intermediate  neglect  of  differential  overlap 
(INDO/S)  model  [13].  We  applied  the  same  ap¬ 
proach  here  using  the  SAHF  states  as  a  reference. 
The  results  are  given  in  Table  1.  We  see  that  the 
level  of  accuracy  is  considerably  improved  by  the 
SAHF  method  and  the  average  error  decreases  to 
the  level  of  the  vibrational  broadening  of  the  spec¬ 
troscopic  bands:  0.14  eV.  Following  the  SAHF  cal¬ 
culations,  the  Ml  level  is  again  calculated  above 
the  state  by  160  cm”\  as  compared  to  the  100 
cm“^  that  we  calculated  earlier  using  the  ROHF 
approximation  [12].  The  relative  positioning  of 
these  two  states  is  controversial  [18,  20]:  The  first 
doublet  state  that  we  calculate  using  Cl  single  and 
double  excitations  is  at  3,72  eV,  beyond  the  ob¬ 
servable  spectroscopic  range  for  ZnS  with  a  band 
gap  absorption  starting  at  nearly  3.6  eV.  Therefore, 
the  possible  contribution  of  doublet  states  to  the 
ZnS:Mn  electronic  spectrum  should  be  ruled  out. 


FIGURE  4.  Term  level  diagram  for  a  ion  in  a  cubic 
ligand  field  and  the  sextet-quartet  transitions  related  to 
the  ZnS:Mn  absorption  spectrum. 


TABLE  I _ 

Calculated  and  experimental  band  energies  (eV) 
for  ZnS:Mn;  ab  initio  and  experimental  data  are 
taken  from  [22]. 


Transition 

Ab  initio^ 

ROHF'= 

This 

work 

Experiment 

2.68 

2.72 

2.61 

2.34 

^  a% 

2.83 

2.76 

2.68 

2.49 

^  a^'E 

2.96 

2.78 

2.69 

2.67 

2.96 

2.78 

2.71 

2.67 

^  b% 

3.31 

3.18 

3.07 

2.89 

^  b% 

3.39 

3.47 

3.15 

3.19 

^  b^E 

3.52 

3.56 

3.46 

Average 

absolute 

error 

0.31 

0.24 

0.12 

®MR-CIS  calculations  employing  Watcher’s  (14,9,5)  basis 
with  extra  diffuse  functions  for  Mn  (14,11,6)  [8,  7,  4]  and 
Roos-Siegbahn’s  (10, 6)  [7,  4]  basis  for  S;  from  [25]. 

^  ROHF  stands  for  transition  energies  calculated  using  ROHF 
sextet  state  as  a  reference  for  the  subsequent  configuration 
interaction  (see  [2]). 
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The  absorption  spectrum  is  essentially  reproduced 
from  sextebto-quartet  d-d  excitations  and  the 
agreement  with  the  experiment  (see  [24])  is  im¬ 
proved  significantly  by  the  SAHF  procedure. 

The  SAHF  procedure  is  the  same  as  the  ROHF 
approximation  for  the  case  of  maximum  multiplic¬ 
ity,  i.e.,  S  —  =  m.  It  appears  that  it  is  a  reason¬ 

able  starting  position  for  any  Cl  of  the  multiplicity 
over  which  the  average  is  taken,  but  need  not  to  be 
the  best  average  for  the  cases  of  lower  multiplicity. 
Returning  for  the  moment  to  the  case  of  two  elec¬ 
trons  in  two  orbhals,  the  singlet  average  contains 
1/2  and  The  \(f>j^j\ 

state  may  be  of  such  high  energy  as  to  render  the 
average  orbitals  obtained  not  as  good  for  the  low¬ 
est-lying  singlets  as  that  for  the  CAHF  case  of 
Figure  1,  since  the  energy  there  is  lowered  by 
including  the  three  triplet  states  in  this  average 
(see,  e.g..  Fig.  2).  Indeed,  in  such  cases,  it  can  be 
difficult  to  converge  the  SCF. 

It  is  interesting  also  to  develop  an  operator  that 
presents  the  average  of  a  given  multiplicity  for  an 
open-shell  structure  of  N  electrons  in  m  =  iV  or¬ 
bitals.  This  would  be  the  case,  e.g.,  of  the  singlets 
or  the  triplets  possible  from  through  of 
Figure  1.  There  are 

N(S)  =  1  with  S  =  =  N/2 

^  ((2S  +  N)/2)  “  ((2S  +  N  +  2)/2) 
with  S  <  S„,„^  (15) 

such  states  (one  singlet,  and  one  triplet  for 
through  i/(g  of  Figure  1).  Thus,  the  coupling  coeffi¬ 
cients  are  obtained  in  the  form 

lUii  =  bii,  Uij  =  1  all  f,  j 
bii  =  2  all  i 
hj  =  2[^7^(i?^  -  1)  +  Tj„(rj„  -  1) 

-2r)^rii^/N(S)j/m(m  -  1)  all 

(16) 

This  development,  unfortunately,  generates  a  dif¬ 
ferent  operator  for  each  orbital;  for  example,  in  the 
common  case  of  the  ROHF  singlet  operator  for  two 
electrons  in  two  orbitals,  this  is  two  open-shell 
operators.  Such  an  approach  quickly  becomes  un¬ 
wieldy.  For  the  case  of  antiferromagnetic  spin  cou¬ 
pling,  however,  where  electrons  of  a  given  spin  are 
'"frozen"  in  the  corresponding  orbitals  of  a  given 
center,  say  C  and  D,  this  approach  again  becomes 


simple,  leading  to 

=  1  all  z,  j 

b,j  =  2  zgC(T),  /eD(T);  or 
/eC(i),  ;GD(i) 

=  0  otherwise.  (17) 


This  leads  to  two  open-shell  operators,  one  for  all 
up-spin  domains,  C(  t )  and  D(  t ),  and  one  for  the 
down-spin  spin  domains,  C(l)  and  D( i ): 


For  this  to  be  useful,  the  orbitals  must  be  localized, 
as  they  often  are  for  d  orbitals  in  open-shell  situa¬ 
tions,  and  electrons  of  up-  and  down-spin  must  be 
assigned  to  different  domains  as  well  as  specifying 
the  overall  value,  i.e.,  A(  T )/  B(  i  ),  C(  t ) . .  •  .  This 
operator  has  been  used  several  times  by  us  with 
good  success  [26]. 
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ABSTRACT:  The  red  shift  of  the  first  absorption  band  of  benzene  in  the  condensed 
phase  is  calculated.  The  solid-phase  calculations  are  performed  using  the  experimental 
crystal  structure.  For  the  liquid  two  different  approaches  are  considered.  One  is  the 
herringbone  structure  suggested  by  X-ray  experiments,  and  the  other  uses  supermolecules 
generated  by  Monte  Carlo  simulation.  Attention  is  given  to  the  contribution  of  the  usual 
parallel  and  T-shape  conformations  of  the  benzene  dimers  obtained  in  the  simulation. 
Comparison  with  experiment  shows  a  relatively  poor  result  for  the  herringbone  structure 
and  good  agreement  for  the  supermolecular  structures  generated  by  the  simulation. 

©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  885-891,  1997 


Introduction 

In  this  study  we  calculate  and  analyze  the  red 
shift  of  the  first  absorption  band  of  benzene  in 
the  condensed  phase  as  compared  to  the  gas  phase. 

The  interest  in  the  structure  [1]  and  spectra  of 
clusters  and  condensed-phase  benzene  is  related  to 
several  topics  of  interest  in  chemistry,  physics,  and 
biology.  It  naturally  relates  to  the  importance  of 
the  interaction  between  aromatic  7r-systems.  This 
present  study  also  relates  to  our  interest  in  solvent 
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effects  in  molecular  absorption  spectroscopy  [2,  3]. 
For  the  calculation  of  the  solvatochromic  shift  of 
liquid  benzene,  we  use  two  structures.  One  is  the 
well-known  herringbone  structure  [4]  and  the  other 
is  a  disordered  structure  obtained  from  Monte 
Carlo  (MC)  simulation.  For  completeness,  we  also 
consider  the  first  absorption  transition  of  the  solid 
benzene.  All  quantum  mechanical  calculations  are 
made  using  the  INDO/CIS  model  [5]  as  imple¬ 
mented  in  the  ZINDO  program  [6],  and  we  use  the 
so-called  supermolecular  (SM)  approach.  In  the 
case  of  the  solid  and  the  herringbone  structure  of 
the  liquid,  two  configurations  are  used  (Fig.  1).  In 
the  first  we  use  14  benzene  molecules  both  for  the 
solid  and  the  liquid,  as  seen  iq  Figure  1(a)  (top).  In 
the  second,  we  use  only  13  benzene  molecules. 

CCC  0020-7608  /  97  /  050885-07 
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Monte  Carlo  Simulation 


The  MC  simulation  is  performed  using  standard 
procedures  for  the  Metropolis  sampling  technique 
[8]  in  the  canonical  ensemble,  where  the  number  of 
molecules  N,  the  volume  V,  and  the  temperature 
T  are  fixed.  As  usual,  periodic  boundary  condi¬ 
tions  in  a  cubic  box  [9]  are  used.  In  our  simulation, 
we  use  125  molecules  of  benzene.  The  volume  of 
the  cubic  box  is  determined  by  the  experimental 
density  of  the  liquid,  p  =  0.8990  g/cm^  [10],  and 
the  temperature  is  T  =  25°C.  There  has  been  much 
discussion  about  the  quadrupolar  interaction  of 
benzene  and  its  relation  to  the  structure  [11]  (see 
next  section).  In  our  theoretical  experiments,  we 
are  willing  to  generate  a  disordered  structure  for 
the  liquid.  Therefore,  the  benzene  molecules  inter¬ 
act  only  by  the  Lennard-Jones  potential: 


on  on 


E  E4s,7 

i  I 


(1) 


FIGURE  1.  Crystalline  and  the  herringbone  structures 
of  solid  and  liquid  benzene.  The  top  figures  show  14 
benzene  molecules.  The  bottom  figures  show  instead  13 
molecules  and  includes  one  bulk  benzene  in  the  center. 
Molecules  1 , 2,  3,  and  4  generate  the  unit  cel!  as  defined 
by  Marten  [4]. 


This  corresponds  to  including  one  benzene  and  its 
12  nearest  neighbors  of  the  well-known  Fbca  space 
group  [4].  This  second  case,  as  seen  in  Figure  1(b) 
(bottom),  as  opposed  to  the  first  case,  includes  a 
central  benzene  molecule  to  represent  the  "'bulk." 
Calculations  are  then  performed  at  these  four 
structures  but  in  each  case  using  a  fixed  geometry. 

For  the  case  of  the  disordered  liquid  benzene 
the  structures  generated  by  the  MC  simulation  [7] 
are  used.  Each  SM  is  composed  of  14  benzene 
molecules.  As  we  shall  see,  this  corresponds  to 
including  the  first  coordination  shell  as  obtained 
from  the  radial  distribution  function  (RDF).  In 
addition,  however,  several  SM  calculations  are 
performed.  The  actual  number  of  SM  structures 
used  takes  into  account  the  statistical  correlation 
between  MC  structures.  The  final  excitation  energy 
is  obtained  averaging  over  the  transition  energies 
obtained  for  each  structure.  A  comparison  of  these 
different  procedures  will  be  made. 


where  is  the  sum  over  the  atoms  of  molecule  a, 
E^  is  the  sum  over  the  atoms  of  molecule  b,  is 

the  distance  between  atoms  i  and  ;, 

/  cr-oy,  and  cr-  and  are  the  parameters  of 
atom  i.  The  potential  parameters  used  were  devel¬ 
oped  by  Jorgensen  and  co-workers  [12]:  aQ  =  3.75 

o 

A,  Sq  =  0.11  kcal/mol,  and  =  %  =  0.  As  the 
environment  of  the  liquid  system  has  no  signifi¬ 
cant  effect  on  the  conformational  equilibrium  for 
the  benzene  molecule  [4,  14],  we  consider,  in  the 
simulation,  rigid  conformations  of  the  benzene 
molecules  in^the  structure  with  r^c  =  1-40  A, 
rcH  =  1.088  A,  and  all  angles  6  =  120°.  We  recog¬ 
nize  that  distortions  of  the  benzene  structure  would 
lead  to  considerably  enhanced  intensity  of  the  for¬ 
bidden  transition  we  examine.  We  do  not 
believe,  however,  that  the  position  of  the  band 
maximum  will  be  much  effected  [15]. 

In  the  calculation  of  the  pairwise  energy,  each 
molecule  interacts  with  all  other  molecules  within 
a  center-of-mass  separation  that  is  smaller  than  the 
cutoff  radius  =  13.11  A.  For  separations  larger 
than  r^,  we  use  the  long-range  correction  of  the 
potential  energy  [13].  Over  the  125  benzene 
molecules  included  in  the  simulation,  each  mole¬ 
cule  interacts  effectively  with  ^  60  neighbors. 

The  initial  configuration  is  generated  randomly, 
considering  the  position  and  the  orientation  of 
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each  molecule.  A  new  configuration,  or  one  MC 
step,  is  generated  by  selecting  all  molecules  se¬ 
quentially  and  trying  to  translate  it  in  all  the 
Cartesian  directions  and  also  rotate  it  around  a 
randomly  chosen  axis.  The  maximum  allowed  dis¬ 
placement  of  the  molecules  is  auto-adjusted  after 
50  MC  steps  to  give  an  acceptance  rate  of  new 
configurations  around  50%.  The  maximum  rota¬ 
tion  angle  is  fixed  during  the  simulation  in.  80  ^ 
±15°.  The  full  simulation  [7]  involves  a  thermal- 
ization  stage  of  about  5000  MC  steps  followed  by 
an  averaging  stage  of  48,000  MC  steps.  During  the 
averaging  stage  some  thermodynamical  proper¬ 
ties,  as  the  internal  energy  and  the  heat  capacity  at 
constant  volume,  are  calculated,  and  they  are 
in  complete  agreement  with  that  calculated  by 
Jorgensen  and  co-workers  [12].  The  radial  distribu¬ 
tion  function  is  also  calculated  during  the  averag¬ 
ing  stage  in  the  simulation. 


Results  and  Discussions 

The  structure  of  the  liquid  benzene  has  been  of 
interest  for  many  years.  Ever  since  the  X-ray  study 
of  Narten  [4],  the  herringbone  structure  is  believed 
to  prevail.  This  structure  is  illustrated  in  Figure  1 
along  with  the  known  experimental  structure  of 
solid  benzene.  In  the  case  of  the  benzene  dimer  the 
interest  is  basically  related  to  the  relative  stabiliza¬ 
tion  of  the  parallel  and  T-shape  structure  [16].  A 
herringbone  structure  for  the  dimer  has  also  been 
proposed  [17].  More  recent  studies  [18]  have  indi¬ 
cated  that  a  parallel-displaced  configuration  seems 
to  be  more  stable  than  the  T-shape  favorite  struc¬ 
ture  for  the  dimer.  It  seems  clear  from  all  these 
studies  that  the  parallel-displaced,  T-shape,  and 
T-shape-displaced  structures  are  very  close  in 
energy. 

From  these  considerations  it  seems  appropriate 
to  consider  SM  calculations  of  the  liquid  benzene 
with  all  these  possibilities,  not  only  the  herring¬ 
bone  structure.  Thus  several  structures  generated 
by  the  MC  simulation  will  be  used.  As  these  struc¬ 
tures  are  statistically  correlated,  we  will  select 
structures  that  are  separated  by  a  certain  number 
of  MC  steps.  This  number  results  from  the  time 
correlation  function  and  the  statistical  inefficiency 
analysis  [19].  In  fact,  we  select  structures  that  are 
separated  by  twice  the  correlation  time  (2%),  ob¬ 
tained  after  the  integration  of  the  time  correlation 
function.  Clearly,  ''time"  here  should  mean  MC 


step.  The  calculated  time  correlation  function,  C(0, 
and  the  statistical  inefficiency,  s,  for  the  energies 
of  the  liquid  benzene  structures  are  shown  in  Fig¬ 
ure  2.  The  calculated  C(0  function  gives  the  corre¬ 
lation  time  value  of  400  steps.  This  is  related 
to  the  statistical  inefficiency  and  the  relation  s  -  19^ 
holds.  Thus  we  use  only  MC  structures  that  are 
separated  by  800  MC  steps.  As  the  entire  simula¬ 
tion  took  48,000  MC  steps,  only  60  are  statistically 
relevant  and  used  in  our  studies. 

Figure  3  shows  a  comparison  between  the  calcu¬ 
lated  RDF  from  the  MC  simulation  and  the  experi¬ 
mental  RDF  of  Narten  [4],  As  can  be  seen,  the 
overall  agreement  between  these  two  results  is 
very  good.  The  results  ^show  a  first  broad  peak 
ranging  from  3.0  to  7.6  A,  Integration  of  this  peak 
gives  the  coordination  number  of  =  12.5.  This 
theoretical  result  is  of  interest  in  two  different 
aspects.  First,  the  experimental  result  of  Narten 
gives  Ng  =  12,  in  close  agreement  with  our  theo¬ 
retical  value.  This  suggests  that  Ng  is  not  enough 
to  uniquely  characterize  the  liquid  as  having  a 
herringbone  structure.  The  second  aspect  is  of  more 
technical  interest.  ==  12.5  signifies  that,  on  the 
average,  each  benzene  molecule  has  13  (rounding 
off)  neighbors  and  therefore  the  supermolecule 
should  have  14  molecules.  Hence  all  three  systems, 
including  the  solid  and  the  herringbone,  will  be 
calculated  with  the  same  number  of  molecules. 
The  difference  is,  however,  that  in  the  case  of  the 
MC  liquid  a  total  of  60  SM  quantum  mechanical 
calculations  are  performed. 

In  all  cases  the  excitation  energy  to  the  — 

71*)  state  of  benzene  is  calculated  including  all  the 
^ig  ^  ^2  u  transitions  of  each  individual  benzene; 
that  is,  from  28  occupied  molecular  orbitals  into  28 
molecular  orbitals  of  the  virtual  space. 

Table  I  summarizes  the  calculated  numerical 
results  and  compares  them  with  the  experimental 
values  for  the  red  shift  of  benzene  in  condensed 
phase. 

Figure  4  shows  the  calculated  result  for  the 
excitation  energy  of  the  MC  liquid  together  with 
the  final  average  (-306  cm“^)  and  the  experimen¬ 
tal  [20]  (  —  320  cm“^)  values  for  the  shift  compared 
to  the  gas  phase.  It  should  be  noted  that  the  shift 
for  an  individual  structure  may  differ  by  a  few 
hundreds  of  cm“^,  but  the  "ensemble  average" 
final  result  is  in  very  good  agreement  with  the 
experimental  result.  It  is  interesting  to  analyze, 
wherever  possible,  the  contribution  of  the  individ¬ 
ual  benzene  molecules.  This  is,  of  course,  not  al¬ 
ways  possible.  However,  we  note  that  in  many 
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(a) 


[MC  steps] 

FIGURE  2.  Time  correlation  function  (top)  and  the  statistical  inefficiency  (bottom)  of  the  MC  simulation.  See  text. 


2  5  8  11 

r  [Angstrom] 


FIGURE  3.  Comparison  of  the  calculated  and  experimental  [4]  carbon-carbon  radial  distribution  function  of  the  liquid 
benzene. 


888 


VOL.  65,  NO.  5 


BENZENE  IN  CONDENSED  PHASE 


TABLE  I _ 

Summary  of  the  calculated  and  experimental 
results  for  the  energy  shift  (cm 

in  the  absorption  spectra  of  benzene  in 
condensed  phases.^ 


System 

N 

^^cafc. 

Afexp.  [20] 

Gas 

1 

0 

0 

Solid'" 

13 

-260 

-260  ±  30 

14 

-240 

Liquid  (herringbone)'^ 

13 

-100 

-320  ±  30 

14 

-60 

Liquid  (herringbone)'^ 

13 

-100 

-320  ±  30 

14 

-40 

Liquid  (MC)® 

14 

-306 

-320  ±  30 

\s  the  number  of  molecules  included  in  the  quantum 
calculations. 

^  Lattice  parameters  [4]  (7.460, 9.666,  7.034)  in  A. 

^  Density  of  0.874  g  /  cm^  corresponding  to  the  lattice  pa¬ 
rameters  [4]  (6.75,  9.74,  9.03)  in  A. 

^  Density  of  0.899  g  /cm^  (used  in  the  MC  simulation)  corre¬ 
sponding  to  the  lattice  parameters  (6.75, 9.74, 8.78)  in  A. 

®  Average  value  over  60  SM  structures. 


structures  the  transition  is  somewhat  localized  in 
two  benzene  molecules.  Interesting  enough,  these 
two  molecules  are  found  to  be  in  well-known 
conformations,  like  the  T-shape-displaced  and  the 
parallel-displaced  (see  Fig.  5  for  illustration). 
Therefore  we  now  look  at  systems  composed  of 


T -shape-displaced 


Parallel-displaced 


FIGURE  5.  Illustration  of  MC  structures  showing 
the  T-shape,  displaced,  and  parallel-displaced  dimer 
conformations.  These  dimers  give  a  large  contribution  to 
the  total  red  shift.  See  text  and  Table  II. 


300 


0 


-1200  ' — ^ ^ ^ ^ 

0  10  20  30  40  50  60 

Configuration  Number 

FIGURE  4.  Calculated  red  shift  obtained  from  the 
disordered  MC  structures  of  the  liquid  benzene.  Also 
shown  is  the  average  value  (solid  line)  and  the 
experimental  result  (dashed  line). 


only  two  benzene  molecules.  We  find  from  the  MC 
simulation  that  25%  of  the  benzenes  have  the 
nearest  neighbor  within  an  angle  between  0  and 
20°,  corresponding  to  the  parallel  and  parallel- 
displaced  conformations  of  the  dimer.  Similarly, 
~  15%  have  its  nearest  neighbor  within  an  angle 
between  70  and  90°,  corresponding  to  the  T-shape, 
T-shape-displaced,  and  L-shape  conformations.  The 
average  contribution  of  these  "ordered"  dimer 
conformations  is  compared,  in  Table  II,  with  the 
result  obtained  using  all  dimers  generated  in  the 
Monte  Carlo  simulation  irrespective  of  the  angle. 
Dimers  alone  are  not  able  to  give  a  good  result  for 
the  red  shift,  but  it  is  clear  that  those  ordered 
dimer  conformations  make  contributions  that  are 
significant.  We  note,  in  particular,  that  these  paral¬ 
lel  conformations  give  larger  shifts  than  the  per¬ 
pendicular  ones.  This  seems  to  suggest  that  these 
ordered  dimer  conformations  of  benzene  are  of 
importance  in  understanding  the  spectroscopic  red 
shift  of  the  liquid  phase. 
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TABLE  II _ 

Summary  of  the  average  contribution  of  different 
dimer  conformations  to  the  calculated  shift  for 
the  82^(77  -  7T*)  absorption  transition  of 
liquid  benzene.^ 


System 

N 

AEcalc,  (cm 

MC  T-shape 

2 

-98 

MC  T-shape-displaced 
and  L-shape 

2 

-135 

MC  parallel 

2 

-179 

MC  parallel-displaced 

2 

-197 

MC  dimer 

2 

-103 

MC  supermolecule 

14 

-306 

®A/  is  the  number  of  molecules  included  in  the  quantum 
calculations. 


In  the  case  of  the  herringbone  structure  with  14 
molecules,  the  calculated  shift  is  too  small,  only 
-60  cm“h  This  discrepancy,  however,  is  not  found 
in  the  similar  structure  of  the  solid.  In  this  case  the 
calculated  result  is  —240  cm“^  in  good  agreement 
with  the  experimental  value  of  -260  cm“^  [20]. 

A  possible  deficiency  of  the  use  of  the  herring¬ 
bone  structure  shown  in  Figure  1(a)  is  that  there  is 
no  benzene  molecule  in  the  center  of  the  box  to 
represent  the  "bulk."  To  look  at  this  we  have  used 
the  structures  shown  in  Figure  1(b)  for  the  herring¬ 
bone  and  the  solid.  The  calculated  results,  how¬ 
ever,  changed  only  from  the  previous  —  60  to  - 100 
cm“^  showing  only  slight  improvement.  For  the 
solid  it  changed  from  the  previous  -  240  to  -  260 
cm"\  bringing  the  shift  to  a  better  agreement  with 
the  experimental  result. 


Summary  and  Conclusions 

In  this  work  we  have  calculated  the  red  shift  of 
the  first  absorption  band  of  benzene  in  the  con¬ 
densed  phase.  The  herringbone  structure  used  for 
the  liquid  gives  a  red  shift  that  is  only  15-25%  of 
the  experimental  value.  In  turn,  the  use  of  a  disor¬ 
dered  liquid  obtained  from  Monte  Carlo  simula¬ 
tion  gives  a  red  shift  that  is  in  very  good  agree¬ 
ment  with  experiments.  In  this  disordered  liquid 
most  of  the  usual  structures  of  the  benzene  dimer, 
such  as  the  parallel,  parallel-displaced,  T-shape, 
T-shape-displaced,  etc.,  are  included  naturally,  but 
there  is  little  long-range  order.  The  calculated  red 
shift  for  crystalline  benzene  is  also  in  very  good 
agreement  with  the  experimental  shift. 
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ABSTRACT:  The  Kohn-Sham  density  functional  theory  '"exchange'"  potential  v^(r)  = 
8Ef^[  p]/5p(rX  where  Ef^[  p]  is  the  "exchange""  energy  functional,  is  composed  of  a 
component  representative  of  Pauli  correlations  and  one  that  constitutes  part  of  the 
correlation  contribution  to  the  kinetic  energy.  The  Pauli  term  is  the  work  done  W^%t)  in 
the  field  obtained  by  Coulomb's  law  from  the  Fermi  hole  charge  distribution 

constructed  from  the  Kohn-Sham  orbitals.  The  correlation-kinetic  term  is  the  work  done 
in  the  field  ZfKr)  derived  from  the  kinetic-energy-density  tensor  involving  the 
first-order  correction  to  the  Kohn-Sham  single-particle  density  matrix.  The  sum  of  these 
fields  is  conservative,  so  that  the  total  work  done  is  path-independent.  There  is  no 
explicit  correlation-kinetic  contribution  to  the  "exchange"  energy  Ef^[  p].  Its  contribution 
is  manifested  via  the  Kohn-Sham  orbitals  generated  via  the  potential  v^(r).  The  functional 
EfH  p]  is  thus  expressed  in  virial  form  entirely  in  terms  of  the  Pauli  field  In  this 

article,  we  determine  and  study  the  structure  of  the  correlation-kinetic  component  field 
Z^^Kr)  and  work  for  the  nontmiform  electron  density  system  in  atoms  and  at 

metal  surfaces.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  893-906,  1997 


Introduction 

The  Kohn-Sham  [1]  (KS)  density  functional 
theory  [2,  3]  "exchange-correlation"  energy 
functional  p]  of  the  density  p(r)  and  its  func- 

Correspondence  to:  V.  Sahni. 

Contract  grant  sponsor:  Research  Foundation  of  the  City 
University  of  New  York. 


tional  derivative  (potential)  v^^(r)  =  dE^^l  p]/ 
8p(r)  incorporate  correlations  due  to  the  Pauli  ex¬ 
clusion  principle  and  Coulomb  repulsion  as  well 
as  the  correlation  contribution  to  the  kinetic  en¬ 
ergy.  The  functional  and  derivative  also  have  a 
rigorous  physical  interpretation  [4-7]  in  which  they 
are  described  more  insightfully  in  terms  of  these 
correlations.  The  interpretation  involves  a  field 
^(r),  which,  in  turn,  is  a  sum  of  two  fields.  The 
first  of  these  accounts  for  Pauli  and  Coulomb 
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correlations  and  is  determined  by  Coulomb's  law 
from  its  source  which  is  the  quantum  mechanical 
Fermi-Coulomb  hole  charge  distribution.  The  sec¬ 
ond  ZTr)  accounts  for  the  correlation-kinetic  con- 

'■c 

tribution,  and  its  source  is  the  difference  between 
the  kinetic-energy-density  tensors  for  the 
Kohn-Sham  noninteracting  and  Schrodinger  inter¬ 
acting  systems.  The  corresponding  field  is  the 
derivative  of  the  resulting  tensor.  The  functional 
derivative  is  the  sum  of  the  work  W^^(r)  and 
W^(r)  done  in  the  fields  and  Z^(r),  respec¬ 

tively.  Since  the  field  ^(t)  is  conservative,  the  sum 
of  the  work  done  is  path-independent.  The  quan¬ 
tum  mechanical  exchange-correlation  p]  and 
correlation-kinetic  T^[  p]  energy  components  of  the 
functional  Ef^[  p]  can  also  be  expressed  in  virial 
form  in  terms  of  the  respective  fields 
Zf(r)  which  give  rise  to  them.  (As  in  Schrodinger 
theory,  there  is  no  self-interaction  in  the  physical 
description  [4,  5]  of  the  Kohn-Sham  theory  elec¬ 
tron-interaction  energy  functional  and  its  deriva¬ 
tive.) 

In  a  similar  vein,  the  KS  theory  "exchange" 
energy  Ef^[  p],  and  its  functional  derivative  v^ii) 
=  8Ef^[  p]/3p(tX  can  also  be  described  [8]  in 
terms  of  a  field  representative  of  Pauli  correlations 
and  one  representing  part  of  the  correlation- 
kinetic  effect.  Further,  within  the  density  func¬ 
tional  theory  representation  [3,  9]  of  Hartree-Fock 
(HF)  theory,  it  is  also  possible  to  define  [5,  10]  an 
"exchange"  energy  functional  p]  and  po¬ 
tential  p]/5p(r),  whereby  the  HF 

density  and  energy  are  obtained.  This  functional 
and  derivative  can  also  be  described  [4,  5,  10]  in 
terms  of  a  field  representative  of  Pauli  correlations 
as  well  as  one  representing  the  corresponding  cor¬ 
relation-kinetic  effects.  Finally,  in  exchange-only 
[11,  12]  density  functional  theory,  the  correspond¬ 
ing  "exchange"  energy  functional  p]  and 

potential  are  defined  [11]  as  those  of  the 

optimized  potential  method  [13]  (OPM).  Once 
again,  this  energy  and  potential  can  be  described 
[14]  in  terms  of  a  field  representative  of  Pauli 
correlations  and  one  representing  kinetic  effects. 
The  sum  of  these  fields  is  again  conservative.  In 
each  of  the  above  cases,  the  field  which  accounts 
for  Pauli  correlations  is  obtained  by  Coulomb's 
law  from  a  Fermi  hole  charge  distribution.  The 
resulting  potential  is  then  the  work  done  to  move 
an  electron  in  the  field.  This  field  and  potential 
have  been  studied  previously  [6,  15-21]  for  atoms 
and  metal  surfaces  and  their  structure  for  these 
nonuniform  density  systems  understood.  The  defi¬ 


nitions  of  the  corresponding  kinetic  components 
differ,  but  the  fields  and  potentials  are  all  derived 
in  the  same  manner  from  kinetic-energy-density 
tensors.  In  this  article,  we  determine  and  study  the 
structure  of  the  kinetic  component  of  the  KS  "ex¬ 
change"  potential  as  applied  to  atoms  and  metal 
surfaces. 

We  begin  in  the  next  section  by  defining  the 
"exchange"  energy  functionals  and  potentials  for 
the  cases  discussed  above.  Although,  in  principle, 
the  KS,  HF,  and  OPM  orbitals  are  different,  in 
practice,  their  differences  are  minimal  in  that  the 
structure  of  the  corresponding  local  "exchange" 
potentials  is  essentially  equivalent.  In  our  calcula¬ 
tions  on  atoms  (third  section),  we,  therefore,  as¬ 
sume  the  KS  "exchange"  potential  and  orbitals  to 
be  those  of  the  optimized  potential  method.  For 
the  metal  surface  calculations  (fourth  section),  we 
employ  an  accurate  representation  of  the  KS  "ex¬ 
change"  potential  derived  [18,  19]  via  restricted 
differentiation  of  the  functional  Ef^[  p].  This  po¬ 
tential  does,  however,  have  the  exact  [22-24] 
structure  of  the  KS  "exchange"  potential  both  in 
the  asymptotic  vacuum  and  metal  bulk  regions.  A 
summary  of  the  results  and  concluding  remarks 
are  made  in  the  fifth  section. 


Deflnitions  of  “Exchange”  Energy 
Functionals  and  Potentials  in  Terms 
of  Fields 

In  this  section,  we  define  the  "exchange"  energy 
functional  and  potential  in  terms  of  fields  within 
(a)  the  Kohn-Sham  theory  of  the  fully  interacting 
system,  (b)  the  Kohn-Sham  theory  representation 
of  Hartree-Fock  theory,  and  (c)  the  exchange-only 
Kohn-Sham  theory. 

KOHN-SHAM  THEORY 

In  KS  theory,  the  "exchange"  potential  z;^(r), 
which  is  the  derivative  of  the  functional  Ef^[  p],  is 
[8]  the  sum  of  a  Pauli  W/^(r)  and  correlation- 
kinetic  W^^^^(r)  component: 

dEf^[  p] 

=  ",  =  W/"(r)  -  W/'>(r).  (1) 

8p(r) 

Here,  W/^(r)  is  the  work  done  to  move  an  electron 
in  the  field  determined  via  Coulomb's  law 
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from  the  Fermi  hole  charge  p^(t,  r').  Thus, 

W/S(r)  =  -  ■  dV,  (2) 

•'co 


where 


=  f 


pf  (r,r')(r  -  r') 


dr'. 


(3) 


In  turn,  the  Fermi  hole  pf^(r,  rO  at  r'  for  an  elec- 
tron  at  r  is  defined  in  terms  of  the  idempotent 
Dirac  density  matrix  y/r,  r')  constructed  with  the 
KS  orbitals  ^^(r)  as 


pf(r,r')=  -|%(r,r')lV2p(r),  (4) 


where 


0.  The  expression  for  the  exchange  potential  z;^(r) 
of  Eq.  (1)  is  exact  [8],  because  it  incorporates  all 
terms  linear  in  the  coupling  constant.  Higher-order 
perturbation  theory  leads  to  the  correlation  com¬ 
ponent  of  the  KS  energy  and  potential. 

There  is  no  explicit  correlation-kinetic  contribu¬ 
tion  [8]  to  the  KS  ''exchange"  energy  p].  The 
contribution,  however,  is  manifested  indirectly  via 
the  KS  orbitals  so  that  p],  which  is  also 
related  to  its  functional  derivative  i;^(r)  by  the 
virial  theorem  [25],  is  expressed  [4]  entirely  in 
terms  of  the  field  as 


1  .  .p(r)pf(r,rO 
Ef^l  p]  =  -f  j - 1:^ - - dr  dr', 


ir  —  r 


=  -  J  p(r)r  ■  Vu^(r)  dr. 


(10) 

(11) 


ys(r,r')  =  D  </'*(!■)'/', (rO,  (5) 

i 


=  J  p(r)r  ■  dr,  (12) 


such  that  y/r,  r)  =  p(r).  The  work  W(0^(r)  is  that 
done  to  move  an  electron  in  the  field  Z(^Xr): 

*c 


W,f>(r)  =  -  rzWrO-dl',  (6) 

where 


z(r;[7^]) 

p(r) 


(7) 


The  field  is  further  defined  in  terms  of  a 

field  z(r)[y^])  whose  component  z^(r)  is  derived 
from  the  kinetic-energy-density  tensor  t^i^(T;[y^]) 
as 


p=l 


where 


32  \ 


=  (9) 


and  7^  (r,  r')  is  the  first-order  correction  to  the  KS 
density  matrix  7g(r,  rO  as  obtained  via  perturbation 
theory  by  an  expansion  of  the  system  wave  func¬ 
tion  in  terms  of  the  electron-interaction  coupling 
constant.  The  sum  of  the  work  [W/^(r)  ~ 
is  path-independent  since  V  x  -  ZfKh]  = 


with 


f  p(r)r  •  ZfKr)  dr  =  0.  (13) 


KOHN-SHAM  REPRESENTATION  OF 

HARTREE-FOCK  THEORY 

The  KS  "exchange"  energy  functional  p] 

which  leads  to  the  HF  theory  ground-state  energy 
and  density  is  defined  as  [4,  5,  10] 

p]  =  p;y^n  +  T™[  p],  (14) 

where  E^^[  p;y^^]  is  the  HF  theory  exchange 
energy  determined  from  the  idempotent  Dirac 
density  matrix  7™(r,r0  constructed  from  the  HF 
orbitals  [The  definitions  of  these  properties 

are  the  same  as  Eqs.  (4),  (5),  and  (10)  with  il/^ir) 
replaced  by  <^™(r).]  The  correlation-kinetic  en¬ 
ergy  p]  is  defined  as 

(15) 

where  r^^[  p]  is  the  HF  theory  kinetic  energy,  and 
Tg[  p],  the  kinetic  energy  of  noninteracting  fermions 
obtained  from  orbitals  of  the  KS  equation  which 
leads  to  the  HF  theory  density  and  energy. 

The  exchange  energy  E]^^[  p;  7^^]  component  is 
defined  in  terms  of  an  electric  field  ^^^^(r)  derived 
via  Coulomb's  law  from  the  HF  theory  Fermi  hole 
p^^(r,  rO  in  the  same  manner  as  Eq.  (12). 
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The  correlation-kinetic  energy  p]  compo¬ 
nent  is  expressed  in  terms  of  a  field  Z™(r)  as 

T™[  p]  =  -j  drp(r)r  •  Z™(r),  (16) 

where 

Z™(r)  =  -^[z(r;  [y^^])  -  z(r;  [y^^])],  (17) 

the  field  z(r)  is  as  defined  by  Eqs.  (8)  and  (9),  and 
yg^^(r,  rO  is  the  idempotent  density  matrix  con¬ 
structed  from  the  orbitals  of  the  KS  equation  which 
leads  to  the  HF  density  and  energy. 

The  KS  exchange  potential  z;^^(r)  which  gener¬ 
ates  the  HF  theory  density  and  energy  is 

cpKSHFr  ] 

^  ^  ^  (18) 
8p(r) 

where  W/^^(r)  and  W^^^(r)  are,  respectively,  the 
work  done  in  the  fields  and  Zj^^(r).  [See 

Eqs.  (2)  and  (6).]  Once  again,  the  sum  of  the  work 
[W^^^(r)  +  Wj^^(r)]  is  path-independent  since  V  x 

[rf"(r)  +  Zfir)]  =  0. 

KOHN-SHAM  EXCHANGE-ONLY  THEORY 

The  exchange-only  [11,  12]  version  of  KS  theory 
corresponds  to  the  optimized  potential  method 
[13]  (OPM).  The  resulting  exchange  energy 
E^^^[  p;  %^^^]  and  potential  i;^^^(r)  are  deter¬ 
mined  by  a  self-consistent  solution  of  the  OPM 
integral  and  differential  equations.  The  exchange 
energy  is  obtained  through  Eq.  (10)  via  the  Fermi 
hole  p^^^(r,r0  or  density  matrix  rO  con¬ 

structed  from  the  OPM  orbitals  </)P^^(r).  The  po¬ 
tential  z;^^^(r)  may  be  expressed  as  [14] 

yOPM(r)  ^  +  w,/r),  (19) 

where  W^(r)  is  the  work  done  in  the  field  ^x^r) 
determined  via  Coulomb's  law  [see  Eq.  (2)]  from 
the  Fermi  hole  Pj,,(r,  r')  constructed  from  the  or¬ 
bitals  of  the  Work  Formalism  [21,  26]  (WF) 

differential  equation.  [This  equation  is  the  same  as 
the  KS  differential  equation  with  replaced  by 
W^(r).]  With  yg'^^fr,  r'),  the  corresponding  idempo¬ 
tent  density  matrix,  the  work  (r)  is  that  done  in 


the  field  ZTr),  where 

‘■s 

Z,,(r)  -  ^(z(r;  [r""])  -  z(r;  |t,»'1)1  (2® 

and  where  the  field  z(r)  is  as  defined  previously  in 
terms  of  the  appropriate  kinetic-energy-density 
tensor.  [See  Eqs.  (8)  and  (9).]  In  this  exchange-only 
case,  the  virial  theorem  [25]  leads  to  the  energy 
relation 

E?™lp;r°™|  -Er[p;r,”'] 

=  Jp(r)r  •  Z^  (r)  rfr.  (21) 

We  note  that  in  the  derivation  of  Eq.  (19)  it  is 
assumed  that  the  OPM  and  WF  densities  are 
equivalent,  which  is  essentially  the  case  for  both 
finite  and  extended  systems.  However,  the  off-di¬ 
agonal  elements  of  y^^^(r,r')  and  y™(r,r0  are 
not  assumed  to  be  equivalent  so  that  the  field 
Zf(r)  is  finite.  Furthermore,  the  work  W^(r)  and 
W^  (r)  are  separately  path-independent. 

To  summarize,  the  exchange  potential  of  KS 
theory  [Eq.  (1)]  and  that  of  its  representation  of  HF 
theory  [Eq.  (18)]  and  of  the  OPM  [Eq.  (19)]  are  all 
composed  of  two  components.  The  first,  represen¬ 
tative  of  Pauli  correlations,  is  the  work  done  in  a 
field  determined  by  Coulomb's  law  from  the  Fermi 
hole  charge  constructed  with  the  corresponding 
orbitals.  The  second,  representative  of  correlation- 
kinetic  effects,  is  the  work  done  in  a  field  deter¬ 
mined  from  a  corresponding  kinetic-energy-den¬ 
sity  tensor. 


Structure  of  Correlation-Kinetic  Field 
and  Potential  in  Atoms 

To  determine  an  accurate  representation  of  the 
correlation-kinetic  field  Z^^Kr)  and  potential 
W/^Kr)  in  atoms,  we  assume  the  KS  exchange 
potential  v^(r)  to  be  that  of  the  optimized  potential 
method  We  then  determine  the  work  W^(r) 

by  a  self-consistent  solution  of  the  Work  Formal¬ 
ism  differential  equation.  In  other  words,  we  as¬ 
sume  the  "exchange"  problem  to  be  defined  by  the 
case  described  by  Eq.  (19).  We,  however,  refer  to 
the  potentials  and  W^(r)  as  the  KS  ex¬ 

change  potential  v^(r)  and  its  Pauli  component 
W/^(r),  respectively,  since,  as  noted,  the  difference 
between  the  two  sets  of  potentials  is  minimal.  The 
differences  [Vv^(r)  -  and  [z;^(r)  ~  W/^(r)] 
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are  then  the  correlation-kinetic  field  Zf\r)  and 
potential  respectively. 

In  Figures  1  and  2,  we  plot  the  Pauli  component 
field  due  to  the  Fermi  hole  charge  for  Ne 

and  Ar  atoms.  Note  the  shell  structure  as  exhibited 
by  the  field  and  the  fact  that  it  exists  primarily 
within  the  K-shell,  becoming  less  significant  in  the 
outer  shells,  decaying  asymptotically  as  -1/r^.  In 
Figures  3  and  4,  we  plot  the  corresponding  correla¬ 
tion-kinetic  component  field  Zj^Kr)  for  these 
atoms.  In  these  graphs,  we  also  indicated  the  ra¬ 
dial  probability  density,  so  that  it  is  evident  that 
the  field  ZpXr)  exists  principally  in  the  intershell 
region  and  is  oscillatory  there.  Furthermore,  the 
magnitude  of  this  field  is  greatest  in  the  region 
between  the  K  and  L  shells,  becoming  much 
smaller  in  the  outer  intershell  regions.  The  field 
ZPXr)  is  also  short-ranged  in  comparison  to  the 
Pauli  field  vanishing  at  about  the  com¬ 

mencement  of  the  outermost  occupied  shell.  It  is 


also  interesting  to  note  that  there  is  a  small  compo¬ 
nent  of  this  field  present  in  the  interior  of  the 
atom.  Finally,  observe  that  the  field  Z^^Kr)  is  much 
smaller,  about  a  fifth  of  the  magnitude  of  the  Pauli 
field 

In  Figures  5  and  6,  we  plot  the  Pauli  W/^(r) 
and  correlation-kinetic  components  of  the 

KS  "exchange"  potential  v^(r)  for  Ne  and  Ar, 
respectively.  As  known  [6],  the  Pauli  component 
W/^(r)  exhibits  a  shell  structure  and  is  monotonic 
with  a  positive  slope  since  the  field  is 

negative  throughout  space.  It  decays  asymptoti¬ 
cally  as  —  1/r  in  the  classically  forbidden  region 
as  does  v^(rX  Since  the  correlation-kinetic  field 
Zp>(r)  is  oscillatory  and  exists  in  the  intershell 
regions,  the  corresponding  potential  ex¬ 

hibits  a  dip  in  these  regions,  vanishing  well  within 
the  outermost  occupied  shell.  It  is  this  correlation- 
kinetic  component  which  then  gives  rise  to  the 
well-known  bumps  of  the  KS  potential  v^(r)  in  the 
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FIGURE  2.  The  Pauli  component  field  for  the  Ar  atom. 


intershell  regions.  Again,  we  note  that  there  is  a 
small  component  of  this  potential  in  the  interior  of 
atoms.  Finally,  observe  that  is  an  order  of 

magnitude  smaller  than  the  Pauli  potential  W/^(r). 
Thus,  the  correlation-kinetic  effects  contributing 
to  the  KS  potential  v^(r)  are  small.  As  such,  it  is 
justified  to  represent  the  potential  v^(r)  in  finite 
systems  by  its  Pauli  component  The  high 

accuracy  of  this  approximation  with  regard  to  total 
ground-  and  excited-state  energies,  ionization  po¬ 
tentials,  and  electron  affinities  has  been  demon¬ 
strated  in  the  literature  [4,  5,  15,  21,  26]. 


Structure  of  Correlation-Kinetic  Field 
and  Potential  at  Metal  Surfaces 

For  the  jellium  [27]  and  structureless  pseudopo¬ 
tential  [28]  models  of  a  metal  surface,  there  is 
translational  symmetry  in  the  plane  parallel  to  the 


surface  and  the  effective  potential  of  the  electrons 
is  local.  Thus,  the  KS  orbitals  are  of  the  form 

(22) 

where  (k|[,X|j)  are  the  momentum  and  position 
vectors  parallel  to  the  surface,  and  (fc,  x),  the  com¬ 
ponents  perpendicular  to  it.  The  metal  is  assumed 
to  occupy  the  negative  half-space.  For  these  mod¬ 
els,  the  exact  asymptotic  structure  of  the  KS  "ex¬ 
change"  potential  z;^(r)  and  of  its  Pauli  component 
W/^(r)  in  the  classically  forbidden  vacuum  region 
has  been  determined  [20,  22-24].  The  analytical 
expressions  for  these  structures  which  are  valid  for 
fully  self-consistently  determined  orbitals  show  both 
potentials  to  decay  in  an  —x~^  image-potential-like 
manner,  but  with  different  decay  coefficients  which 
depend  upon  the  metal  Fermi  energy  and  surface 
barrier  height.  Thus  [see  Eq.  (1)],  the  asymptotic 


898 


VOL  65,  NO.  5 


CORRELATION-KINETIC  COMPONENT  STRUCTURE 


r  (in  a.u.) 

FIGURE  3.  The  correlation-kinetic  component  field  Z/JMr)  for  the  Ne  atom.  The  radial  probability  density  is  plotted  as 
the  dashed  line. 


structure  of  the  correlation-kinetic  component 
is  also  known  exactly  to  decay  as 

WW(;c)  =  (23) 

^  X 

X^oo 

where  the  decay  coefficient  is 


to  be  exactly 

V^(x)  =  -kp/TT. 

X~>  “00 


For  the  KS  "exchange"  potential  v^(r),  we  em¬ 
ploy  an  accurate  representation  of  it  derived  [18, 
19]  by  restricted  differentiation  of  the  energy  func¬ 
tional  Ef^[  p].  The  resulting  expression  for 

the  potential  which  is  in  terms  of  the  density  p(r) 
and  Slater  potential  [29]  V/(r)  is 


y(0)(r)  = 


2dp(r)/dkp  dk 


—  (p(r)y/(r)),  (25) 


where  V^(r)  is  defined  as 


and  where  (3^  =  W/e^,  with  W  the  barrier  height, 
€p  =  kl/2  the  Fermi  energy,  kp  =  l/ar^  the  Fermi 
momentum,  a~^  =  (97r/4)^/^  and  the  Wigner- 
Seitz  radius.  The  asymptotic  value  of  the  KS  "ex¬ 
change"  potential  in  the  metal  bulk  is  also  known 


r') 

^/(r)=/-r - ^26) 

|r  -  r'l 

Thus,  the  potential  uf  Xr)  incorporates  the  intrinsic 
nonlocality  of  the  problem  through  the  dynamic 
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0.01  O.l  1  10 

r  (in  a.u.) 


FIGURE  4.  The  correlation-kinetic  component  field  Z/JHr)  for  the  Ne  atom.  The  radial  probability  density  is  plotted  as 
the  dashed  line. 


Fermi  hole  charge.  It  also  possesses  [18,  22,  24]  the 
correct  asymptotic  structure  of  v^(r)  in  both  the 
asymptotic  vacuum  and  metal-bulk  regions. 

Now  for  the  orbitals  of  Eq.  (22),  the  expression 
for  the  Slater  potential  in  dimensionless  coordi¬ 
nates  normalized  to  the  Fermi  momentum  is 
[22-24] 

V/(z)  _  4 

(Skp/lv)  TTpniz) 

X  dk'  dk<f)*iz)(f>^.(z)Gik,k’;  z),  (27) 
•'o  *^0 

where 

G(k,k';  z)  =  ^  ^  dqj(q,z) 

•'o 

+  dqjiq,  z){Sj^q)  +  S^iq)}, 

•'A- A' 


Jiq,  z)  =  2  r  dz'e-^l^-^Vfc(z')#(z'), 

—  CO 

(  \2  _ 

S^(q)  =  tan-' - — - 

Aa 

=  S;,(^)Ia-.  A', 


and  where  p^iz)  is  the  density  normalized  to  the 
bulk  value  p  =  /c|/37r^: 

p„(z)  =  3  r  a  -  k^M, (z)l^  dk.  (28) 

•'o 

In  these  equations,  A  =  (1  —  k^Y^^,  A'  =  (1  - 
and  z  =  kpX.  The  expression  for  the  Pauli 
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FIGURE  5.  The  Pauli  and  correlation-kinetic  potentials  for  the  Ne  atom. 


component  field  is  [20] 

4 

Okj/l'ir)  '>rp„(z) 

X  ^ dk'  f'"' dkcl)*(z)<t>k'(z)Ha,k’-,  z),  (29) 
•'o  •'o 

where 

H(k,k')  z)  =  ttA'^  ^  ^  dqqM{q,z) 

•'0 

+  dqqMiq,  z){S^(q)  +  Sy(q)}, 
•'a- A' 

M(q,  z) 

^  if  dz'sgniz  -  z')e~^^^~^'^(l}j^iz')(f)piz'), 

—  CO 


The  work  done  W^iz)  is  then 

W,^^(z)  .z  ^^(z') 

ov^'-IrnlTz^)*'  o® 

For  orbitals  generated  by  model  effective  poten¬ 
tials,  the  spatial  integrals  J(q,  z)  and  M(q,  z)  above 
can  be  determined  analytically.  Thus,  for  such 
orbitals,  the  Slater  potential  V"/(z)  and  the  field 
are  then  entirely  in  terms  of  finite-region 
momentum-space  integrals  and  therefore  easily 
determined  for  ^//-electron  positions  throughout 
space. 

The  expressions  for  the  Slater  potential  V^(z)  of 
Eq.  (27)  and  of  the  work  W/^(z)  of  Eq.  (30)  are 
valid  for  arbitrary  orbitals  <^>^(2).  In  our  calcula¬ 
tions,  we  employ  those  of  the  jellium  and  finite- 
linear-potential  model  [30]  of  a  metal  surface.  The 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


901 


SOLOMATIN  AND  SAHNI 


FIGURE  6.  The  Pauli  W^^Xr)  and  correlation-kinetic  potentials  for  the  Ar  atom. 


advantage  of  employing  these  orbitals  is  that  they 
are  semianalytical  and  lead  to  results  essentially 
equivalent  [17]  to  those  of  fully  self-consistent  cal¬ 
culations.  Furthermore,  as  noted  above,  the  inte¬ 
grals  J(q,  z)  and  M(q,  z)  can  then  be  obtained 
analytically.  The  KS  effective  potential  for  this 
model  is 

v^ffiz)  =  Fz[e(z)  -  e(z  “  z^)]  +  we(z  -  z^), 

(31) 

for  which  the  orbitals  are 

<^^(z)  =  sin[fcz  +  8{k)]0(-z) 

x[eiz)  “  0(z  -  Zj,)] 

+  Dj^exp(-Kj^z)0(z  -  z^),  (32) 


where  k  =  ^^TE ,  =  V2(W-  £),  ^  =  zz^^/^  - 

U.  U  =  F  =  {kl/l)/z„  z,  =  (fc2/2)z,/W, 

E  is  the  energy,  W  is  the  barrier  height,  and 
Ai{  4)  and  Bi{  are  the  Airy  functions.  The  phase 
factor  8{k)  and  the  coefficients  B^,  Q,  and  are 
determined  by  the  requirement  of  continuity  of  the 
wave  function  and  its  logarithmic  derivative  at 
z  =  0  and  z  =  z^. 

The  correlation- kinetic  component  field  Zj^Kz) 
and  potential  W^Kz)  are  determined  in  a  manner 
similar  to  that  for  atoms,  except  that  the  same 
orbitals  (l>i,(z)  are  employed  in  the  calculation  of 
the  potentials  vf\z)  and  W/^(z).  In  Figures  7  and 
8,  we  plot  for  Li  metal  (r^  ^  3.24)  the  Pauli 
and  correlation-kinetic  Z^^Kz)  fields,  respectively, 
employing  the  orbitals  of  Eq.  (32),  The  relationship 
between  and  p  is  determined  via  energy  mini¬ 
mization  in  the  local  density  approximation.  The 
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z  (in  Fermi  wavelengths) 


FIGURE  7.  The  Pauli  component  field  at  the  surface  of  the  Li  metal. 


corresponding  value  of  ^8  =  1.246.  (See  Appendix 
of  [17].)  [The  field  is  also  plotted  elsewhere 

[16,  17,  20]  for  =  2.0  and  6.0.  For  purposes  of 
completeness,  we  include  it  here  for  Li  metal.] 
Observe  that  both  fields  are  concentrated  in  the 
surface  region  and  possess  the  requisite  Bardeen- 
Friedel  [27]  oscillations  in  the  metal  bulk.  Fiow- 


ever,  as  in  the  atomic  case,  the  correlation-kinetic 
field  is  an  order  of  magnitude  weaker  than  is  the 
Pauli  field.  Observe  that  both  fields  decay  many 
Fermi  wavelengths  into  the  vacuum  region.  (For  Li 
metal,  one  Fermi  wavelength  is  5.61  A.)  The  cor¬ 
rectness  of  the  asymptotic  structure  of  the  field 
in  the  vacuum  region  is  demonstrated  in 


z  (in  Fermi  wavelengths) 

FIGURE  8.  The  correlation-kinetic  field  zPHz)  at  the  surface  of  the  Li  metal. 
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z  (in  Fermi  wavelengths) 

FIGURE  9.  Comparison  of  the  asymptotic  structure  of  the  correlation-kinetic  field  Z/JMz)  at  the  surface  of  the  Li  metal 
with  its  exact  analytical  asymptotic  structure  Iz^. 


Figure  9  where  the  exact  analytical  structure 
afKp)/z^  is  also  plotted.  The  calculated  and  ex¬ 
act  structures  can  be  observed  to  have  merged  at  a 
distance  of  about  four  Fermi  wavelengths  from  the 
surface. 


In  Figure  10,  we  plot  the  approximate  KS  ex¬ 
change  potential  vf\z)  and  the  work  done  W^^(z), 
The  potential  vfKz)  which  possesses  the  exact 
[22-24]  asymptotic  structure  of  the  KS  exchange 
potential  v^(z)  in  the  vacuum  region  achieves  this 


z  (in  Fermi  wavelengths) 

FIGURE  10.  The  approximation  v^iz)  to  the  Kohn-Sham  exchange  potential  as  determined  by  Eq.  (25).  The  Pauli 
component  l/14^®(z)  of  Eq.  (30)  is  also  plotted. 


904 


VOL.  65,  NO.  5 


CORRELATION-KINETIC  COMPONENT  STRUCTURE 


limit  by  about  one  Fermi  wavelength  from  the 
surface.  As  is  evident  from  the  figure,  it  also  has 
the  correct  limiting  value  of  -2/3  (in  units  of 
Skp/lTr)  of  v^(z)  in  the  metal  bulk.  The  asymp- 
totic  structure  of  W/^(z)  in  the  vacuum  differs 
from  that  of  the  KS  potential  v^(z)  as  has  been 
shown  [20]  analytically.  The  limiting  value  of 
W^^(z)  in  the  metal  bulk  is  also  different  [20], 
being  -0.757  (in  units  of  3kp/27r)  for  Li  metal. 
The  resulting  correlation-kinetic  potential  compo¬ 
nent  W^^^Kz)  is  plotted  in  Figure  11.  Note  that  this 
potential  is  entirely  positive  and  possesses  the 
correct  asymptotic  structure  of  Eq.  (23)  in  the  vac¬ 
uum.  The  potentials  vfKz),  W^^(z),  and  W/^^(2) 
all  exhibit  the  Bardeen-Friedel  oscillations  within 
the  metal.  Once  again,  the  potential  W^^^\z)  is  an 
order  of  magnitude  smaller  than  W/^(z).  How¬ 
ever,  as  the  asymptotic  structure  of  the  KS  poten¬ 
tial  z^;^.(z)  and  its  Pauli  component  W/^(z)  differ 
in  both  the  vacuum  and  metal  bulk  regions,  the 
significance  of  the  correlation-kinetic  term 
should  be  greater  in  the  metal  surface  case  than  it 
is  for  atoms. 


Conclusions  and  Summary 

The  KS  ^'exchange"  potential  v^(r)  is  composed 
of  a  term  W/^(r)  representative  of  Pauli  correla¬ 
tions  and  another  which  constitutes  part  of 


the  correlation-kinetic  effects.  These  components 
of  the  potential  i;^(r)  arise,  respectively,  from  fields 
and  Z^^^(r)  representative  of  these  correla¬ 
tions.  In  this  article,  we  studied  the  structure  of  the 
correlation-kinetic  field  ZfXr)  and  potential 
as  applied  to  finite  atomic  and  extended 
metal  surface  systems.  For  these  systems,  this  field 
and  potential  are  about  an  order  of  magnitude 
smaller  than  the  corresponding  Pauli  components. 
In  atoms,  the  field  Zf^(r)  exists  principally  in  the 
intershell  regions  and  is  oscillatory  there.  This  then 
gives  rise  to  a  trough  in  the  resulting  potential 
in  these  regions.  Since  the  Pauli  component 
PV/^(r)  is  monotonic  with  a  positive  slope  every¬ 
where,  the  intershell  bumps  of  the  KS  potential 
v^(r)  can  now  be  understood  to  be  entirely  a  conse¬ 
quence  of  correlation-kinetic  effects.  Although 
there  is  a  small  component  of  the  field  Z/Kr)  in  the 
deep  interior  of  atoms,  this  field  is  short-ranged, 
vanishing  within  the  last  occupied  shell.  The 
asymptotic  structure  —  1/r  of  v^(r)  in  the  classi¬ 
cally  forbidden  region  is  therefore  due  entirely  to 
its  Pauli  component  W/^(r).  Since  the  field  Z^^Kr) 
in  atoms  is  small  and  short-ranged,  the  KS  poten¬ 
tial  v^(r)  can  be  approximated  accurately  by  its 
Pauli  component  W/^(r).  This  is  borne  out  in  the 
literature  [4,  15].  For  example,  total  ground-state 
energies  of  atoms  with  this  approximation  lie 
within  25  ppm  of  those  of  the  OPM,  being  within  5 
ppm  for  atoms  heavier  than  Kr. 


z  (in  Fermi  wavelengths) 

FIGURE  11.  The  correlation-kinetic  component  of  the  approximate  Kohn-Sham  exchange  potential  V'fHz). 

The  function  of  ajp(/3)/z  is  also  plotted. 
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At  a  metal  surface,  the  correlation-kinetic  field 
Zf  Xr)  is  concentrated  mainly  in  the  surface  region. 
The  field,  however,  is  long-ranged.  It  decays 
asymptotically  in  the  vacuum  as  [20]  with 

the  coefficient  dependent  upon  the  metal  Fermi 
energy  and  surface  barrier  height.  It  also  extends 
deep  into  the  metal,  exhibiting  the  requisite 
Bardeen-Friedel  oscillations  there.  The  resulting 
correlation-kinetic  potential  W^^^Kr)  is  therefore 
also  long-ranged  in  the  vacuum  with  a  structure  of 
Further,  it  too  exhibits  the  Bardeen- 
Friedel  oscillations  in  the  metal  and  has  a  finite 
positive  limiting  value  in  the  metal  bulk.  The  Pauli 
component  W/^(r)  is  also  long-ranged  [20],  with 
an  —x~^  image-potential-like  structure  in  the  vac¬ 
uum  similar  to  that  of  the  KS  potential  i7^(r),  but 
with  a  decay  coefficient  that  is  different.  In  the 
metal  bulk,  W/^(r)  also  has  a  finite  (negative) 
limiting  value  which  is  close  to  but  less  than  that 
of  which  is  —kp/n.  Thus,  although  the  con¬ 
tribution  of  v^ir)  to  properties  such  as  the  surface 
energy  and  work  function  will  be  due  principally 
to  its  Pauli  component,  we  expect  the 
correlation-kinetic  contribution  in  this  instant  not 
to  be  insignificant.  The  long-range  of  both  the 
Pauli  and  correlation-kinetic  components  of  v^(r) 
in  the  vacuum  is  also  of  significance  from  the 
perspective  of  various  surface  probes  such  as  the 
scanning-tunneling  microscope  as  well  as  for  the 
interaction  of  electrons,  ions,  and  positrons  with 
the  surface. 
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ABSTRACT:  Density  functional  theory  for  the  pair  potential  in  liquid  Na  near  melting 
is  first  compared  with  that  extracted  by  liquid  structure  inversion.  All  the  main  features 
are  in  quantitative  agreement.  Quantum  chemical  representations  of  metallic  crystalline 
energies  are  then  treated  as  a  function  of  local  coordination  number  and  near-neighbor 
distance.  K  and  Cu  are  especially  referred  to,  in  different  lattice  structures.  Finally,  the 
Tersoff  potential  based  on  density  functional  theory  is  discussed  in  relation  to  the 
cleavage  force  in  ideal  crystalline  Si,  and  is  related  to  bond  breaking.  ©  1997  John  Wiley 
&  Sons,  Inc.  Int  J  Quant  Chem  65:  907-917,  1997 


Background 

The  search  for  force  fields  in  condensed  matter 
goes  back  to  at  least  the  so-called  rigid-ion 
model  in  the  1930s.  The  idea  was,  say  for  a  simple 
metal  crystal  like  Na  or  Al,  to  settle  on  a  model  for 
the  charge  density  to  associate  with  a  particular 
nucleus,  the  superposition  of  which  on  each  nu¬ 
cleus  would  lead  to  the  periodic  density  in  the 
crystal.  But  the  screened  ion  was  to  be  chosen 
judiciously  so  that  as  the  ions  vibrated  about  their 
equilibrium  lattice  positions  the  blobs  of  localized 
charge  density  moved  rigidly  with  the  nuclei.  This 
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was  understood  at  that  time  to  imply  pair  forces 
between  nuclei.  This  idea  was  formalized  in  the 
1960s  through  pseudopotential  theory.  The  essen¬ 
tial  equation  embodying  these  approaches  can  then 
be  written  for  the  internal  energy  E  as 

£  =  (1) 

i<j 

Thus,  the  energy  is  divided  into  two  parts:  a  vol¬ 
ume-dependent  part  E^,  with  ft  denoting  specifi¬ 
cally  the  atomic  volume,  and  a  pair  potential  con¬ 
tribution  (f),  which  is  itself  density-dependent  as 
explicitly  noted  in  writing  Eq.  (1).  Clearly,  Eq.  (1) 
is  valuable  for  comparing,  say,  crystalline  phases 
in  which  atomic  rearrangements  are  made  at  a 
constant  volume.  In  the  next  section,  density  func¬ 
tional  theory  (DFT)  will  be  used,  in  the  manner  of 
Perrot  and  March  [1],  to  obtain  the  pair  potential 
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(/>(r)  in  the  liquid  phase  of  metallic  Na,  just  above 
its  freezing  point.  Reference  will  also  be  made,  but 
now  briefly,  to  both  K  and  Be.  However,  emphasis 
will  be  placed  on  the  fact  that,  for  Na,  contact  can 
be  established  with  an  "experimental  potential" 
derived  by  inverting  the  measured  liquid  structure 
factor  S(q),  derived  from  either  X-ray  or  neutron 
diffraction  experiments.  To  date,  this  has  not 
proved  possible  for  K  and  Be,  although,  in  the 
former  case.  X-ray  diffraction  data  are  available  in 
the  work  of  Greenfield  et  al.  [2]. 

Attention  will  then  move  in  the  third  section  to 
a  different  type  of  representation  of  the  ground- 
state  energy  of  crystals,  namely,  through  direct 
appeal  to  the  dependence  on  local  coordination. 
This  approach  goes  back  at  least  to  the  formulation 
of  Poshusta  and  Klein  [3]  and  was  worked  out  in 
the  alkali  metals  by  Malrieu  et  al.  [4].  The  specific 
form  to  be  discussed  in  the  third  section  follows 
directly  the  work  of  March  et  al.  [5].  Instead  of  Eq, 
(1)  for  a  crystal  with  a  coordination  number  and 
near-neighbor  distance  Tq,  one  writes 

r^)  =  -cR(ro)  -/(c)g(ro),  (2) 

where  it  is  of  the  essence  of  the  method  that  jR(ro) 
and  gir^)  are  determined  explicitly  by  free-space 
dimer  potential-energy  curves.  The  detailed  forms 
to  be  chosen  for  the  functions  R,  g,  and  /  are 
discussed  in  the  third  section.  Then,  in  a  similar 
vein,  the  directionally  bonded  Si  crystal  will  be 
discussed,  the  example  taken  being  the  Tersoff 
potential  [6]  which  is  fitted  to  the  density  func¬ 
tional  calculations  of  Yin  and  Cohen  [7].  The  type 
of  description  used  in  the  third  section  is,  in  fact, 
characteristic  of  the  so-called  glue  models  of  inter¬ 
atomic  forces  used  by  solid-state  theorists.  The 
Tersoff  potential  will  be  illustrated  specifically  by 
using  it  to  calculate  the  cleavage  force  in  Si,  fol¬ 
lowing  the  very  recent  work  of  Matthai  and  March 
[8].  We  must  note  here,  however,  that  there  is 
already  a  body  of  work  in  which  high-precision, 
all-electron,  local  spin  density  approximation 
(LSD A)  calculations  on  films  and  slabs  (especially 
metals)  exist,  from  which  cleavage  energies  can  be 
deduced.  At  least  some  of  these  calculations  have 
reported  what  amounts  to  the  first  derivative  of 
the  energy  with  respect  to  the  interplanar  distance 
(see,  e.g.,  Boettger  et  al.  [9],  R5sch  et  al.  [10],  Smith 
et  al.  [11],  Gay  et  al.  [12],  and  others). 

Then,  in  the  fourth  section,  some  attention  is 
given,  reverting  back  to  the  type  of  philosophy 


underlying  the  old  rigid-ion  model,  to  the  precise 
formulation  of  lattice  dynamics  in  relation  to  force 
fields,  via  DFT.  Since  phonon  dispersion  relations 
turn  out  to  be  described  in  terms  of  two  densities, 
obviously,  that  of  the  perfect  periodic  crystal  and 
also  the  change  due  to  "freezing  in"  a  phonon,  one 
cannot  only  formulate  a  correct  "vector  rigid-ion" 
model  in  lattice  dynamics  but  also  one  can,  but 
now  in  the  elastic  region  only,  formulate  the  cleav¬ 
age  force  in  terms  of  phonon  dispersion  relations, 
as  in  the  original  work  of  Zaremba  [13],  which  was 
extended  by  Kohn  and  Yaniv  [14].  With  this  as 
background  to  the  present  review,  let  us  turn 
immediately  to  discuss  the  calculation  of  the  pair 
potential  entering  Eq.  (1)  for  metallic  Na  in  the 
liquid  phase,  just  above  its  freezing  point. 


Density  Functional  Theory  (DFT)  of 
Pair  Potentials  in  Simple  sp  Liquid 
Metals,  Especially  Na  Near  Freezing 

The  treatment  below  follows  closely  that  of  Per- 
rot  and  March  [1].  These  workers  adopted,  essen¬ 
tially,  within  DFT,  the  philosophy  of  the  rigid-ion 
model  referred  to  above.  Their  building  block  was 
therefore  the  screening  charge  around  a  single 
Na"^  ion  plunged  into  a  bath  of  conduction  elec¬ 
trons  precisely  equal  to  that  in  liquid  metal  Na 
near  freezing.  This  screening  charge  was  calcu¬ 
lated  by  DFT,  from  the  Kohn-Sham  equations 
using  a  local  density  approximation  (LDA).  Figure 
1  shows  that  the  total  charge  Q(r)  enclosed  within 
a  sphere  of  radius  r  centered  on  the  Na  nucleus  is 
approaching  the  asymptotic  value  of  unity  at  large 
r.  This  is  a  requirement  imposed  by  the  fact  that 
long-range  electric  fields  cannot  exist  in  a  metallic 
medium.  The  so-called  Friedel  oscillations  in  the 
screening  charge  are  in  evidence;  they  fall  off 
grossly  as  cos(2kj:r)/r^,  with  2fcj  denoting  the 
diameter  of  the  Fermi  sphere,  determined,  in  turn, 
by  the  mean  conduction  electron  density  into  which 
the  metal  ion  Na"^  is  plunged. 

Perrot  and  March  then  assumed  that  the 
ground-state  electron  density  of  the  two-center 
problem  is  the  superposition  of  the  one-center  den¬ 
sities.  They  thereby  calculated  the  change  in  ki¬ 
netic  energy,  AT(R)  say,  as  the  pair  of  screened 
ions  brought  from  infinite  separation  to  the  speci¬ 
fied  internuclear  separation  R.  This  change  in  ki¬ 
netic  energy  is  naturally  calculated  from  the 
Kohn-Sham  wave  functions.  One  also  must  calcu- 
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FIGURE  1.  Integrated  displaced  charge  0(r)  contained 
with  a  sphere  of  radius  r  centered  on  a  Na'^  ion 
embedded  in  an  initially  uniform  electron  gas  of  density 
equal  to  the  conduction  electron  density  in  liquid  metal 
near  freezing.  Note  that  (a)  0(r)  ^  1  for  perfect 
screening  of  the  monovalent  ion  Na'^  in  this  conducting 
medium  and  (b)  the  corresponding  pair  potential  <^(r) 
given  by  Perrot  and  March  [1]  have  a  prominent  turning 
point  at  the  position  marked  by  the  arrow  (see  also 
Fig.  2). 

late  the  potential  energy  change  AL[(R)  in  assem¬ 
bling  the  pair  from  infinite  separation.  This  latter 
quantity  is,  in  fact,  determined  solely  by  the  total 
charge  Q(r)  defined  above. 

To  be  more  specific,  Perrot  and  March  showed 
from  DPT  based  on  the  superposition  density  that 
if  one  denotes  the  kinetic  energy  change  AT(R), 
plus  exchange  and  correlation  energy  change  in 
LDA,  by  the  sum  AG(K),  then  the  basic  equation 
for  the  pair  potential  has  the  structure 

=  AG(K)  +  AL[(jR).  (3) 

As  already  remarked,  the  quantity  AU(R)  is  speci¬ 
fied  uniquely  in  terms  of  Q(R)^  The  reader  requir¬ 
ing  fuller  details  is  referred  to  the  articles  of  Perrot 
and  March  [1].  Their  pair  potential  obtained  using 
LDA  for  the  exchange-correlation  potential  V^^ir) 
in  the  Kohn-Sham  equations  is  shown  for  Na 
liquid  metal  at  freezing  in  the  upper  curve  at  large 
r  in  Figure  2.  We  note  briefly  that  a  similar  pair 
potential  was  derived  for  divalent  liquid  Be  metal 
[1]  and  subsequently  also  for  liquid  metal  K  [15]. 

COMPARISON  OF  DFT  PAIR  POTENTIAL 

FOR  Na  WITH  DIFFRACTION  POTENTIAL 

FROM  LIQUID  STRUCTURE  FACTOR 

INVERSION 

The  purpose  of  this  section  is  to  outline  the  way 
in  which  this  DFT  pair  potential  for  liquid  Na  can 
be  brought  into  direct  contact  with  an  "experimen- 


FIGURE2.  Pair  potentials  0(r)  In  liquid  Na  just  above 
freezing  point,  in  units  of  =  p  with  the  melting 
temperature.  Upper  curve  at  large  r:  DFT  0(r);  lower 
curve  at  large  r:  “diffraction”  potential  obtained  by  Reatto 
[17]  by  inverting  liquid  structure  factor  S(q)  measured 
using  X-rays  by  Greenfield  et  al.  [2]. 


taT'  diffraction  potential.  The  idea  of  inverting  the 
experimentally  measured  liquid  structure  factor 
S(q)  goes  back  to  the  proposal  of  Johnson  and 
March  [16].  These  workers  pointed  out  that  a  pair 
potential  (/)(r)  could  be  extracted,  at  least  in  prin¬ 
ciple,  from  the  force  equations  of  the  BGY  classical 
statistical  mechanical  hierarchy.  If  we  write  for  the 
(classical)  nuclei  in  a  liquid  metal  near  freezing  the 
pair  distribution  function  g(r-^2^,  essentially,  the 
Fourier  transform  of  the  measured  structure  factor 
S(qX  in  the  Boltzmann  form 

g(ri2)  =  exp(-U(r^2^/kQT),  (4) 

where  U(r^2^  is  the  potential  of  mean  force  and  fcg 
is  Boltzmann's  constant,  then  the  force  equation 
reads 


^U(r,2) 


^®(ri,r2,r3) 

- ^ - 

g(r^2)  dx^ 


(5) 


This  equation  is  readily  interpreted.  The  left-hand 
side  is  evidently  the  total  force  on  atom  1.  The 
right-hand  side  expresses  this  as  a  sum  of  two 
parts:  The  first  is  the  direct  pair  interaction  from  a 
second  atom  at  distance  r^2  from  atom  1.  The 
second  term  comes  from  the  force  exerted  by  an 
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atom  at  weighted  with  the  probability  g^/g, 
where  g^^^  is  the  three-atom  correlation  function, 
while  g(r^2^  in  the  denominator  takes  care  of  the 
fact  that  there  is  certainly  a  second  atom  at  dis¬ 
tance  r^2  froni  atom  1.  In  early  work  [16],  had 
to  be  approximated  (e.g.,  as  a  product  of  three 
g(r)'s),  and  this  was  uncontrolled.  However,  Reatto 
et  al.  [17]  brought  the  scheme  to  full  fruition  for 
Na  by  using  computer  simulation  to  bypass  the 
need  to  approximate  the  three-particle  correlation 
function,  and  their  pair  potential  for  liquid  Na 
near  freezing  is  compared  with  the  density  func¬ 
tional  pair  potential  of  Perrot  and  March  [1]  in 
Figure  2.  Although  small  quantitative  differences 
exist,  it  is  remarkable  that  all  the  main  features  of 
the  diffraction  potential  are  reflected  in  the  DFT 

This  is  made  plain  in  Table  I,  where  the 
principal  features  of  the  DFT  potential  are  com¬ 
pared  with  those  of  the  diffraction  potential.  We 
stress  that  there  were  no  adjustable  parameters  in 
the  DFT  calculation,  the  input  merely  being  the 
valence  of  unity  for  Na,  the  experimental  density 
of  the  liquid  at  freezing,  and,  of  course,  the  LDA 
approximation  to  inserted  into  the 

Kohn-Sham  equations. 

It  is  worth  adding  here  that  the  asymptotic  form 
of  (/>(r)  has  subsequently  been  studied  further  in 
the  work  of  Blazej  and  March  [18].  These  workers 
took  the  "sphericaT'  blobs  of  screening  charge 
aroimd  the  pair  of  ions  at  separation  R  and  al¬ 
lowed  the  blobs  to  float  off  the  nuclei,  although 
remaining  undeformed.  They  then  show  that  a 
correction  to  the  usual  asymptotic  form  of  </)(r) 
results,  to  yield  the  result 

Acos(2fcrr  +  a)  B 

Mr)  =  - 3 - j:  r  0.  (6) 

In  Eq.  (6),  A  denotes  the  amplitude  of  the  Friedel 
oscillatory  form  and  a  is  a  phase  shift.  Blazej  and 
March  displayed  the  form  of  the  next  term  as 
proportional  to  r“^.  However,  more  work  has  to 


be  done  on  the  range  of  validity  of  Eq.  (6),  which 
probably  will  come  into  its  own  at  larger  separa¬ 
tion  r  than  is  plotted  in  Figure  2.  The  generaliza¬ 
tion  of  Eq.  (6)  to  forces  between  a  pair  of  atoms 
outside  a  metal  surface  has  been  given  by  Blazej  et 
al.  [19]. 

We  turn  from  this  discussion  of  pair  potentials 
within  the  framework  of  Eq.  (1)  to  the  very  differ¬ 
ent  quantum  chemical  model  expressed  in  Eq.  (2). 
Of  course,  the  present  use  of  Eq.  (2)  is  for  crystal 
lattices  with  different  local  coordination  c,  while  a 
merit  of  Eq.  (1)  in  the  above  context  was  that  it 
could  be  applied  not  only  to  long-range  ordered 
assemblies  but  also  to  condensed  metallic  phases 
with  short-range  order. 


Coordination  Number-dependent 
Force  Fields 

COLD  METAL  CRYSTALS 

March  et  al.  [5]  very  recently  used  a  near¬ 
neighbor  Heisenberg  Hamiltonian  approach,  based 
on  results  of  Malrieu  et  al.  [4]  for  alkali  and  noble 
metals  to  set  up  a  coordination  number-dependent 
pair  potential  </>(r,  c),  with  c  denoting  the  coordi¬ 
nation  number.  The  Heisenberg  Hamiltonian  used 
is  basically  characterized  by  properties  of  the  free- 
space  dimer,  say  Na2  in  the  alkalis  or  Cu2  in  the 
noble  metal.  Both  ground-state  and  triplet-state 
potential-energy  curves  are,  however,  now  in¬ 
volved — the  latter  presenting  some  practical  diffi¬ 
culties  for  DFT,  which  is  still  mainly  useful  as  a 
ground-state  theory  (see,  however,  Theophilou 
[20]). 

Let  us  follow  these  brief  comments  by  summa¬ 
rizing  the  quantitative  results  of  Malrieu  et  al,  [4] 
for  three  (say  alkali)  metal  structures:  fee,  bcc,  and 
simple  cubic  (sc).  One  then  has  for  the  cohesive 
energy  near-neighbor  distance  Tq,  with 


TABLE  I _ 

Principal  features  of  DFT  potential  compared  with  “experimental”  diffraction  potential  for  liquid  Na 
near  freezing. 


Positions  of  turning 

Principal 

First 

Second 

First 

points  and  nodes  (A) 

minimum 

maximum 

minimum 

node 

3.73 

5.67 

7.37 

3.20 

Diffraction'^  (f>{r) 

3.9 

5.76 

7.44 

3.3 

® Perrot  and  March  [1]. 
^Reatto  [17]. 
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the  coefficients  of  R  and  g  in  Eq.  (2)  taken  from 
[4]: 

(i)  fcc(c  =  12)  :  £i2(ro)  =  6R(ro)  -  4g(ro)  (7) 

(ii)  bcc(c  =  8)  :  Egirg)  =  ARir^)  -  4g(ro) 

(iii)  sc(c  =  6)  :  EgCrf,)  =  SRir^)  -  Sgirg) 

Within  the  philosophy  of  this  work  (compare 
also  the  Tersoff  work  [6]  on  Si),  March  et  al.  [5] 
effected  an  approximate  generalization  of  Eqs.  (7) 
to  read 

= -cRiro) -f(c)g(ro).  (8) 

The  form  of  the  first  term  on  the  right-hand  side  of 
Eq.  (8)  may  be  motivated  as  a  simple  additive  sum 
of  the  interactions  between  nearest-neighbor  pairs. 
The  "correction"  term  involving  the  coordination 
number-dependent  function  /(c)  is  less  easy  to 
justify,  although  there  are  formulas  given  in  Mal- 
rieu  et  al.  accurate  through  second-order  of  pertur¬ 
bation  theory.  In  Eq.  (8)  as  applied  to  metallic  Na 
say,  Rir^)  is  the  triplet  potential  curve  of  the 
free-space  dimer,  while  gir^)  represents  the  "ex¬ 
change"  part,  gir^)  is  precisely  one-half  of  the 
difference  between  the  triplet  and  the  singlet 
potential-energy  curves.  March  et  al.  noted 
that,  given  these  potential-energy  curves  (say  from 
DFT),  the  procedure  set  out  fully  by  Carlsson  et  al. 
[21]  can  then  be  used  to  derive  a  pair  potential 
</)(r,  c)  having  the  shape 

4)^(r)  =  ccjy-^ir)  ~  f(c)(f)2(r).  (9) 

Concerning  the  function  f{c),  it  is  relevant  here 
to  note,  following  March  et  al.  [5],  that  Robertson 
et  al.  [22]  took  a  fixed  near-neighbor  distance  Tq 
equal  to  that  in  fee  Al  and  performed  DFT  calcula¬ 
tions  as  a  function  of  the  coordination  number. 
They  can  fit  their  results  for  the  energy  Eir^)  by 
writing 

E(rQ)  =  cA^  +  c^/^A2.  (10) 


While  it  must  not  be  assumed  that  the  c^^^  fit  will 
work  away  from  their  rg,  the  sublinear  depen¬ 
dence  of  f(c)  in  Eq.  (8)  is  already  evident  (but  now 
for  Na  say)  from  Eq.  (7). 

DFT  AND  TERSOFF  POTENTIAL  FOR  Si 

We  turn  from  sp  metals,  with  essentially  nondi- 
rectional  bonding,  to  covalently  bonded  Si.  The 
focus  will  be  the  work  of  Tersoff  [6],  who  pre¬ 
sented  convincing  evidence  that  the  atomic  coordi¬ 
nation  number  c  is  the  main  variable  determining 
the  now  directional  bonding  properties  of  Si  in 
different  structures. 

The  central  idea  that  the  bond  order,  a  measure 
of  bond  strength,  depends  upon  the  local  environ¬ 
ment  can  already  be  found  in  the  study  of  Abell 
[23].  He  argued  that  an  atom  with  numerous 
neighbors  makes  weaker  chemical  bonds  than  an 
atom  will  with  only  a  few  neighbors.  For  more 
recent  work  on  bond-order  potentials,  the  reader 
may  refer  to  Alinaghian  et  al.  [24],  where  extensive 
references  are  given. 

What  we  must  stress  in  the  present  context  is 
that  the  above  ideas  have  been  made  fully  quanti¬ 
tative  for  Si  using  DFT.  Thus,  the  work  of  Yin  and 
Cohen  [7]  is  based  on  self-consistent  pseudopoten¬ 
tial  calculations  using  EDA.  From  their  work  ([7]; 
see  also  Tersoff  [6]),  we  constructed  Table  II  which 
records  DFT  values  for  the  cohesive  energy  of  Si 
with  different  atomic  coordination  numbers.  Both 
cohesive  energy /bond  and  per  atom  are  collected 
in  Table  II.  As  to  the  former  case,  the  first  show 
row  of  this  table  shows  a  clear,  monotonic  de¬ 
crease  as  the  coordination  number  increases,  which 
is  a  quantitative  account  for  Si  of  the  arguments  of 
Abell  [23].  The  cohesive  energy  per  atom,  in  con¬ 
trast,  goes  through  a  maximum  as  the  coordination 
number  c  increases  through  diamond,  leveling  off 
at  higher  values  of  c. 

Below,  we  focus  on  the  use  of  the  Tersoff  poten¬ 
tial  for  Si,  constructed  to  embody  almost  all  the 
DFT  results  in  Table  II,  for  calculating  the  cleavage 


TABLE  II _ _ 

DFT  values  for  cohesive  energy  of  Si  (in  eV).® 


Dimer 

Graphite 

Diamond 

Simple  cubic 

fee 

Cohesive  energy  /  bond 

3.2 

2.6 

2.3 

1.4 

0.7 

Cohesive  energy  /  atom 

1.6 

3.9 

4.7 

4.3 

4.1 

^Constructed  from  Yin  and  Cohen  [7];  see  also  Tersoff  [6]. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


911 


MARCH 


force  as  a  function  of  interplanar  separation  [8]. 
But  two  points  should  be  made  first  in  this  connec¬ 
tion:  First,  there  are,  as  the  author  himself  empha¬ 
sizes,  overly  soft  bond-angle  forces  in  Tersoff s 
parametrization  of  the  DFT  results.  He  himself 
stresses  the  need  for  an  improved  treatment  of 
bond-angle  forces  (see,  again,  the  recent  study  of 
Alinaghian  et  al.  [24]).  Second,  Tersoff  [6]  also 
emphasized  the  merits  of  his  potential  as  applied 
to  &(100)  and  Si(lll)  surfaces:  These  are  a  focal 
point  of  the  study  of  Matthai  and  March  [8]  to  be 
summarized  later. 


Cleavage  Force  Between  Planes  of 
Atoms  in  sp  Metals  and  in  Si 


POSITION  AND  HEIGHT  OF  MAXIMUM  IN 

CLEAVAGE  FORCE  F{z)  FOR  Si:  RELATION 

TO  BOND  BREAKING 

Matthai  and  March  [8]  discussed  the  cleavage 
force  in  a  model  of  Si  with  a  chemically  bonded 
force  field.  In  their  model,  a  cleaved  piece  of  Si  has 
parallel  faces  at  separation  a  -\-  z,  with  a  the  origi¬ 
nal  interplanar  spacing  perpendicular  to  the  cleav¬ 
age  plane. 

These  workers  then  presented  an  estimate  for  a 
block  of  Si  of  the  maximum  value  of  F(z)  and  its 
position,  say,  using  the  coordination-depen- 
dent  interatomic  potential  of  Tersoff  [6],  already 
discussed  above. 

Matthai  and  March  considered  two  cases  (see 
also  Fig.  3): 


JELLIUM  MODEL  FOR  sp  METALS:  ELASTIC 
REGIME 

We  shall  refer  to  the  phonon  theory  below  to 
summarize  the  way  to  calculate  the  cleavage  force 
in  the  elastic  regime  from  appropriate  dispersion 
relations,  following  Zaremba  [13]  and  Kohn  and 
Yaniv  [14].  However,  let  us  first  obtain  for  simple 
sp  metals  the  gross  variation  of  the  cleavage  force 
f(z)  in  the  elastic  regime,  where 

f(z)  =  Az,  (11) 

with  z  the  interplanar  separation.  The  work  of 
March  and  Paranjape  [25]  used  the  simplest  DFT, 
namely,  the  linearized  Thomas-Fermi  (TF)  approx¬ 
imation,  combined  with  a  semi-infinite  jellium  ap¬ 
proach  to  obtain  (see  also  Heinrichs  [26]) 


(i)  a  Si(lll)  surface,  with  one  bond  cut  and 

(ii)  a  Si(lOO)  surface,  with  breaking  of  two 
bonds. 


The  modeling  of  the  surface  was  carried  out  by 
having  12  planes  of  movable  atoms  on  top  of  a 
fixed  substrate.  The  system  was  allowed  to  relax 
by  molecular  dynamics.  The  atoms  on  the  surface 
planes  were  held  fixed,  with  the  remainder  per¬ 
mitted  to  relax.  As  the  energy  £(z)  is  a  function  of 
the  separation  z  of  atoms  in  the  two  surface  planes, 
the  cleavage  force  F(z)  then  is  given  by 


F(z) 


dE(z) 

dz 


(14) 


A  =  (12) 

with  the  mean  interelectron  separation,  related 
to  density  by  =  3/47rr5^.  Their  value  of  the 
constant  t  was  refined  (and  substantially  in¬ 
creased)  according  to  the  subsequent  work  of 
Streitenberger  [27]  who  gave  the  result 

he 

t  -  0.229— TTy.  (13) 

Having  summarized  this  simple  approach  to  treat 
the  elastic  regime  in  sp  metals,  we  note  that  the 
same  model  can  be  used  on  a  jellium  cylinder  to 
predict  size  effects  in  atomic  force  microscopy  (see 
the  Appendix).  Let  us  turn  next  to  the  recent  work 
on  the  cleavage  force  in  ideal  Si  crystal,  treated  by 
Matthai  and  March  [8]. 


The  form  of  £(z)  obtained  by  Matthai  and  March 
is  shown  in  Figure  4. 

The  maximum  in  cases  (i)  and  (ii)  occurs  when 
z  lies  between  0.8  and  0.95  of  the  original  interpla¬ 
nar  separation  a.  To  gain  insight,  Matthai  and 
March  considered  the  H2  molecule  as  it  is  pulled 
apart  from  its  equilibrium-internuclear  separation, 
say.  Using  the  Coulson-Fischer  wave  function 
[28],  designed  to  dllow  a  "transition"  from  a  delo¬ 
calized  molecular  orbital  (LCAO)  behavior  at 
to  a  (unsymmetrized)  Heitler-London  character  at 
large  internuclear  separation  R,  Matthai  and  March 
pointed  out  that  after  extension  of  0.6 R^  from  the 
equilibrium  separation  the  LCAO  MO  is  no  longer 
the  lowest-energy  state,  the  electrons  becoming 
localized.  We  identify  this  "transition"  with  bond 
breaking.  From  Figure  4,  we  see  that  the  situation 
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(a) 


(b) 

FIGURE  3.  Si(111)  surface  with  one  bond  cut;  (b) 
SidOO)  surface  with  breaking  of  two  bonds.  (After  Matthai 
and  March  [8].) 


is  only  quantitatively  different  tor  Si,  a  somewhat 
larger  extension  now  being  required  for  bond 
breaking.  Of  course,  this  treatment  is  restricted  to 
ideal  crystalline  Si:  In  real  materials,  defects  will 
lead  to  important  changes  from  such  ideal  behav¬ 
ior. 

We  turn  to  the  area  of  lattice  dynamics,  where 
some  precise  statements  relating  to  interatomic 
forces  can  be  made. 


FIGURE  4.  Energy  E(z)  as  a  function  of  separation  z 
of  atoms  in  two  surface  planes  of  SI.  Cleavage  force 
F(z)  is  then  to  be  obtained  from  Eq.  (14).  (After  Matthai 
and  March  [8].) 


Phonon  Theory  in  Conducting  Crystals, 
Including  Cleavage  Force  in  Elastic 
Region 

We  start  with  simple  metals  and  adopt  the 
treatment  of  Jones  and  March  [29].  For  pair  poten¬ 
tials,  one  has  an  equivalent  ''rigid-ion''  model  (see 
first  section)  in  which  the  ground-state  density 
p(r)  in  the  presence  of  a  phonon  frozen  into  the 
lattice  is  given  by 

p(r)  =  l]cr(|r  -  I  -  uj),  (15) 

where  I  are  the  direct  lattice  vectors  and  denote 
the  small  displacements  from  the  lattice  positions 
due  to  the  frozen  phonon.  The  spherical  localized 
"blobs"  cr(r),  centered  on  each  nucleus  in  the 
perturbed  lattice,  are  assumed  to  move  "rigidly," 
i.e.,  without  deformation,  as  the  ions  suffer  small 
displacements  from  their  equilibrium  lattice  posi¬ 
tions  Z„. 

Jones  and  March  [29]  showed,  from  the  concepts 
of  DFT,  that  while  the  above  is,  of  course,  approxi¬ 
mate,  although  a  good  approximation  for  bcc  Na 
metal  from  the  earlier  discussion  of  liquid  Na  near 
freezing,  a  vector  rigid-ion  picture  is  formally  ex¬ 
act  for  phonon  theory.  Thus,  one  writes  for  the 
first-order  density  change  p/r)  when  the  ions  are 
moved  by  small  displacements  U;  from  the  sites  /, 
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following  Jones  and  March, 


Pi(r)  =  Eu, 


^p(r) 

dVLi 


(16) 


/ 

This  change  is  given  by  linear  response  theory  as 


Pi(r,  E)  =  /  AV^«(r')F(rr'£)  dr',  (17) 


where  (see,  e.g.,  Stoddart  et  al.  [30])  F  can  be 
calculated  from  the  one-body  periodic  potential 
V(r)  of  DFT  from 


dF 


=  IRe 


Go(rr]E+) 


d'po(rjrE) 

dE 


(18) 


where  Gg  is  the  perfect  lattice  Green  function 
while  pQ(rr'E)  is  a  Dirac  (idempotent)  density  ma¬ 
trix  whose  diagonal  element  po(r)  is  the  exact 
crystal  density. 

One  can  (Jones  and  March  [29])  express  the 
change  in  the  one-body  potential  AV^'Xri)  in  Eq. 
(17)  due  to  "freezing  in  a  phonon"  as 


Ay<»  =  +  /  LJ(rr')pi(r')  dr'.  (19) 


Writing  Eq.  (16)  next  as 

Pi(r)  =  Hu;  •  Ri(r),  (20) 

i 

Jones  and  March  demonstrated  that  R/(r)  =  R(r  - 
/)  and  they  obtained  an  integral  equation  for  R(r) 
from  DFT.  This  was  solved  approximately  for  Al 
and  for  Si  (Claesson  et  al.  [31]).  The  relation  to  the 
"rigid-ion"  model,  or  pair  potential  theory,  was 
fully  discussed  by  Jones  and  March  [29]  and  also 
the  way  phonon  dispersion  curves  can  be  calcu¬ 
lated. 

FERMI  SURFACE  IIN  METALS  AND 

LONG-RANGE  FORCES 

Flores  et  al.  [32]  used  the  above  formulation  to 
treat  the  effect  of  Fermi  surface  topology  on  long- 
range  oscillatory  interactions  in  metals,  generaliz¬ 
ing  the  results  for  simple  sp  metals  discussed 
earlier  in  this  article.  When,  for  instance,  there  are 
necks  on  the  Fermi  surface,  obvious  anisotropic 
effects  arise,  and  in  certain  directions,  an  interac¬ 
tion  energy-oscillatory  function/r,  with  r  the 
range  of  the  interaction,  can  occur,  contrasted  with 
coslkpT/r^  in  simple  metals. 


The  writer  (March  [33])  has  emphasized  a  re¬ 
lated  point,  namely,  that  the  one-body  potential  of 
DFT,  plus  li(rr')  discussed  above,  are  suffi¬ 

cient  to  map  out  the  Kohn  [34]  anomaly  in  the 
lattice  vibrations.  This  will  then  yield  the  correct 
Fermi  surface.  Direct  use  of  the  one-electron  eigen¬ 
values  from  the  solution  of  the  Kohn-Sham  equa¬ 
tions  with  potential  y(r)  does  not,  on  the  other 
hand,  give  the  correct  Fermi  surface,  as  pointed 
out  by  Gunnarsson  and  Schonhammer  [35]. 


ELASTIC  REGIME  OF  CLEAVAGE  FORCE  IN 

TERMS  OF  PHONON  THEORY 

We  return  briefly  at  this  point  to  the  elastic 
regime  of  the  cleavage  force  F(z),  where  F{z)  = 
Az.  The  work  of  Zaremba  [13]  and  Kohn  and 
Yaniv  [14]  gave  A  explicitly  in  terms  of  the  phonon 
dispersion  relations  in  the  perfect  lattice  before 
cleavage  and  this  is  a  very  precise  route  for  the 
calculation  of  the  constant  A,  One  expects  nearly 
free-electron  metals  to  regain  grossly  A  as 

obtained  by  March  and  Paranjape  [25]  as  well  as  in 
the  later  work  of  Streitenberger  [27].  We  stress  that 
all  the  above  is  within  the  context  of  DFT,  al¬ 
though  in  constructing  appearing  in  Eqs. 

(17)  and  (19),  Jones  and  March  emphasized  that 
one  requires  the  "exchange-correlation"  func¬ 
tional  LJ(rr'),  which  transcends  knowledge  of  the 
one-body  potential  of  DFT.  Thus,  to  complete  the 
phonon  calculation,  as  Jones  and  March  [29] 
proved,  one  also  needs  the  further  functional 
derivative 


mr,rO 


5y,,(r) 

Spit')  ' 


(21) 


with  V^Jix)  the  usual  exchange-correlation  poten¬ 
tial  of  DFT.  Claesson  et  al.  [31]  discussed  approxi¬ 
mate  forms  of  F  and  U  for  (a)  Al  and  (b)  Si  and 
the  interested  reader  is  referred  to  their  article  for 
the  detailed  forms.  However,  it  is  noteworthy  that 
a  shell  model  treatment  of  Si  is  equivalent  to  a 
factorization  of  the  response  function  F(r,  r'), 
which,  of  course,  affords  great  simplification. 


Summary  and  Future  Directions 

Different  DFT  approaches  have  been  discussed 
for  treating  interatomic  force  laws  in  condensed 
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metallic  phases  and  for  dealing  with  interplanar 
interactions  in  suitably  cleaved  crystals.  It  has  been 
stressed  in  the  latter  case  that  the  calculations 
presented  here  are  for  "ideal"  materials.  Natu¬ 
rally,  defects  will  have  to  be  incorporated  before 
the  theoretical  predictions  can  be  brought  into 
direct  contact  with  observation. 

Returning  to  the  central  Eqs.  (1)  and  (2),  for 
simple  liquid  metals  such  as  Na  near  freezing, 
contact  has  been  made  between  a  DFT  pair  poten¬ 
tial  calculated  from  first  principles  using  LDA  and 
that  extracted  using  only  classical  statistical  me¬ 
chanics  to  invert  the  measured  liquid  structure 
factor  S(q).  As  Figure  1  and  Table  I  show,  the 
agreement  is  already  rather  remarkable,  although 
some  purely  quantitative  differences  remain.  As 
this  article  was  nearing  completion,  work  by 
Rajagopalan  and  Srivastava  [36]  appeared  on  liq¬ 
uid  Cu.  This  was  used  subsequently  by  March  and 
Ray  [37]  to  compare  it  with  the  density  functional 
calculations  of  Carlsson  et  al.  [21],  who  extracted  a 
pair  potential  for  Cu.  March  and  Ray  [37]  also 
used  the  coordination  number  representation  (2) 
for  Cu  to  extend  the  Carlsson  et  al.  work  on  fee  Cu 
to  give  £(c,  Vq)  also  for  bcc,  sc,  and  diamond 
structures,  but  for  these  latter  cases,  to  our  knowl¬ 
edge,  there  is  not  as  yet  DFT  work  for  comparison. 
For  the  simpler  sp  metal  K,  however,  Rubio  and 
March  [38]  very  recently  brought  the  DFT  calcula¬ 
tions  for  a  variety  of  structures  (in  LDA)  into 
contact  with  the  quantum  chemical  model  (2), 
However,  they  used  semiempirical  representations 
of  the  singlet  and  triplet  potential-energy  curves 
for  the  K2  dimer.  While  the  ^2^  potential-energy 
curve  is  obviously  accessible  through  DFT,  more 
problems  exist  in  calculating  the  triplet  curve  from 
DFT,  but  approaches  such  as  that  set  out  by 
Theophilou  [20]  and  later  developments  should  be 
studied  further  in  the  context  of  the  quantum 
chemical  representation  (2). 

Finally,  phonon  theory  has  been  posed  here  in 
density  functional  language,  with  illustrative  ex¬ 
amples  given  for  Al  and  Si  crystals  and  also  for  the 
cleavage  force  following  [13,  14].  The  Appendix 
treats  size  effects  on  the  cleavage  force  in  nearly 
free-electron  metals  and  the  establishment  of  a 
self-consistent  field  from  the  simplest  DFT  for  fu¬ 
ture  studies  of  such  size  influences.  Atomic  force 
microscopy  may,  perhaps,  in  the  future,  throw 
further  light  on  the  relevance  of  such  studies  to 
real  materials,  e.g..  Si  and  W  [39,  40]. 


Appendix:  Size  Dependence  of 
Cleavage  Force  Between  Parallel 
Planar  Jellium  Metal  Surfaces 


Razavy  et  al.  [41]  very  recently  discussed  size 
effects  in  the  jellium  model  already  referred  to  in 
the  main  text.  As  there,  the  simplest  DFT,  namely, 
the  linearized  Thomas-Fermi  (TF)  model,  was  em¬ 
ployed.  The  force  between  two  parallel  planar  con¬ 
ducting  cylinders  of  infinite  extension  was  consid¬ 
ered  by  these  workers. 

In  such  a  system,  with  cylindrical  symmetry, 
the  main  achievement  was  to  establish  the  self- 
consistent  field  analytically,  by  solving  the  lin¬ 
earized  TF  equation 


dV  d^V 

—  — \-  p  ^ - h  - — 

<?p^  dp  dz^ 


q^V, 


(Al) 


with  q  *  the  TF  screening  length.  Separating  the 
variables  through 

V(p,z)  =  F(p)Z(z)  (A2) 

leads  to  two  differential  equations: 
d^Z 

and 

d^F  ,  dF  ^  ^ 

XT  +  P"  ^  P  + 

dp  dp 


with  p  being  a  real  separation  constant.  Specifying 
regions  I  and  II  as  within  the  two  semi-infinite 
jellium  cylinders,  Razavy  et  al.  [41]  found  in  these 
regions 


V(p.z) 


C(p)cos  pzIo[Q(p)p]  dp,  p  <  R, 


(A5) 


with  R  the  radius  of  the  cylinders  and  Iq  the  usual 
Bessel  function.  To  specify  C(p),  Razavy  et  al.  [41] 
matched  V  in  Eq.  (A5)  to  the  electrostatic  potential 
in  region  III,  where  this  denotes  the  whole  region 
outside  the  two  jellium  semi-infinite  cylinders,  in¬ 
cluding  the  gap  of  width  d  between  them. 

The  energy  stored  in  the  electrostatic  field  was 
then  calculated  as  the  sum  of  two  terms,  A  and 
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A  £2/  and  the  force  F  from 

£1  dL  £2 

f  = - -  + - 

dd  dd 


(A6) 


Razavy  et  al.  showed  that  the  first  term  is  indepen¬ 
dent  of  d,  while  all  the  dependence  of  A  £2  on  d 
can  be  subsumed  within  the  simple  factor 
sin(/cd/2),  with  k  a  Fourier  transform  variable.  It 
would  be  of  obvious  interest  if  this  size  depen¬ 
dence  predicted  by  the  simplest  DFT  could  be 
tested  using  the  atomic  force  microscope  and  work 
along  these  lines  is  in  progress  (see  Allegrini  et  al. 
[40]). 

In  concluding  this  Appendix,  it  is  worth  com¬ 
menting  that  the  work  of  Razavy  et  al.  discussed 
above  makes  available  an  (of  course,  approximate) 
self-consistent  field  as  a  function  of  cleavage  sepa¬ 
ration  d.  In  the  future,  this  may  well  afford  a 
'"zeroth-order"  field  to  use  in  solving  the 
Kohn-Sham  equations  in  cylindrical  symmetry  for 
a  cleaved  jellium  cylinder.  Of  course,  an  LDA 
contribution  to  V(rX  namely,  would  have  to 

be  added,  since  the  TF  approximation  used  by 
Razavy  et  al.  [41]  does  not  include  exchange  and 
correlation. 
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ABSTRACT:  Electronic  structures  and  properties  of  carbides  MC,  M5C  (M  = 

Fe,  Ni,  Cu),  and  TiC  and  JiyC  were  studied  using  density  functional  methods  (DF), 
particularly  the  local  density  approach  (LDA)  with  the  Vosko~Wilk“Nusair  (VWN) 
correlation  functional,  the  generalized  gradient  approximation  (GGA)  of  Becke  and 
Perdew  (using  Becke's  1988  exchange  functional  and  Perdew's  1986  correlation  functional 
— BP86),  and  the  ADF  program  of  Baerends  et  al.  In  the  first  part  of  this  report,  we 
studied  equilibrium  geometries  and  dissociation  energies  for  the  process  MC  M  +  C 
involving  the  doublet  ground  state  for  M  =  Cu,  singlet  and  triplet  states  for  M  =  Ni,  and 
triplet  and  quintuplet  states  for  M  =  Fe,  Ti.  Charge  distributions  by  population  analysis, 
dipole  moments,  and  vibrational  frequencies  were  also  evaluated.  All  calculations  were 
done  using  triple-zeta  basis  sets,  with  frozen-core  orbitals  and  the  GGA  corrections.  In 
the  second  part  of  this  report,  we  consider  the  doublet  states  of  Cu^C  and  singlet  and 
triplet  states  of  Ni5C,  taking  planar  and  nonplanar  models  for  the  M5  clusters.  The  triplet 
and  quintuplet  states  of  Fe5C  were  studied  with  a  nonplanar  model  for  the  Fe5  cluster, 
whereas  in  the  case  of  the  triplet  and  quintuplet  states  of  TiyC,  a  Ti7  planar  model  was 
chosen.  Optimization  of  the  position  of  the  C  atom  in  the  cluster,  dissociation  energies, 
distribution  of  charges  in  the  molecule,  and  dipolar  moments  were  also  analyzed,  and 
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comparisons  with  the  results  obtained  for  the  corresponding  carbides  (MC)  were  made. 
These  results,  together  with  the  ones  of  the  corresponding  carbonyls — that  have 
previously  been  done  by  the  same  methodology — provide  us  with  an  interesting 
comparison  of  the  M — C  bond  dissociation  energy  in  carbides  and  carbonyls.  Calculations 
of  the  C — O  bond  dissociation  energies  for  the  MCO  and  M„CO  compounds  were  also 
performed.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  919-928,  1997 


Introduction 

Carbonyls  and  carbides  (and  also  carbenes)  of 
transition  elements  are  a  very  important  class 
of  compounds  for  several  reasons:  One  of  these  is 
that  carbonyls  are  considered  to  be  the  first  step  in 
the  reaction  of  CO  with  H2,  catalyzed  by  transition 
elements  (Fischer-Tropsch  synthesis).  From  the  in¬ 
dustrial  point  of  view,  this  is  of  paramount  impor¬ 
tance  due  to  the  large  number  of  chemicals  that 
can  be  obtained  from  it. 

The  initial  step  of  the  chain  of  reactions  is  the 
chemisorption  by  materials  which  contain  such 
elements.  The  simplest  model  for  these  materials  is 
one  atom,  which  is  where  the  importance  of  the 
MCO  compounds  lies.  Carbides  (MC)  are  obtained 
by  cleavage  of  the  C — O  bond  from  carbonyls,  and 
carbenes  (MCH2)  are  obtained  by  hydrogenation 
of  the  carbides  or  carbonyls.  In  previous  reports 
[1-3],  we  considered  carbonyls  and  carbenes  of  Ti, 
Ni,  Fe,  and  Cu  using  DFT  methods.  In  this  report, 
we  focus  on  carbides  as  a  complementary  study. 

As  a  further  step,  it  is  natural  to  consider — as 
suitable  models  for  real  materials — clusters  or 
crystals  of  the  elements  involved  and  to  use  the 
appropriate  computational  techniques.  To  compare 
the  results  obtained  in  the  MC  and  M„C  calcula¬ 
tions,  we  must  use  the  same  computational  level 
in  both  cases.  Therefore,  the  cluster  size  cannot  be 
too  large. 

These  calculations  should  provide  accurate  pre¬ 
dictions,  at  least  for  the  equilibrium  molecular 
geometries,  dissociation  energies,  and  vibrational 
frequencies.  For  this  reason,  we  studied — besides 
the  MC  compounds — the  M5C  structure  with 
symmetry  (for  M  =  Ni,  Fe,Cu)  and  TiyC  with 
symmetry  C^^.  They  all  involve  transition-atom 
clusters  with  geometries  taken  from  their  crystals. 
The  transition  elements  considered  in  this  re¬ 
port  (Ti,  Cu,  Ni,  and  Fe)  present  quite  different 
difficulties.  Cu  is  the  simplest  because  of  its  elec¬ 
tronic  structure,  as  compared  to  Ni,  Fe,  and  Ti;  Ni 


is  the  most  difficult.  We  focused  our  interest  in  the 
geometrical  and  energetical  aspects  of  the  systems 
under  consideration. 


Methods 

The  calculations  reported  in  this  article  are  based 
on  the  Kohn-Sham  approach  to  density  functional 
theory  (DFT)  [4],  with  the  methodology  of  Baerends 
et  al.  The  program  used  was  ADF  (version  2.0.1) 
[5]. 

The  density  functional,  also  called  the  ex- 
change-and-correlation  (XC)  functional,  consists  of 
a  local  part  (the  local  density  approach,  LDA)  and 
a  so-called  gradient  correction  (the  generalized 
gradient  approximation,  GGA).  Initially,  the  LDA 
used  for  the  calculation  of  the  exchange  energy 
Ex(LDA)  was  the  one  proposed  by  Slater  [6]  (Xa 
approximation),  but  a  version  that  also  includes  a 
correlation  part — the  one  used  in  this  report — is 
that  proposed  by  Vosko,  Wilk,  and  Nusair  (VWN) 
[7]. 

The  GGA  introduces  terms  depending  on  the 
gradient  of  the  electron  density  that  account  for 
the  inhomogeneities  necessary  to  describe  the 
nonuniform  electron  gas  in  the  case  of  atomic  and 
molecular  systems.  GGA  corrections  are  the  sum 
of  two  terms:  exchange  and  correlation.  The  intro¬ 
duction  of  these,  sometimes  called  nonlocal  correc¬ 
tions,  significantly  improve  the  results  of  the 
calculation  of  several  properties,  such  as  bond 
energies,  molecular  geometries,  and  dipole  mo¬ 
ments. 

We  have  several  possibilities  using  the  ADF 
program  with  the  GGA  correction  calculations.  The 
one  used  in  this  report  is  due  to  Becke  [8]  for  the 
exchange  term  and  that  of  Per  dew  [9]  for  the 
correlation.  The  method  is  called,  accordingly, 
LDA/BP. 

We  used  the  Slater-type  orbital  (STO)  basis  set 
for  the  atomic  orbitals  (AO),  which  are  internally 
stored  in  the  ADF  program.  We  selected  the  one 
called  IV,  which  is  a  triple  basis  set  with  polariza¬ 
tion. 
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Integrals  involving  electronic  density  (such  as 
Coulomb  potentials)  were  evaluated  using  these 
auxiliary  bases  and  then  taking  linear  combina¬ 
tions  of  these  integrals.  The  coefficients  of  these 
linear  combinations  were  evaluated  in  each  SCF 
cycle. 

The  matrix  elements  of  the  overlap  and  Fock 
matrices  were  calculated  by  numerical  integration. 
The  success  of  DFT  methods — such  as  the  one 
implemented  in  ADF — is  due,  among  other  rea¬ 
sons,  to  the  existence  of  accurate  and  very  efficient 
numerical  methods  for  the  calculation  of  integrals 
using  STO  functions  [10]. 

Instead  of  calculating  total  energies,  as  most 
methods  do,  ADF  calculates  bond  energies  accord¬ 
ing  to  Ziegler's  procedure  [11].  These  bond  ener¬ 
gies  are  the  difference  between  the  total  energy  of 
a  molecule  (or  atom)  and  those  of  the  atoms  (or 
fragments)  in  established  reference  states. 

Optimization  of  the  M — C  bond  length  at  the 
LDA/BP  level  was  done  for  all  the  compounds 
considered  in  this  report.  In  all  cases,  the  lowest 
energy  of  the  M  and  C  fragments  were  obtained, 
and  the  adiabatic  M — C  bond  dissociation  energy 
was  calculated  in  every  case  using  the  sum-of- 
fragments  energies  (£5^/)  energy  at  the 

equilibrium  geometry  of  the  MC  compounds.  Mul- 
liken  population  analysis  was  carried  out,  and  the 
charge  distribution  in  the  molecule  and  the  related 
dipole  moment  were  obtained,  as  well  as  the  vi¬ 
brational  frequencies  of  the  normal  modes. 

Regarding  the  M5C  compounds,  we  have  two 
possibilities  for  the  clusters:  Ni  and  Cu  crystals 
have  fee  (face-centered  cubic)  elementary  cells, 
whereas  Fe  has  a  bcc  (body-centered  cubic)  and  Ti 
has  an  hep  (hexagonal  close-packed)  one.  In  the  Ni 
and  Cu  cases,  we  selected  two  cluster  models.  The 
first  one,  called  "planar"  and  symbolized  by  (5, 0), 
had  all  five  atoms  located  on  the  vertices  of  a 
square  and  its  center.  This  would  correspond  to 
the  top  face  of  the  elementary  cell.  The  other, 
called  "hollow"  and  symbolized  by  (4, 1),  was 
formed  by  taking  the  four-center  atoms  of  the 
lateral  faces  of  the  cube  and  the  fifth  atom  on  the 
center  of  the  bottom  face,  thus  forming  a  square 
pyramid  with  the  four  M  atoms  at  the  corners  of 
the  base  and  the  fifth  M  atom  at  the  top.  This  was 
the  only  one  employed  in  the  case  of  Fe.  We  are 
using  the  nomenclature  given  by  Post  and 
Baerends  [12]. 

The  geometry  of  these  clusters  was  taken  from 
experimental  crystal  data.  The  distance  from  the 
center  of  the  square  to  one  of  the  M  atoms  was 


2.54  A  in  the  case  of  Cu,  2.50  A  in  the  case  of  Ni, 

o 

and  2.03  A  in  the  case  of  Fe,  and  for  Ti,  the  center 
of  the  hexagon  was  2.91  A  from  the  other  Ti  atoms. 

The  central  M  atom  in  (5, 0),  which  we  will  call 
Mj,  was  displaced  from  the  M4  plane  a  distance 
equal  to  1.80  A  for  the  CU5  (4, 1)  cluster  and  1.77  A 
for  the  Ni5  (4,1)  cluster.  In  the  case  of  the  Fe5 
cluster,  the  distance  from  Fe^  to  the  Fe4  plane  was 
1.44  A. 

All  calculations  were  carried  out  at  the  unre¬ 
stricted  approximation  level.  We  checked  the  elec¬ 
tronic  configuration  after  SCF  convergence  in  or¬ 
der  to  see  if  the  LUMO  orbital  was  lower  in  energy 
than  was  the  HOMO,  both  for  a  and  (3  spin.  Every 
time  we  found  this  not  to  be  the  case,  we  changed 
the  configuration  accordingly,  verifying  the  de¬ 
crease  in  energy  and  continuing  until  the  lowest 
energy  was  attained.  In  some  cases  of  difficult 
convergence,  we  had  to  resort  to  the  level-shifting 
technique,  increasing  the  energy  of  the  virtual  or¬ 
bitals  a  little  bit  (in  some  cases  up  to  0.5  au). 


Results  and  Discussion 

All  calculations  reported  in  this  article  were 
done  at  the  LDA/BP  level.  It  is  well  known  [13, 
14]  that  predictions  for  molecular  geometries  are 
improved  when  GGA  corrections — such  as 
BP — are  included  in  the  LDA  methodology,  result¬ 
ing  in  values  very  close  to  experimental  ones.  In 
general,  DFT  methods  are  superior  to  Hartree-Fock 
(HF)  ones,  particularly  for  metal-ligand  bond 
lengths. 

CO  CASE 

One  of  the  objectives  of  this  article  was  to  eval¬ 
uate  the  dissociation  energy  of  the  C — O  bond 
when  the  carbonyl  group  is  attached  to  one  or 
several  transition-element  atoms  and  then  to  com¬ 
pare  this  value  to  that  of  the  free  CO  molecule.  In 
that  way,  we  can  rationalize  the  action  that  differ¬ 
ent  metal  catalysts  may  have  on  reactions  that 
involve  the  cleavage  of  that  particular  bond. 

For  the  free  CO  molecule,  the  C — O  bond  en¬ 
ergy  obtained — with  an  optimized  C — O  bond 
distance  of  1.1394  A — was  -0.540653  au.  For  the 
fragments  O  (triplet),  O  (singlet),  and  C  (singlet), 
the  bond  energies  were  -  0,057444,  -  0.00875,  and 
—  0.000206  au,  respectively.  Therefore,  the  sum  of 
the  lowest  bond  energies  for  the  resulting  frag- 
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merits  of  the  dissociation  of  CO  was  [C(l)  + 
0(3)]  =  —  0.057650  au,  which  gave  us  a  value  for 
the  (adiabatic)  dissociation  energy  equal  to  303 
kcal/mol. 

We  found  in  the  literature  [15]  an  experimental 
value  of  11.09  eV  (256  kcal/mol)  for  this  dissocia¬ 
tion  energy  and  a  C — O  bond  distance  of  1.127  A. 
In  [16],  a  calculated  value  of  277  kcal/mol  is 

o 

reported,  as  well  as  a  bond  distance  of  1.127  A. 
They  also  provided  an  experimental  value  of  259 
kcal/mol,  so  our  result  is  about  44  kcal/mol  too 
high  (17%  higher).  As  far  as  bond  distance  is 
concerned,  the  agreement  is  satisfactory. 

We  calculated  values  for  the  CO  dissociation 
energy  and  bond  distance  using  different  bases 
and  DFT  parameterizations.  These  values  were  in 
good  agreement  with  the  ones  just  referred  to. 

In  any  case,  since  we  focus  our  interest  on 
energy  differences  to  be  able  to  assess  the  influ¬ 
ence  of  the  nature  and  number  of  the  different 
metal  clusters  bonded  to  the  CO  molecule  on  the  C 
— O  bond  dissociation  energy,  we  used  in  our 
calculations  the  value  just  quoted  (303  kcal/mol). 


MC  AND  M CO  (M  -  Ti,  Fe,  Ni,  Cu)  CASES 

In  Table  I,  we  show  the  bond  energies  (in  atomic 
units)  for  the  optimized  distances  d(MC)  as  well  as 
the  adiabatic  dissociation  energies  and  electronic 
configurations  of  the  different  carbides  MC.  We 
evaluated  these  dissociation  energies  by  subtract¬ 
ing  from  the  bond  energies  the  ones  corresponding 
to  the  M  and  C  atoms  in  the  configurations  and 
multiplicities  that  yield  the  lowest  energies. 


The  bond  energies  for  the  different  atoms  that 
we  used  to  calculate  the  given  below: 


Doublet  Cu 
Triplet  Ni 
Quintuplet  Fe 
Quintuplet  Ti 
Singlet  C 


(2  3  5/1  3  5Xs'^d^^) 
(2  3  5/1  3  4)(s^^^^) 
(2  3  5/2  3  iXs^d^) 
(2  3  3/1  3  0)(s^d^) 

(1  1  0/1  1  0)(sV^) 


-  0.009972  au 
-0.021278  au 
-0.137846  au 
-0.042003  au 
-0.000206  au 


In  Table  II,  we  provide  values  for  the  equilib¬ 
rium  geometries,  bond  energies,  and  dissociation 
energies  of  the  M — C  bonds  of  the  carbonyls  given 
in  [1]  (for  the  Fe,  Ni,  and  Cu  compounds)  and  [2] 
(for  the  Ti  compounds).  We  also  included,  under 
the  column  A  D^,  the  difference  between  the  values 
of  the  M — C  bond  dissociation  energies  in  the 
carbide  and  the  corresponding  carbonyl.  This  dif¬ 
ference  can  be  easily  shown  to  be  identical  to  the 
difference  between  the  values  of  the  C — O  bond 
dissociation  energy  in  free  CO  and  the  carbonyls,  a 
key  quantity  in  this  report.  This  reduction  is  very 
important,  since  it  shows  the  decrease  in  energy 
needed  to  dissociate  the  carbonyl — i.e.,  to  break 
the  C — O  bond — when  going  from  free  CO  to  CO 
bonded  to  a  metallic  ligand.  It  explains  why  in 
catalytic  processes,  and  especially  in  heteroge¬ 
neous  catalysis  with  transition  elements,  reactions 
whose  rate-determining  step  involve  the  cleavage 
of  that  particular  bond  are  enhanced  compared  to 
their  noncatalyzed  counterparts.  The  correspond¬ 
ing  electronic  configurations  for  the  carbonyls  are 
the  ones  detailed  in  the  publications  just  refer¬ 
enced,  so  we  shall  not  repeat  them  here. 


TABLE  I _ 

MC  compounds. 


Molecule 

d 

E 

De 

Configuration 

^sof 

CuC(2) 

1.799 

-0.154060 

90.3 

(5  5  2/5  4  2) 

-0.010178 

NiC(1) 

1.624 

-0.246783 

141.4 

{5  4  2/5  4  2) 

-0.021484 

NiC(3) 

1.625 

-0.230529 

131.2 

(5  5  2/4  4  2) 

-0.021484 

FeC(3) 

1.565 

-0.386087 

155.6 

(5  4  2/4  4  1) 

-0.138052 

FeC(5) 

1.628 

-0.357984 

138.0 

(5  5  2/4  4  0) 

-0.138052 

TiC(3) 

1.679 

-0.279043 

148.6 

(5  4  0/3  4  0) 

-0.042209 

TiC(5) 

1.829 

-0.244165 

126.7 

(5  4  1/3  3  0) 

-0.042209 

d  -.  M— 
(au). 

C  distance  (A);  E\  bond  energy  (au);  adiabatic  bond  dissociation  energy  (kcal/mol);  sum  of  fragments  energy 
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TABLE  II _ 

MCO  compounds. 


Molecule 

d(MC) 

diCO) 

a  (MCO) 

E 

De 

ADe 

o; 

CuCO(2) 

1.903 

1.160 

136.8 

-0.575241 

15.4 

75 

228.2 

NiCOd) 

1.661 

1.167 

180.0 

-0.659746 

61.4 

80 

223.1 

NiCO(3) 

1.861 

1.164 

142.5 

-0.605833 

27.5 

104 

199.5 

FeCO(3) 

1.698 

1.173 

180.0 

-0.753609 

47.1 

109 

194.6 

FeCO(5) 

1.864 

1.166 

163.7 

-0.741656 

39.6 

98 

204.7 

TiCO(3) 

2.013 

1.171 

180.0 

-0.626791 

27.7 

121 

182.2 

TiCO(5) 

2.034 

1.171 

180.0 

-0.653341 

44.4 

82 

220.7 

d(MC);  M— C  distance  (A);  c/(CO):  C— O  distance  (A);  a(MCO):  MCO  angle  (degrees);  E:  bond  energy  (au);  D^:  M— C  bond 
dissociation  energy  (kcal/mol);  ADgi  difference  between  Dq  values  from  Tables  I  and  II  (kcal/mol);  D^:  C— O  bond  dissociation 
energy  (kcal/mol). 


The  interaction  energies  for  MC  +  O  corre¬ 
spond  to  dissociation  energies  of  the  C — O  bond 
in  the  reverse  reaction,  i.e.,  MCO  ^  MC  +  O. 
These  values  are  obtained  by  adding  the  energy  of 
0(3)  (  ”  0.057444  au)  to  the  one  of  the  correspond¬ 
ing  carbide  and  then  subtracting  the  energy  of  the 
carbonyl,  listed  in  Table  II.  The  optimized  distance 
for  the  carbide  was  used;  therefore,  the  result 
corresponds  to  the  adiabatic  dissociation  energy 
D'e- 

It  is  interesting  to  compare  the  optimized  dis¬ 
tances  M — C  for  the  corresponding  carbides  (Table 
I)  and  carbonyls  (Table  II).  In  every  case,  we  see  a 
reduction  in  that  distance  when  going  from  the 
carbonyl  to  the  carbide.  This  is  to  be  expected, 
since  the  strength  of  the  C — M  bond  increases 
when  only  the  M  atom  is  the  one  bonded  to  the  C 
atom,  compared  to  the  case  when  there  is  also  a  C 
— O  bond  involved.  According  to  Tables  I  and  II, 
this  reduction  goes  from  0.037  A  for  NiC(l)  to 
0.334  A  for  TiC(3). 

In  the  same  fashion,  we  see  a  concomitant  in¬ 
crease  in  the  dissociation  energy  of  the  M — C 
bond  when  going  from  the  carbonyl  to  the  corre¬ 
sponding  carbide,  i.e.,  it  is  more  difficult  to  re¬ 
move  the  C  atom  from  the  metal  surface  when  it  is 
present  as  a  carbide  than  as  a  carbonyl.  This  in¬ 
crease,  which  is  presented  under  the  column  A 
in  Table  II,  also  reaches  its  maximum  at  TiC(3). 

For  NiC,  like  NiCO,  the  singlet  state  turns  out 
to  be  more  stable  than  is  the  triplet.  The  differ¬ 
ences  in  the  distances  d(Ni — C)  are  0.037  A  for  the 
singlet  and  0.236  A  (much  greater)  for  the  triplet. 
Note  that  the  distances  d(Ni — C)  in  the  case  of  the 


carbides  are  almost  the  same,  whereas  they  are 
significantly  larger  for  the  triplet  in  the  case  of  the 
carbonyls. 

The  reduction  in  is  80  kcal/mol  (similar  to 
that  of  CuCO)  for  the  ground  state  (singlet),  but  it 
increases  to  104  kcal/mol  for  the  excited  state 
(triplet).  When  we  go  onto  FeC,  we  see  that — in 
the  same  manner  as  in  FeCO — the  triplet  state  is 
more  stable  than  is  the  quintuplet  and  the  values 
of  AD'  are  109  and  98  kcal/mol,  respectively.  For 
TiC,  we  find  the  triplet  state  to  be  more  stable  than 
is  the  quintuplet,  as  opposed  to  the  corresponding 
carbonyls. 

According  to  these  values,  the  compound  in 
which  the  C — O  bond  is  strongest  (of  the  ones 
considered  here)  is  CuCO(2),  followed  closely  by 
NiCCXD  and  TiCO(5).  On  the  other  hand,  it  is 
weakest  in  TiCO(3),  followed  by  FeCC)(3).  If  we 
consider  ground  states  only,  then  the  weakest 
C — O  bond  is  the  one  in  FeCO(3),  followed  by 
TiCO(5)  and  NiCO(l).  The  strongest  bond  is  still 
that  of  CuCO(2). 

In  Table  III,  we  tabulate  the  charge  transfer, 
dipole  moment,  and  overlap  for  the  carbide  species, 
obtained  via  a  Mulliken  population  analysis  of  the 
different  compounds  listed.  Appropriate  values  for 
the  carbonyls  are  also  included.  We  observe  an 
important  charge  transfer  from  the  metal  atoms  to 
carbon,  which — for  the  ground  states — oscillates 
between  0.29  au  for  NiC(l)  and  0.40  au  for  TiC(3). 
These  values  are  always  slightly  higher  for  the 
corresponding  excited  states.  The  same  order  is 
obeyed  by  the  dipole  moments,  as  is  to  be  ex¬ 
pected.  When  compared  to  the  carbonyls,  we  see 
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TABLE  III _ 

MC  compounds. 


Molecule 

Q 

q' 

/A 

s 

S' 

ni) 

nr) 

CuC(2) 

0.3164 

0.1032 

2.433 

0.37 

0.28 

621  (0.5) 

386  (7) 

NiCd) 

0.2868 

0.1376 

1.983 

0.43 

0.42 

928  (25) 

617  (0.2) 

NiC(3) 

0.3248 

0.1132 

2.752 

0.43 

0.33 

877  (26) 

460  (9) 

FeC{3) 

0.3412 

0.1939 

2.471 

0.45 

0.40 

962  (57) 

574  (26) 

FeC(5) 

0.3977 

0.1896 

3.013 

0.46 

0.37 

887  (42) 

462  (6) 

TiC(3) 

0.3969 

0.2951 

3.151 

0.51 

0.33 

982  (83) 

431  (10) 

TiC(5) 

0.4082 

0.2675 

3.265 

0.37 

0.34 

865  (52) 

402  (25) 

q:  Atomic  charge  transferred  from  the  M  to  the  C  atom;  fi:  dipole  moment  (D);  S:  total  M—C  overlap;  f:  vibrational  frequency 
(cm“^);  /:  intensity  (km /mol).  Primed  quantities  refer  to  the  corresponding  carbonyl. 


that  the  charge  transfer  is  fairly  smaller  than  in  the 
carbide  case,  but  follows  essentially  the  same  or¬ 
der. 

In  the  same  table,  we  display  the  vibrational 
frequencies  and  intensities  of  the  M — C  bonds  for 
the  carbides  MC  (/)  and  for  the  carbonyls  MCO 
(/')  (the  latter  were  taken  from  [1,  2]),  We  can 
infer  from  them  that  there  is  a  marked  increase  in 
the  vibrational  frequency  of  the  M — C  stretching 
mode  when  going  from  the  carbonyl  to  the  carbide 
(which  goes  from  235  cm"^  for  CuC(2)  to  551 
cm“^  for  TiC(3),  concomitant  with  a  reduction  in 
the  M — C  bond  distance. 

For  the  ground  states  of  NiC,  FeC,  and  TiC,  the 
frequencies  are  fairly  close  to  one  another  and 
significantly  greater  than  that  of  CuC.  This  denotes 


stronger  M — C  bonds  in  those  compounds,  in 
agreement  with  the  dissociation  energies  given  in 
Table  I.  The  overlaps  follow  a  similar  trend. 

In  the  case  of  TiC,  comparing  the  triplet  ground 
state  to  the  excited  quintuplet,  we  see  that  the 
vibrational  frequency  of  the  M — C  bond  in  the 
first  is  significantly  larger  than  in  the  second.  This 
is  also  true  for  the  dissociation  energies  (Table  I). 

M^C  AND  M^CO  (n  =  5  FOR  M  =  Fe, 

Ni,  Cu;  II  =  7  FOR  M  =  Ti)  CASES 

In  Table  IV,  like  in  Table  I,  we  show  the  equilib¬ 
rium  geometries  and  bond  energies  of  the  carbide 
species  with  the  C  atom  in  the  field  of  the 
clusters  as  well  as  their  adiabatic  dissociation  ener- 


TABLE  IV _ 

compounds. 


Molecule 

d 

E 

De 

Configuration 

^sof 

Cu5C(2)(5,0) 

1.750 

-0.393986 

93.7 

(11  3  7  4  25/11  3  7  4  24) 

-0.244523 

Cu5C(2)(4,1) 

0.05 

-0.425183 

157.4 

(12  3  7  4  24/11  3  7  4  24) 

-0.174142 

Ni5C(l)(5,0) 

1.625 

-0.606384 

145.8 

(11  3  6  3  24/10  3  6  3  24) 

-0.373782 

Ni5C(1)(4,1) 

-0.050t 

-0.587068 

183.8 

(11  2  7  4  23/10  3  6  4  24) 

-0.293903 

Ni5C(3)(5,0) 

1.65 

-0.611966 

149.3 

(11  2  7  4  24/11  3  6  4  22) 

-0.373782 

Ni5C(3)(4,1) 

-o.iot 

-0.595919 

189.4 

(10  3  7  4  24/10  3  6  4  23) 

-0.293903 

Fe5C(3)(4, 1) 

0.50 

-1.274801 

186.0 

(11  2  4  5  21/10  2  3  6  20) 

-0.978150 

Fe5C(5)(4, 1) 

0.40 

-1.281033 

189.9 

(11  2  4  5  22/10  2  3  5  20) 

-0.978150 

Ti7C(3)(7,0) 

1.82 

-1.086318 

129.9 

(10  1  3  3  1  5  13/10  1  3  3  16  10) 

-0.878152 

Ti7C(5)(7,0) 

1.79 

-1.101740 

139.5 

(10  1  3  3  16  13/10  1  3  2  16  10) 

-0.878152 

d:  — C  distance  (A);  t.  negative  distances  indicate  central  atom  and  C  atom  are  on  the  same  side  of  the  M4  plane;  E;  bond 

energy  (au);  bond  dissociation  energy  (kcal/mol);  Eg^^:  sum  of  fragments  energy  (au). 
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gies.  To  calculate  these  dissociation  energies,  one 
must  know  the  bond  energies  for  the  M„  clusters 
in  their  configurations  of  minimum  energy,  which 
are  listed  below: 


Cu5(5,0) 

(10  3  7  4  24/10 

3  7  4 

23) 

CujCil) 

(10  3  7  4  24/10 

3  7  4 

23) 

Ni5(5,0) 

(9  3  7  4  22/9 

2  7  4 

23) 

Ni5(4,l) 

(9  3  7  4  22/9 

2  7  4 

23) 

Fe5(4,l) 

(8  3  3  6  22/10 

1  3  6 

18) 

Ti7(7,0) 

(9  1  3  3  16  12/8  1  ■ 

5  2  16 

10) 

-0.244317  au 
-0,173936  au 
-0,373576  au 
-0.293697  au 
—  0.977944  au 
-0.878946  au 


Adding  to  these  energies  the  one  of  the  C  atom 
(which  has  already  been  quoted  as  —  0.000206  au), 
we  get  the  sum  of  fragments  energy  shown  in 
Table  IV.  Similarly,  in  Table  V,  we  show  the  equi¬ 
librium  geometries,  bond  energies,  and  dissocia¬ 
tion  energies  of  the  M„ — C  bond  for  the  carbonyls, 
which  have  also  been  given  in  [1]  for  M  =  Fe,  Ni, 
Cu  and  in  [2]  for  M  =  Ti.  These  are  obtained  from 
the  sum  of  fragments  energy  given  above  and  the 
energy  of  CO  at  the  equilibrium  distance  (1.139  A), 
which  is  -  0.540653  au.  We  also  provide  values  for 
the  sum  of  fragments  (M„  and  CO)  energy, 
but  do  not  include  the  corresponding  electronic 
configurations  because  they  have  already  been 
given  in  prior  reports,  the  only  exception  being 
those  of  the  Ti  clusters.  These  are  (11  1  3  3  18 
12/11  1  3  3  18  10)  and  (11  1  3  3  18  13/11  1  3  2  18 
20)  for  Ti7CC)(3)(7, 0)  and  TiyCO(5)(7,0),  respec¬ 
tively.  Note  that  the  values  reported  in  this  table 
were  calculated  after  optimization  of  the  C — O 
distance,  unlike  in  earlier  publications  [1]. 


CujC  Case 

Let  us  consider  first  the  CugC  cluster  and  its 
corresponding  carbonyl  in  both  (5,0)  and  (4,1) 
geometries. 

CU5CO  is  more  stable  in  the  geometry  (5, 0)  by 
47  kcal/mol,  whereas  CU5C  is  more  stable  in  the 
(4,1)  geometry  by  20  kcal/mol.  The  Cu^ — C  dis¬ 
tance  in  CU5C  (5,0)  is  1.750  A,  0.049  A  smaller 
than  in  CuC.  The  dissociation  energy  of  the  CU5C 
bond  is  93.7  kcal/mol,  3.4  kcal/mol  greater  than 
in  the  former  case.  In  this  way,  we  see  that  the 
interaction  of  CuC  with  the  square  ring  CU4  brings 
about  a  strengthening  of  the  C — CU5  bond. 

When  compared  to  the  corresponding  car¬ 
bonyls,  we  see — in  the  (5, 0)  case — a  reduction  in 
the  Cu — C  distance:  from  0.225  to  0.104  A,  which 
shows  a  greater  interaction  between  the  C  atom 
and  the  CU5  group  than  between  the  C  and  the  Cu 
atoms.  The  adiabatic  dissociation  energy  of  the  C 
— O  bond,  D^,  is  concomitantly  reduced  with  re¬ 
spect  to  the  CuCO  case  (from  228  to  215  kcal/mol). 

In  the  case  of  the  (4, 1)  cluster,  we  observe  an 
equilibrium  distance  of  0.05  A  to  the  center  of  the 
square  (on  the  opposite  side  relative  to  the  C 
atom),  and  since  the  Cu^  atom  is  1.80  A  away  from 
the  CU4  plane,  the  distance  Cu^ — C  is  1.85  A,  just 
a  little  greater  than  the  one  in  CuC  (1.80  A,  Table 
I).  However,  here  we  have  the  influence  of  the  CU4 
ring,  which  increases  considerably  the  dissociation 
energy  of  the  Cug — C  bond  with  respect  to  the 
(5, 0)  case,  going  from  93.7  to  157.4  kcal/mol.  Thus, 
as  the  CU4  ring  approaches  the  C  atom,  the  inten¬ 
sity  of  the  C-cluster  bond  is  considerably  enhanced 


TABLE  V _ 

M„CO  compounds. 


Molecule 

d 

d(CO) 

E 

^sof 

Oe 

D’e 

ADe 

DIff. 

Cu5CO(2)(5,0) 

1.975 

1.160 

-0.794584 

-0.784970 

6.03 

215.3 

87.8 

12.9 

Cu5CO(2)(4,  1) 

1.70 

1.15 

-0.719414 

-0.714589 

3.03 

148.6 

154.5 

79.7 

Ni5CO(1)(5,0) 

1.70 

1.17 

-0.950188 

-0.914229 

22.55 

179.7 

123.4 

43.4 

Ni5CO(1)(4,1) 

0.02 

1.23 

-0.876046 

-0.834350 

26.16 

145.3 

157.8 

77.8 

Ni5CO(3)(5, 0) 

1.75 

1.17 

-0.958880 

-0.914229 

28.02 

181.6 

121.4 

17.8 

Ni5CO(3)(4,1) 

0.05 

1.22 

-0.856103 

-0.834350 

13.65 

127.2 

175.9 

72.2 

Fe5CO(3)(4,1) 

1.05 

1.22 

- 1 .547066 

-1.518597 

17.86 

134.8 

168.3 

59.8 

Fe5CO(5)(4, 1) 

0.90 

1.25 

-1.592293 

-1.518597 

46.24 

159.3 

143.8 

45.4 

Ti7CO(3)(7,0) 

2.00 

1.16 

-1.472449 

-1.419599 

33.16 

206.3 

96.8 

-24.1 

Ti7CO(5)(7,0) 

2.05 

1.19 

-1.484859 

-1.419599 

40.95 

204.4 

98.7 

16.4 

d\  Mi — C  distance  (A);  d(CO):  C— O  distance  (A);  E:  bond  energy  (au);  E^Qf:  sum  of  fragments  energy  (au);  D^:  — C  bond 
dissociation  energy  (kcal/mol);  D^:  C— O  bond  dissociation  energy  (kcal/mol);  Diff.:  difference  between  AD^  values  between 
Tables  II  and  V  (kcal/mol). 
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when  going  from  (5, 0)  to  (4, 1).  As  for  the  dissocia¬ 
tion  energy  of  the  C — O  bond  (Dp,  it  shows  the 
opposite  change:  it  gets  reduced  from  215  to  149 
kcal/moL 

IXigC  Case 

Let  us  now  consider  the  Ni5C  case.  Here,  we 
have  four  different  cases  to  analyze:  singlet  and 
triplet  combined  with  (5, 0)  and  (4, 1)  geometries. 
According  to  Table  V,  the  triplet  state  is  more 
stable  than  is  the  singlet  by  3.5  kcal/mol  for  the 
(5,0)  case  and  by  5.6  kcal/mol  for  the  (4,1)  case. 
This  differs  from  the  NiC  case,  where  the  singlet  is 
more  stable  than  is  the  triplet  by  10.2  kcal/mol. 

The  dissociation  energies  for  the  bonds 
Ni — C  and  Ni^ — C  show  tendencies  similar  to  the 
ones  of  Cu.  For  the  singlet,  it  increases  slightly 
(from  141  to  146  kcal/mol)  in  the  (5,0)  case  and 
strongly  (to  184  kcal/mol)  in  the  (4, 1)  case. 

For  the  triplet,  we  have  similar  values,  amount¬ 
ing  to  18.1  and  58.2  kcal/mol  increments.  Thus, 
the  interaction  of  the  Ni4  ring  with  the  NiC  car¬ 
bide  is  similar  to  that  of  the  Cu  case. 

Now,  let  us  look  at  the  corresponding  car¬ 
bonyls.  In  the  case  of  NiCO,  the  singlet  is  more 
stable  than  is  the  triplet  by  34  kcal/mol,  according 
to  Table  II.  In  the  case  of  Ni5CO,  the  singlet  is  still 
more  stable  in  the  (4, 1)  case,  but  the  triplet  is  the 
more  stable  one  when  the  structure  is  of  the  (5, 0) 
type.  It  is  observed,  then,  that  in  going  from  NiCO 
to  Nig  CO,  the  difference  in  energy  between  the 
two  multiplicities  is  reduced,  being  inverted  in  the 
(5, 0)  case. 

The  reduction  of  the  C — O  bond  dissociation 
energy  amounts  to  43  and  78  kcal/mol  in  the 
case  of  the  singlet  when  going  from  NiCOd) 
to  Ni5CO(5, 0)  and  NigCO(4, 1),  respectively.  In 
the  triplet  case,  we  observe  reductions  of  18 
and  72  kcal/mol  when  going  from  NiCO(3)  to 
NigCO(3)(5, 0)  and  Ni5CO(3)(4, 1),  respectively. 

In  [17],  values  are  reported  for  compounds  simi¬ 
lar  to  the  ones  that  are  the  subject  of  this  study, 
but  of  the  form  Ni(5,4)  and  Ni(4, 5)  as  well  as 
Ni(5,0)  (in  the  nomenclature  we  are  using).  For 
Ni(4,5),  distances  in  the  order  of  0.1-0.3  A  are 
mentioned,  not  too  different  from  the  values  found 
in  the  present  report  (0.05  and  0.10  A,  singlet  and 
triplet,  respectively).  As  for  the  Ni„ — C  bond  dis¬ 
sociation  energies,  they  report  a  calculated  value 


of  265  kcal/mol  and  an  experimental  one  of  170 
kcal/mol.  The  latter  is  in  excellent  agreement  with 
our  values  (184  kcal/mol  for  the  singlet  and  189 
kcal/mol  for  the  triplet).  However,  in  the  (5,0) 
case,  their  calculated  value  for  the  Nii — C  distance 
is  1.18  A,  substantially  smaller  than  ours  (1.625  A), 
and  a  bond  dissociation  energy  of  117  kcal/mol, 
whereas  our  values  are  146  kcal/mol  for  the  sin¬ 
glet  and  149  kcal/mol  for  the  triplet, 

Fe^C  Case 

In  the  case  of  FegC,  we  have  only  the  (4,1) 
geometry  for  the  cluster  and  the  triplet  and  quin¬ 
tuplet  states.  Regarding  the  carbide  species,  we  see 
that — according  to  Table  IV — the  quintuplet  is 
more  stable  than  is  the  triplet  by  3.9  kcal/mol, 
unlike  the  FeC  case,  where  the  triplet  is  more 
stable  than  is  the  quintuplet  by  17.6  kcal/mol.  This 
shows  the  great  influence  that  the  Fe4  ring  has 
over  the  FeC  species,  which  is  also  reflected  by  the 
fact  that  the  C  atom  is  almost  in  the  Fe4  plane. 

As  in  prior  cases,  let  us  consider  the  influence 
that  the  Fe4  group  has  over  the  Feg — C  bond 
dissociation  energy  in  the  carbides  and  over  the  C 
— O  bond  in  the  carbonyls.  We  observe  an  incre¬ 
ment  in  the  former  energy  in  the  case  of  the  triplet, 
going  from  155.6  (FeC)  to  186.0  kcal/mol  (FegC). 
In  the  case  of  the  quintuplet,  the  corresponding 
change  is  from  138.0  to  189.9  kcal/mol.  Once  again, 
the  strengthening  of  the  Fe — C  bond  by  the  action 
of  the  Fe4  ring  is  clearly  manifested. 

If  we  focus  on  the  C — O  bond  dissociation  en¬ 
ergy,  we  infer — from  Tables  II  and  V — that  in  the 
case  of  the  triplet  it  diminishes  by  59.8  kcal/mol. 
In  the  case  of  the  quintuplet,  the  corresponding 
reduction  is  45.4  kcal/mol.  The  influence  of  the 
Fe4  ring  on  the  FeCO  species  is  found  to  strengthen 
the  interaction  of  the  Fe  atom  with  the  metallic 
cluster  and  weaken  the  C — O  bond. 

In  [18],  the  compound  FegC(CO)ig  was  consid¬ 
ered.  It  has  three  carbonyl  groups  per  Fe  atom,  and 
it  is  the  first  carbide  cluster  reported  in  the  litera¬ 
ture,  being  the  most  similar  one  to  our  FegC.  The 
only  information  that  is  provided  about  this  car¬ 
bide  is  the  distance  from  the  C  atom  to  the  Fe4 
plane,  which  is  estimated  as  being  0.08  A.  Our 
value  is  0.40  A,  which  is  much  larger,  but  one 
should  keep  in  mind  the  differences  between  the 
two  compounds. 
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Ti^C  Case 

Lastly,  let  us  consider  the  Ti  carbides  and  car¬ 
bonyls.  Here,  we  focused  only  on  the  (7, 0)  geome¬ 
try  of  the  cluster.  From  Table  IV,  we  see  that  in  the 
case  of  Ti7C  the  quintuplet  is  more  stable  than  is 
the  triplet  by  9.6  kcal/mol.  This  is  quite  the  oppo¬ 
site  of  TiC,  where  the  triplet  is  more  stable  by  21.9 
kcal /mol,  as  in  the  Fe  cases.  When  we  move  onto 
the  corresponding  carbonyl  compounds,  we  find 
the  quintuplet  to  be  more  stable  in  both  cases,  the 
differences  being  7.8  kcal/mol  for  TiyCO  and  16.7 
kcal/mol  for  TiCO. 

In  the  case  of  TiyCfS),  which  is  the  ground  state, 
there  is  a  12.8  kcal/mol  increment  with  respect  to 
the  (excited)  state  TiC(5).  This  change  is  similar  to 
those  already  seen  in  planar  cases. 

However,  if  we  consider  TiyCO)  (excited  state), 
there  is  a  18.7  kcal/mol  reduction  in  the  Ti — C 
bond  dissociation  energy.  This  is  an  anomalous 
case,  since  it  means  that  the  interaction  of  the  Ti^ 
ring  with  the  TiC  carbide  strengthens  the  C-metal 
bond  for  the  ground  state,  but  not  for  the  excited 
state. 

We  have  a  similar  situation  for  the  C — O  bond 
in  the  case  of  the  carbonyls.  In  the  quintuplet,  the 
C — O  bond  dissociation  energy  decreases  by  16.3 
kcal/mol  when  going  from  TiCO(5)  to  Ti7CCX5). 
In  this  case,  both  states  are  ground  states.  In  the 
triplet,  we  have  again  an  anomalous  case,  since  the 
C — O  bond  dissociation  energy  increases  by  24.1 
kcal/mol. 

When  we  concentrate  on  the  values  of  charge 
transfer,  we  see  that  in  the  Cu  and  Ni  compounds 
there  is  a  marked  increase  (as  shown  in  Table  VI) 
when  going  from  the  (5, 0)  to  the  (4, 1)  geometry, 
whereas  a  reduction  is  observed  in  the  dipole 
moments.  In  the  (5, 0)  and  (7, 0)  cases,  this  charge 
transfer  is  similar  to  those  of  the  carbides  MC, 
listed  in  Table  III. 

For  the  charge  distribution  analysis,  we  will 
consider  a  typical  case  only.  Choosing  CU5C  as  an 
example,  a  population  analysis  gives  us  for  the 
atoms  on  the  vertices  of  the  square  a  positive 
charge  of  0.1030  au,  whereas  that  on  the  central 
one  is  -0.1586  au.  This  leaves  a  total  charge  for 
the  cluster  of  0.2535  au,  which  is  the  net  electronic 
charge  transferred  to  the  C  atom.  In  the  (4, 1)  case, 
each  Cu  atom  on  a  vertex  carries  a  positive  charge 
not  too  different  from  that  of  the  previous  case 
(0.1126  au),  but  the  fifth  Cu  atom  now  bears  a 
positive  charge  as  well  (0.1278  au),  which  results 


TABLE  VI _ 

M„C  compounds. 


Molecule 

q 

Cu5C(2)  (5, 0) 

0.2535 

1.335 

Cu5C(2)(4,1) 

0.5782 

0.313 

NigCd)  (5,0) 

0.3494 

1.781 

Ni5C(1)(4,1) 

0.6035 

0.403 

NigCO)  (5, 0) 

0.3208 

1.565 

Ni5C(3)(4,1) 

0.5695 

0.142 

FegCO)  (4, 1) 

0.4786 

0.868 

FegCCS)  (4, 1) 

0.5507 

0.755 

Ti^CO)  (7,0) 

0.4049 

0.988 

Ti7C(5)  (7,0) 

0.4141 

0.935 

q:  Electronic  charge  transferred  from  M„  to  C  (au).  f. 
moment  (D). 

i:  dipole 

in  a  larger  positive  charge 
au). 

on  the  duster  (0.5782 

Conclusions 


1.  For  the  carbides  considered  in  this  report,  the 
most  stable  states  are  the  singlet  for  NiC  and 
the  triplets  for  FeC  and  TiC,  which  is  quite 
similar  to  the  corresponding  carbonyl  com¬ 
pounds,  except  for  TiCO,  where  the  quin¬ 
tuplet  is  more  stable  than  is  the  triplet. 

2.  M — C  bond  lengths  are  significantly  reduced 
when  going  from  the  carbonyls  to  the  corre¬ 
sponding  carbides,  whereas  bond  dissocia¬ 
tion  energies  increase  in  like  manner. 

3.  There  is  a  sizable  reduction  in  the  dissocia¬ 
tion  energy  of  the  C — O  bond  of  the  car¬ 
bonyls  with  respect  to  the  free  CO  molecule. 

4.  In  the  case  of  the  M — C  carbides,  we  observe 
an  important  charge  transfer  from  the  metal 
atom  to  the  C  atom,  which  varies  between 
0.30  and  0.40  electronic  charge  units. 

5.  The  stretching  frequencies  for  the  M — C 
bonds  of  the  MC  carbides  increase,  as  is  to  be 
expected,  in  the  same  order  as  bond  dissocia¬ 
tion  energies.  These  values  are  considerably 
higher  than  the  ones  of  the  corresponding 
carbonyls. 

6.  Considering  now  the  M„C  carbides,  we  find 
the  following  results:  First,  for  CU5C,  the 
geometry  of  the  (4, 1)  cluster  is  more  stable 
than  is  the  (5, 0)  one,  as  opposed  to  the  car- 
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bonyl  case.  Ni5C,  however,  shows  higher  sta¬ 
bility  for  the  (5, 0)  geometry,  and  at  the  same 
time,  the  triplet  becomes  slightly  more  stable 
than  is  the  singlet.  NiC  displays  the  opposite 
behavior.  In  the  case  of  Ve^C,  the  quin¬ 
tuplet  is  more  stable  than  is  the  triplet,  in  a 
similar  way  to  that  of  the  corresponding  car¬ 
bonyls  and  unlike  TiC,  where  the  triplet  is 
more  stable  than  is  the  quintuplet.  For  Ti7C, 
and  also  for  TiyCO,  the  quintuplet  is  more 
stable  than  is  the  triplet.  We  find  the  oppo¬ 
site  order  for  TiC. 

7.  Looking  now  at  the  dissociation  energy  of 
the  C — O  bond  in  the  M„CO  compounds,  we 
notice  a  reduction  with  respect  to  the  corre¬ 
sponding  MCO  compounds.  This  reduction 
is  moderate  for  the  planar  geometries,  but 
significantly  greater  for  the  nonplanar  ones. 
This  means  that  the  C — O  bond  is  weakened 
considerably  by  the  influence  of  the  M4  group 
(M  =  Cu,Ni,Fe).  In  the  case  of  TiyCO,  we 
find  a  similar  reduction  in  the  C — O  bond 
energy  (compared  to  TiCO),  but  this  is  only 
so  in  the  ground  quintuplet  state,  since  in  the 
excited  triplet  state,  we  find  an  increase  in 
that  energy. 
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Introduction 

For  the  past  decade,  the  fabrication  of  ultra¬ 
small  devices  and  associated  developments 
in  laser  technology  for  exploration  of  their  proper¬ 
ties  have  raised  important  theoretical  questions, 
both  conceptual  and  practical  In  particular,  the 
short  times  sampled  in  femtosecond  laser  studies 
and  the  wealth  of  potential  information  from  dif¬ 
ferent  initial  preparations  call  for  a  description  of 
Coulomb  kinetics  (e.g.,  electrons  and  holes)  far 
from  equilibrium,  extending  from  initial  times  to 
times  long  compared  to  the  characteristic  collision 
times  and  dephasing  times  [1,2].  Such  theoretical 
considerations  already  have  been  explored  in  other 
contexts,  such  as  transport  in  simple  classical  liq¬ 
uids  and  plasmas  [3]  and  for  nuclear  matter  [4], 
but  their  experimental  relevance  in  these  cases  has 
been  less  compelling.  The  objective  here  is  to  for¬ 
mulate  the  problem  of  Coulomb  kinetics  in  terms 
of  the  fundamental  exact  first  two  hierarchy  equa¬ 
tions  for  the  one-  and  two-particle  reduced  density 
operators.  Practical  applications  result  from  an 
approximate  closure  of  the  two-particle  equation 
subject  to  important  constraints  on  acceptable  ap¬ 
proximations  associated  with  conservation  laws, 
representability,  stationary  solutions,  and  quan¬ 
tum  statistics.  These  structural  properties  are  inde¬ 
pendent  of  considerations  of  classical  or  quantum 
effects  and  do  not  rely  on  the  matrix  representa¬ 
tion  appropriate  to  the  specific  problem.  The  theo¬ 
retical  analysis  of  a  given  approximation  occurs  at 
the  compact  abstract  level  for  greatest  simplicity 
and  generality  and  in  a  formulation  that  allows 
exploitation  of  previous  work  on  classical  systems. 

This  approach  is  illustrated  for  a  simple  two- 
band  model  of  a  semiconductor,  to  show  how 
corrections  to  the  semiconductor  Bloch  equations 
can  be  constructed  to  include  the  effects  of  scatter¬ 
ing  and  dynamic  screening.  In  this  discussion,  only 
electron-electron  and  electron-laser  interactions 
are  considered;  the  lattice  is  presumed  to  be  rigid. 
A  simple  closure  approximation  including  exact 
three-particle  exchange  correlations  and  all  resid¬ 
ual  pair  correlations  is  indicated  and  its  content 
discussed.  Solution  of  the  two-particle  equation 
gives  the  two-particle  reduced  density  operator  as 
a  functional  of  the  one-particle  density  operator. 
Use  of  this  result  in  the  first  hierarchy  equation  for 
the  one-particle  density  operator  gives  a  closed 


kinetic  equation.  The  resulting  non-Markovian  ki¬ 
netic  equation  describes  the  evolution  of  an  arbi¬ 
trary  initial  preparation  from  asymptotically  short 
times  to  the  long-time  Boltzmann  limit.  The  short 
time  evolution  includes  the  buildup  of  dynamical 
screening  (polarization  effects);  at  long  times,  the 
Boltzmann  scattering  rates  are  determined  from 
the  full  T-matrix  rather  than  from  the  weak  cou¬ 
pling  Born  approximation. 

In  this  brief  presentation,  only  the  theoretical 
structure  and  method  are  described  as  an  overview 
of  the  general  approach  to  carrier  dynamics.  More 
detailed  calculations  based  on  the  approximation 
suggested  here  are  in  progress. 


Reduced  Density  Operators  and 
Hierarchy 

We  consider  N  electrons  interacting  with  a  rigid 
ionic  lattice,  with  overall  charge  neutrality.  The 
electrons  interact  with  the  lattice  and  with  each 
other  via  bare  Coulomb  potentials.  In  addition, 
they  interact  with  a  classical  (i.e.,  many  photon) 
transverse  electric  field,  E(0  via  a  dipole  interac¬ 
tion.  The  Hamiltonian  for  the  system  is  then 

N 

Hit)  =  Ho  +  U+  Ho  =  E  (1) 

i=l 

I  N 

H,,,  =  -At-ECO,  (2) 

where  h^ii)  is  the  single-particle  Bloch  Hamilto¬ 
nian  for  interaction  of  electron  i  with  the  lattice, 
V{i,  j)  is  the  Coulomb  potential  for  interaction  of 
electrons  z,  ;,  and  jx  =  ixii)  is  the  total  dipole 
moment  due  to  all  electrons.  The  external  field 
E(0  characterizes  the  laser  field,  which  may  have 
a  very  short  time  scale  in  some  pump  experiments, 
but  may  also  represent  a  longer  time  driving  field 
with  a  duration  comparable  to  various  dephasing 
mechanisms.  The  details  of  this  field  are  not  im¬ 
portant  for  the  formal  considerations  here,  but  can 
play  a  role  in  the  numerical  method  used  to  solve 
the  equations.  The  eigenvalue  problem  for  the 
Bloch  Hamiltonian  is  described  by 

=  e„(k)i/<„(k),  (3) 

where  a  denotes  the  band  index,  and  k,  the  mo¬ 
mentum  quantum  number  (here  and  in  the  follow¬ 
ing  we  set  h  =  1). 
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The  initial  state  of  the  system  is  defined  by  an 
N-particle  density  operator,  p(l, . . . ,  N ).  This  could 
be  the  Gibbs  distribution  if  the  electron-lattice 
system  is  initially  in  equilibrium,  but  it  is  not 
necessary  to  specify  its  form  at  this  point.  Im¬ 
plicit  in  the  definition  of  the  density  matrix  is  an 
N-particle  symmetrization  operator  so  that  a  repre¬ 
sentation  of  its  matrix  elements  in  terms  of  unsym¬ 
metrized  products  of  single  particle  states  is  possi¬ 
ble  [5].  This  is  an  important  feature  that  will  be 
exploited  in  the  next  section  to  extract  the  ex¬ 
change  correlation  of  the  reduced  density  opera¬ 
tors  associated  with  p{l, ...,  N).  The  time  evolu¬ 
tion  of  the  density  operator  is  governed  by  the 
Liouville-von  Neumann  equation 

^,p(t)  +  f[H(0,p(0]  =0.  (4) 

The  reduced  density  operators  for  tn  particles  are 
defined  by 

/(-)(!, . . . ,  m;  f )  =  . ^p(0.  (5) 

The  notation  on  the  right  side  indicates  a  trace 
over  the  degrees  of  freedom  associated  with  parti¬ 
cles  m  -h  1  through  N.  More  precisely,  in  matrix 
representation,  it  denotes  summation  over  the  di¬ 
agonal  quantum  numbers  for  these  particles.  The 
reduced  density  operators  inherit  an  M-par- 
ticle  symmetrization  operator  from  the  corre¬ 
sponding  N-particle  symmetrization  operator  in  p. 
The  BBGKY  (Born,  Bogoliubov,  Green,  Kirkwood, 
Yvon)  hierarchy  equations  [5,6]  for  the  time  de¬ 
pendence  of  the  reduced  distribution  functions 
now  follow  directly  from  a  partial  trace  of  the 
Liouville-von  Neumann  equation, 

m 

!=1 

m 

m 

=  -  E  Tr„+^ilVU,  m  +  1), 

i=l 

+  (6) 

The  left  side  of  this  equation  is  just  the 
Liouville-von  Neumann  equation  for  m  isolated 
particles.  The  right  side  expresses  a  coupling  to  the 


other  degrees  of  freedom  through  the  Coulomb 
interactions.  In  the  following,  it  will  be  sufficient 
to  consider  only  the  cases  with  m  =  1  and  2, 

+  Tr^i[Va,2),f<-^Kl,2;t)]  =0  (7) 
+  i[H(l,2;0,/®(l,2;0] 

2 

+  E  Tr3f[V(f,3),/®(l,2,3;f)]  =  0,  (8) 

i=l 

where  the  single-particle  and  two-particle  Hamil¬ 
tonians  are  given  by 

H(l;  t)  =  /zo(l)  -  Ml)  •  E(f)  H(l,2;  t) 

=  E(/zo(0~MO*E(f))  +  m,2).  (9) 

t=i 

These  equations  are  exact  but  formal  since  they 
are  not  closed  in  terms  of  and  alone.  It  is 
necessary  to  construct  a  suitable  approximation 
that  expresses  as  a  functional  of  and 
Then  Eqs.  (7)  and  (8)  provide  the  means  to  calcu¬ 
late  and  In  particular,  if  the  equation  for 
is  solved  as  a  functional  of  i.e., 

/(^>(1,2;0-F<^>(l,2;f  If'O,  (10) 

then  use  of  this  solution  in  the  first  hierarchy  Eq. 
(7)  provides  a  closed  kinetic  equation  for 

djm;t)  +  i[H(l;t)J^Hl;t)] 

-h  Tr2/[y(l,2),F<2>(l,2;M/^^0]  =0.  (11) 

This  is  the  approach  proposed  here  for  obtaining 
the  appropriate  kinetic  equation  for  charge  carri¬ 
ers.  The  procedure  is  representation-independent, 
and  as  a  first-order  equation  in  time,  it  is  appropri¬ 
ately  posed  as  an  initial  value  problem.  This  is  in 
contrast  to  the  alternative  nonequilibrium  Keldysh 
Green's  function  method  which  requires  a  detailed 
reconstruction  of  the  specified  initial  conditions 
from  an  artificial  past  history  [7]. 


Exact  Exchange  Effects  and 
Correlations 

There  are  two  important  sources  of  correlations 
among  the  charge  carriers.  One  is  due  to  the  long- 
range  Coulomb  interaction,  while  the  other  is  due 
to  the  exchange  symmetry  among  particles  of  the 
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same  species.  It  is  useful  to  extract  explicitly  the 
dominant  effects  due  to  exchange  symmetry  by 
identifying  the  exchange  operators  implicit  in  the 
definitions  of  the  reduced  density  operators.  This 
will  allow  a  more  controlled  identification  of  the 
residual  correlations  due  to  Coulomb  interactions. 
The  two-  and  three-particle  reduced  density  opera¬ 
tors  are  expressed  as 

=  ^(l,2)/(")(l,2;f)  (12) 

/<3)(1, 2, 3;  f )  =  2, 3;  f  )^(1, 2, 3) 

=  ^(1,2,3)/®(1,2,3;0,  (13) 

where  ^1,2)  and  5^(1, 2, 3)  are  the  two-  and 
three-particle  antisymmetrization  operators  that 
convert  product  states  for  two  and  three  electrons 
into  antisymmetric  states,  respectively.  Such  sym- 
metrization  factors  occur  necessarily  from  the  defi¬ 
nition  of  the  underlying  N-particle  density  matrix. 
The  last  equality  follows  from  the  permutation 
symmetry  of  the  reduced  density  operators  and  of 
the  symmetrization  operators.  Next,  the  correlation 
operators  associated  with  2;  t)  and 

2,3;  t)  are  introduced: 

/(2)(1,2;0  (14) 

/(^)(1,2,3;0 

^/(^)(l;0/<'>(2;f)/<'>(3;f) 

+  Og(2,3;  t)  +/<'>(2;  0g(l,3;  t) 

+  f^K3-,t)g(l,2;t)+gil,2,3-,t).  (15) 

Since  the  symmetrization  operators  have  been  ex¬ 
tracted  explicitly  in  (12)  and  (13),  the  correlation 
functions  g(l,  2;f)  and  g(l,2,3;f)  are  primarily 
measures  of  correlations  due  to  the  Coulomb  inter¬ 
actions.  Conversely,  even  when  there  are  no  corre¬ 
lations  in  and  /^^^(l,  2,3;  0,  the  distri¬ 
bution  functions  t)  and  /^^Kl,2, 3;  t)  have 

two-  and  three-particle  correlations  induced  by  the 
antisymmetrization  operators.  Therefore,  it  is  use¬ 
ful  to  rewrite  the  first  two  hierarchy  equations  in 
terms  of  §(1,2;  t)  and  §(1,2,3;  0*  The  first  equa¬ 
tion  becomes 

+  Tr^ i[V,a, 2), /W(l;  t) f^K2;  f )] 

+  rr2f[V,(l,2),g(l,2;l)]  =  0,  (16) 


where  14(1,2)  =  y(l,  2)c5^(l,  2)  is  the  pair  potential 
with  exchange  effects.  The  left  side  of  this  equation 
is  recognized  as  generating  the  time-dependent 
Hartree-Fock  dynamics  so  the  equation  becomes 

<?,/W(l;f)  +  /[H,/l;t),f'’(l;0] 

=  -Tr2/[V,(1,2),^(1,2;0],  (17) 

where  the  Hartree-Fock  Hamiltonian  is 

H,/l;t)  =  H(1;0  +  ^./l;  t), 

y,/l;  t)  ^  TrJ^^Kl;  014(1,2).  (18) 

This  is  the  expected  result:  The  mean-field 
Hartree-Fock  dynamics  is  modified  by  a  coupling 
to  other  electrons  due  to  the  presence  of  Coulomb 
correlations. 

The  equation  for  the  correlations,  §(1,2;  0,  fol¬ 
lows  from  the  second  hierarchy  equation.  The  ef¬ 
fects  of  the  symmetrization  operators  are  evalu¬ 
ated  in  detail  in  the  Appendix  leading  to  the  exact 
equation  for  §(1,2;  t): 

d,ga,2;t)  +  i[H(l,2-,t)ga,2-,t) 
-g(l,2;t)HKl,2;t)] 

+  Tr,i{[V^a,3)J^^Kl;t)] 

Xf(2,3;f)M2,3) 

+  [V,(2,3),/«(2;f)] 

X|(l,3;0^(l,3)} 

=  -i{va,2-,t)f^Ki-,t)f^K2-,t) 

2 

-  E  Tr^i[V(j,3),g(l,2,3-,t) 

i=i 

X(1 -Pi3 -P23)],  (19) 

with  the  definitions 

H(l,2;  t)  =  H;,/l;  t)  +  H^j(2;  t)  +  1/(1, 2;  t) 

(20) 

1/(1, 2;  0  ^  {1  -/^'>(1;0  -/(i>(2;0}V(l,2). 

(21) 

The  first  two  terms  on  the  left  side  describe  pair 
dynamics  generated  by  the  Hamiltonian  H(l,  2;  t). 
This  Hamiltonian  differs  from  H(l,2)  in  two  im¬ 
portant  ways:  First,  the  single-particle  energies  are 
renormalized  to  the  mean-field  Hartree-Fock 
energies.  Second,  the  pair  potential  is  modified 
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by  the  prefactor  {1  -  ~  t)}  describ- 

mg  '"blocking"  effects  of  the  statistics.  To  see 
this,  note  that  {1  -  t)  ~~  t)}  = 

{h^^Kl;  t)  -  t)f^^K2)  0},  where 

=  1  is  the  "hole"  occupancy. 

Thus,  the  matrix  elements  of  y(l,2;f)  are  re¬ 
stricted  according  to  the  occupation  of  carrier  den¬ 
sity  in  the  nonequilibrium  state.  Both  the  blocking 
and  mean-field  effects  in  H(l,  2;  t)  account  for 
significant  many-particle  correlations  in  this  effec¬ 
tive  pair  dynamics.  The  third  and  fourth  terms  on 
the  left  side  describe  polarization  or  screening  ef¬ 
fects  that  are  essential  for  a  proper  treatment  of  the 
long-ranged  Coulomb  interactions.  In  the  classical 
limit,  these  are  the  linearized  Vlasov  operators 
leading  to  the  Debye-Hiickel  dynamic  screening 
and  pair  correlations.  Here,  these  operators  gener¬ 
ate  the  random  phase  or  chain  approximation  in¬ 
cluding  exchange  effects.  The  first  term  on  the 
right  side  of  (20)  is  a  source  of  correlations  due  to 
the  "commutator"  of  y(l,2;f)  with  the  uncorre¬ 
lated  pair  operator  t);  equivalently, 

the  time  derivative  of  the  uncorrelated  state  is 
correlated  due  to  the  Coulomb  interactions.  Here, 
this  source  of  correlations  is  due  to  the  pair  inter¬ 
action  including  blocking  effects.  Finally,  all  resid¬ 
ual  many-body  effects  are  contained  in  the  last 
term  on  the  right  side  describing  three-particle 
correlations  not  associated  solely  with  three-par¬ 
ticle  exchange  effects. 

Equations  (17)  and  (20)  are  still  exact  and  fully 
equivalent  to  Eqs.  (7)  and  (8).  However,  the  analy¬ 
sis  of  the  exchange  effects  and  the  transformation 
to  the  correlation  functions  provides  a  form  in 
which  the  dominant  mean  field,  polarization,  and 
blocking  effects  are  made  explicit.  This  is  a  more 
convenient  form  for  introduction  of  appropriate 
approximations  or  application  to  specific  prob¬ 
lems. 


Bloch  Representation 

The  above  analysis  has  used  only  the  abstract 
operator  form  of  the  reduced  density  operators 
and  associated  hierarchy  equations.  Specific  appli¬ 
cations  require  a  particular  matrix  representation. 
For  example,  a  fully  ionized  plasma  might  best  be 
described  in  terms  of  single-particle  momentum 
states.  To  show  the  relationship  of  the  density 
matrix  approach  to  the  standard  equations  of 
semiconductor  physics,  we  consider  a  representa¬ 


tion  using  Bloch  states  I  h(l)  \  ak> 

=  e^(k)  I  ak).  Only  the  first  hierarchy  equation 
will  be  considered;  the  analysis  for  the  second 
equation  is  similar: 

t)  +  t)]  +  C(l;  t)  =  0, 

(22) 

where  C(l;  t)  is  the  "collision"  operator  due  to 
pair  correlations 

C(1;0  =  Tr2z[V,(l,2),g(l,2;0].  (23) 

The  band  occupation  densities,  n^(k;  t),  and  the 
polarization  densities,  defined  by 

n^(k;  t)  =  <ak  |  \  ak> 

p,,,(k;0^<ak|/(i>(f)|a'k>,  (24) 

where  the  band  indices  a  a'  in  the  definition  of 
the  polarization  density  t).  To  simplify  the 

discussion,  we  consider  the  case  of  only  two  bands 
(conduction  and  valence).  Also,  we  consider  only 
homogeneous  states,  so  that  the  one-particle  den¬ 
sity  matrix  is  diagonal  in  the  wave  vector,  k.  The 
equations  are  then  found  to  be 

0  -  2Im{<ak  |  H,/l;  f)  |  a'k>p„,„(k;  f)} 

+  C,„(k;  f)  =  0  (25) 

<?,p,„.(k;0  +  f[<ak  I  |  ak> 

-<a'k|H,/l;f)|a'k>]  p„„.(k;  0 

+  /<ak|%(l;0|a'k> 

X[n„,(k;  t)  -  n„(k;  f)] 

+  C„,,(k;0  =  0,  (26) 

where  it  is  understood  that  a  ¥=  a'.  The  matrix 
elements  in  these  equations  are  easily  evaluated: 

<ak|H,/l;0l  z^k) 

-  6,(k)5,^,  +  <ak  I  I  ^k>  ~  E(t)  •  (27) 

where  =  <ak  |  /x  |  vk).  Equations  (25)  and 

(26)  then  become 

<?(n„(k;  f)  -  2Im{[<ak  |  \  a'k>  -  E(0  • 

Xp«'a(l«;f)}  +  Ok;0  =  0  (28) 

<?fPaa'(k;  0  +  i[(ak  \  t)  \  ak> 

-<a'k|H,/l;f)  |a'k>]  p„„,(k;f) 

+  z[<ak|  V.^la'k)  -E(0-jLt„„,] 

X(M„.(k;  t)  -  n^(k;  0)  +  C„„.(k;  t)  =  0. 

(29) 
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The  matrix  elements  of  the  Hartree-Fock  potential 
are  easily  evaluated  and  the  resulting  equations 
for  n„(k;  f)  and  f)  are  found  to  be 

^,«^(k;  t)  +  2Im(E(0  •  f)) 

+  2lmY^V(q)[n^(k  +  q;f)/„(k,  -q) 
q 

Xf*^(k,  -q)  +  n„ik  +  q;  t) 

X4,(k,  -q)I*,(k,  -q) 

+p,^(k  +  q;  f)f,,(k,  -q)/*„(k,  -q) 

+  p„,(k  +  q;  t)I„(k,  -q)f*(k,  -q)]  p„,(k;  t) 
+  C„{k;  f)  =  0  (30) 

dtp^„(k;t)  +  i(e^(k)  -  €„(k)  -  E(f) 

^  Ace  At)  z)  ])p„(k;f) 

-iY,V(q)[n^{k  +  q;t) 
q 

x(|J„(k,-q)P-|Ukt-q)P) 

+  n^(k  +  q;t)(l4j,(k,  -q)|^  -  \I^„{k,  -q)l^) 
+  2p„(k  +  q;  t)Re(/„(k,  -q)I*„(k,  -q) 
-Uk,-q)Ok,-q))]Pct,(k;f) 

-i|E^('?)[”c(k  +  q;  0 

Xf„(k,  -q)4*(k,  -q)  +  n„(k  +  q;  f) 
xUk,  -q)I*„(k,  -q) 

+p„(k  +  q;f) 

XI Jk,  -q)/*„(k,  -q)  +  p„,(k  +  q;  f) 
Xf„(k,  -q)J*(k,  -q)] 

+  E(t)  • 

X(n^(k;  0  -  n^(k;  f))  +  Q^(k;  f)  =  0.  (31) 

Here,  4j3^.(k,q[)  arises  from  the  Coulomb  matrix 
elements 

Ot2k2!  lo£.2k2/ 

^k'i  +  k'2,ki  +  k2l^  +  q 

q 

X  ~q)la,a'P^2rq)  (32) 


4a-(k,q)  -  / 

=  7:.„(k-q,-q)  (33) 

and  V(q)  is  the  Fourier-transformed  pair  potential. 
The  valence  and  conduction  occupation  numbers 
are  related  by  n^(k;  t)  +  n^(k;  t)  =  constant,  and 
the  polarization  densities  are  related  by 
These  are  the  most  general  hierarchy  equations. 

The  matrix  elements  C^„/(k;  f)  describe  both 
inter-  and  intraband  collisions  and  can  be  ex¬ 
pressed  in  terms  of  the  matrix  elements  of  the 
correlation  function  g(l,  2;  t)  using  (23) 

Q„.(k;  f) 

=  <ak|{Tr2i[V,(l,2),g(l,2;f)]}|a'k) 

=  f  E  E  {<ak;  j'iki|V,|j/2k2;  vjkj) 

ki,k2,k3  7^1,  1^2/  *^3 

X<V2k2;  Vjkjifla'k;  v^k^} 

-<ak;  Vjkj|gli'2k2;  I'skj) 

X<r'2k2;  V3k3|y>'k;»'iki>}.  (34) 

Further  reduction  follows  from  substitution  of  the 
Coulomb  matrix  element  (32). 

In  practice,  the  relevant  values  of  q  may  be 
sufficiently  small  to  justify  the  approximation 
4,p(k,q)  ^  f„„(k,0)  =  Then,  these  equations 
simplify  to  the  usual  forms 

<?,n,,(k;  f)  +  2Im(E(0  •  /r„p„,,(k;  f)) 

+  2lmY,V(q)p^„ik  +  q;  t)p^,(k;  t) 
q 

+  C„(k;  f)  =  0  (35) 

(?,p„(k;  f)  +  i(e^(k)  -  e„(k)  -  E(f) 

•[  Aaa  -  Aa'a'])Pci,(k;0 

-iYiViq)[n^(k  +  q;  f)  -  nj,(k  +  q;  f)] 
q 

Xp„(k;t) 

-iij^V(q)p,„(k  +  q;/t)  +  Elf)- 

V  q  I 

Xin^ik;  0  -  n^(k;  f)) 

+  C„(k;t)  =  0,  (36) 
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with  the  collision  matrix  elements 
Q,(k;f) 

=  -21m  Y.  V(q) 

q#  0/  kj 

x{(ck  —  q;  ckj  +  ql^lck;  ckj) 

+  <ck  -  q;  vkj  +  qlglck;  vkj) 

-  <cki  +  q;  ck  -  q|§|ck;  ckj) 

-<i)ki  +  q;ck  -  q|^|ck;t;ki>}  (37) 

Q,(k;  f) 

=  -21m  Y 

q#0,ki 

X{<ck  -  q;cki  +  q|^|z;k;cki> 

+  <ck  -  q;yki  +  q|f  |uk;  ukj)} 

-  Y  V(<?){<cki  +  q;ck  -  q|f|r;k;cki> 

q¥=  0,  kj 

+  <i;ki  +  q;ck  -  q|fbk;ykj> 

-<ck;  ckjl^lcki  +  q;  i;k  -  q> 
-<ck;i>k]|f|tJki  +  q;t?k  -  q>}.  (38) 

Aside  from  the  small  q  limitation  of  the 
Coulomb  matrix  elements,  the  above  equations  are 
still  exact.  The  detailed  dependence  of  C^„-(k;  f) 
on  n„(k;  t)  and  t)  requires  specification  of 

g(l,2;f)  as  a  functional  of  This  follows 

from  an  approximate  solution  to  Eq.  (19).  To  illus¬ 
trate  the  procedure,  consider  the  weak  coupling 
limit  in  which  ^(1,2;  t)  is  evaluated  to  first  order 
in  the  Coulomb  interactions.  Since  g(l,2,3;  t)  oc¬ 
curs  in  (19)  multiplied  by  y(l,2),  it  is  sufficient  to 
evaluate  g(l,2,3;  t)  to  zeroth  order  in  the  poten¬ 
tial.  However,  since  the  correlations  due  to  statis¬ 
tics  have  already  been  extracted,  the  three-particle 
correlation  function  vanishes  to  this  order.  Also, 
since  the  source  term  on  the  right  side  is  of  first 
order  in  the  potential,  then  g(l,2;  t)  also  is  of  first 
order.  Thus,  all  explicit  dependence  on  V(l,  2)  can 
be  neglected  on  the  left  side  of  the  equation,  lead¬ 
ing  to  the  weak  coupling  equation 

d,g(l,2;  t)  +  f[(H(l)  +  H(2)),  g(l,2;  t)] 

-/“(I;  f)/^>(2;  t)VHl,2-,  f)},  (39) 


with  the  solution 
|(l,2;f)  =  e->'(H(i)+H(2))f|(i 

•^0 

x{y(l,2;T)/<i>(l;T)/<i>(2;T) 

x/W(2;T)l/Ul,2;T)}e‘<»W+»(2)X(-T)_ 

(40) 

Since  the  Hamiltonian  H(l)  is  diagonal  in  the 
Bloch  representation,  it  is  straightforward  to  calcu¬ 
late  the  matrix  elements  of  g(l,  2;  f )  given  by  (40) 
and  obtain  the  collision  operator  as  a  quartic  func¬ 
tion  of  the  occupation  numbers  [recall  t)  = 

{l~f^Kl;t)-f^K2;t)}V(l,2)l  The  result  is  a 
generalization  of  the  Born-Boltzmann  collision  op¬ 
erator,  extended  to  include  the  effects  of  initial 
correlations  and  non-Markovian  effects  at  short 
times.  In  the  long  time  limit,  it  becomes  exactly  the 
Uhlenbeck-Boltzmann  collision  operator  with 
scattering  calculated  in  the  Born  approximation. 
An  improved  approximation  appropriate  for  most 
current  experimental  conditions  is  described  in  the 
next  section,  where  strong  scattering  and  polariza¬ 
tion  effects  are  accounted  for  as  well. 


Pair  Correlation  Approximation 

The  exact  transformed  hierarchy  Eq.  (19)  for 
g(l,2;  f)  is  an  appropriate  form  for  the  introduc¬ 
tion  of  approximations  since  it  makes  explicit  the 
physical  mechanisms  for  renormalized  single-par¬ 
ticle  states,  blocking  in  the  pair  d3mamics,  and 
dynamic  screening.  The  approximations  entail 
some  statement'  about  the  three-particle  correla¬ 
tions  in  g(l,2,3;  t).  More  specifically,  the  approxi¬ 
mation  should  give  g(l,  2, 3;  t)  in  terms  of  the  pair 
correlations  and  one-particle  distribution  function, 
so  that  the  first  two  hierarchy  equations  become 
closed  equations  for  and  §(1,2;  f).  The 

formal  solution  to  the  second  equation  provides 
the  functional  in  (10)  required  for  a  kinetic  equa¬ 
tion 

f®(i,2;f|/<'>) 

=  t)f^K2;  t)  +  |(1,2;  t  |  f^^))S^(l,2). 

(41) 
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In  general,  there  is  no  small  parameter  on  which  to 
base  a  systematic  expansion.  However,  there  are 
exact  structural  properties  of  the  hierarchy  equa¬ 
tion  that  impose  important  constraints  on  the  class 
of  approximations  considered  acceptable.  For  ex¬ 
ample,  the  single-particle  reduced  density  operator 
is  representable  as  a  trace  over  the  two-particle 
reduced  density  operator,  Nf  t)  ==  Tr2f^^^ 
(1,2;  f)  and,  therefore,  the  exact  first  hierarchy 
equation  for  t)  should  result  from  a  partial 

trace  over  any  acceptable  approximate  closure  of 
the  second  hierarchy  equation.  Additional  con¬ 
straints  follow  from  the  invariance  of  the  hierarchy 
equations  under  symmetry  transformations  de¬ 
rived  from  the  Hamiltonian  (e.g.,  rotations,  trans¬ 
lations,  Galilean  boosts).  An  important  conse¬ 
quence  of  these  invariances  is  the  local  conserva¬ 
tion  laws  for  mass,  energy,  and  momentum.  Con¬ 
straints  imposed  by  the  requirement  of  exact 
conservation  laws  are  discussed  in  [5,8,9].  Other 
constraints  can  be  included  as  well.  For  example,  if 
the  reduced  density  operators  for  the  equilibrium 
state  are  known,  it  can  be  required  that  they  are 
also  solutions  to  the  approximate  hierarchy  equa¬ 
tions.  In  this  way,  the  class  of  approximations  can 
be  assured  to  have  the  most  important  properties 
of  the  exact  hierarchy  in  a  context  that  does  not 
imply  weak  coupling  or  other  small  parameter 
limits. 

For  long-range  Coulomb  interactions,  it  is  im¬ 
portant  to  describe  screening  effects.  This  is  in¬ 
cluded  explicitly  on  the  right  side  of  (20),  as  are 
the  mean  field  Hartree-Fock-renormalized  single¬ 
particle  energies  and  the  Pauli  blocking  effects  for 
the  pair  potential.  Thus,  all  of  the  important  mech¬ 
anisms  appear  in  this  form  even  before  consider¬ 
ing  three-particle  correlations  in  g(l,  2,3;f).  Fur¬ 
thermore,  it  is  shown  in  [5]  that  any  choice  for 
g(l,  2, 3;  f )  that  is  Hermitian  and  pairwise  symmet¬ 
ric  in  the  particle  labels  preserves  both  the  repre- 
sentability  of  and  the  exact  conservation  laws. 
Consequently,  we  suggest  that  the  residual  three- 
particle  correlations  described  by  g(l,2, 3;0  can 
be  neglected  entirely,  except  for  the  conditions  of 
very  strong  coupling.  The  resulting  approximation 
preserves  the  exact  local  conservation  laws  for 
mass,  energy,  and  momentum  and  is  time  reversal 
invariant.  It  contains  strong  collision  effects  (ladder 
diagrams)  through  the  dependence  on  the  poten¬ 
tial  in  H(l,  2;  f )  and  dynamical  polarization  effects 
in  the  random-phase  approximation  (ring  dia¬ 
grams).  There  are  no  a  priori  limitations  on  the 


time  scale  in  this  approximation  so  that  a  uniform 
treatment  of  the  initial  value  problem  is  possible. 

With  this  approximation,  Eq.  (19)  becomes  a 
closed  equation  for  ^(1,2;  t)  in  terms  of  0, 

with  the  form 

(^, -L(l,2;f))g(l,2;f) 

-/(i>(l;  t)VHl,2;  t)] .  (42) 

The  formal  solution  provides  the  functional  in  (10) 
required  for  a  kinetic  equation 

F(2)(i,2;M/^'^)  -  {/^'^(l;0/^'^(2; 

+^(l,2;t|/<i>)mi,2),  (43) 
g(l,2;f  I /<!>)  =  Tr34LJ(l,2,3,4;f,fo)^(3,4;  to) 

—  zTr34  f  diUd,  2, 3, 4;  f ,  f ) 

x|y(3,4;?)/<i>(3;t)/(i>(4;i) 

-/<'>(3;f)/<'>(4;)))/^(3,4;f)]. 

(44) 

Here,  H(l,  2, 3, 4;  t,  to)  is  the  two-particle  propaga¬ 
tor  associated  with  the  generator  L(2,2;  f).  Its  form 
and  reduction  to  familiar  quantities  such  as  scat¬ 
tering  matrices  and  dielectric  functions  will  be 
discussed  elsewhere  [10]  and  only  the  structural 
features  noted  here.  The  first  term  on  the  right  side 
of  (44)  gives  the  pair  dynamics  due  to  initial  corre¬ 
lations  determined  from  the  system  preparation. 
The  second  term  describes  the  correlation  buildup 
even  in  the  absence  of  such  initial  correlations. 
Both  contributions  can  be  of  equal  importance  for 
initial  value  problems  at  short  times. 

The  collision  operator  in  the  kinetic  Eq.  (22) 
reflects  these  two  contributions: 

C(1;0  =  -/Tr2[y,(l,2),g(l,2;f)] 

=  r(l;f)  +  Kl;f)  (45) 

r(l;0  =  ~zTr234[K(1.2), 

17(1,2,3,4;  4;  to)]  (46) 

I(1;0  =  -fTr234/Vf[v,(l,2),U(l,2,3,4;f,)) 

h 

x{y(3,4;f)/W(3; '0/^(4;  f) 

t)VK3, 4;  f )}] .  (47) 
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The  Hartree-Fock  dynamics  is  modified  at  short 
times  by  the  conditions  of  the  initial  preparation 
through  The  other  modification  is  indepen¬ 
dent  of  the  initial  preparation  and  represents  the 
buildup  of  scattering  effects.  The  term  1(1;  t)  grows 
from  zero  to  an  asymptotic  value  1(1;  oo),  while  the 
initial  correlations  term  is  expected  to  de¬ 

crease  to  zero  for  most  physical  preparations.  This 
initial  transient  period  represents  the  transition 
from  time  scales  short  compared  to  the  scattering 
time  up  to  times  long  enough  for  completed  colli¬ 
sions.  On  the  long  time  scale,  C(l;  t)  1(1;^)  and 
the  kinetic  equation  takes  the  usual  Markov  form. 
In  this  limit,  a  Boltzmann-Uhlenbeck  form  is  ob¬ 
tained  with  scattering  determined  by  the  full  T- 
matrix  [11].  During  the  transient  period,  the  in¬ 
complete  scattering  is  described  by  an  effective 
potential  with  dynamical  screening  ranging  from 
Coulomb  at  short  times  to  fully  screened  at  long 
times. 


Discussion 

The  objective  of  this  presentation  has  been  to 
give  an  overview  of  an  approach  to  formulating 
practical  kinetic  theories  for  charged  particle  dy¬ 
namics,  with  the  potential  for  application  to  cur¬ 
rent  and  proposed  experiments  on  semiconductors 
using  femtosecond  lasers  to  prepare  and  probe  the 
charges.  The  new  features  of  such  experiments 
requiring  reconsideration  of  standard  kinetic  the¬ 
ory  are  states  far  from  equilibrium  and  the  inclu¬ 
sion  of  short  time  scales.  The  approach  here  formu¬ 
lates  the  description  in  terms  of  the  exact  hierarchy 
for  the  one-  and  two-particle  reduced  density  op¬ 
erators.  These  operators  determine  most  directly 
the  relevant  observables  of  interest.  There  are  two 
components  to  the  analysis.  First,  an  exact  trans¬ 
formation  of  the  first  two  hierarchy  equations  is 
performed  to  isolate  the  strong  correlations  due  to 
quantum  statistics  from  correlations  due  to  the 
Coulomb  forces.  This  description  in  terms  of  the 
excess  correlations  also  makes  explicit  the  mean- 
field  effects  associated  with  Hartree-Fock  renor¬ 
malization  of  the  single-particle  energies  and  the 
dominant  polarization  effects  necessary  to  account 
for  screening  of  the  long-range  Coulomb  forces.  As 
a  single-time  Markovian  first-order  set  of  differen¬ 
tial  equations,  the  description  is  formulated  as  an 
initial  value  problem,  appropriate  for  the  experi¬ 
mental  conditions  of  interest.  The  second  compo¬ 


nent  of  the  analysis  is  an  approximate  closure  of 
the  second  hierarchy  equation,  with  the  three-par¬ 
ticle  correlations  expressed  as  explicit  functionals 
of  the  two-particle  correlations  and  the  one-par¬ 
ticle  reduced  density  operator.  The  resulting  ap¬ 
proximation  transforms  the  first  two  hierarchy 
equations  into  a  closed  set  of  coupled  equations  to 
determine  the  one-particle-reduced  density  opera¬ 
tor  and  the  two-particle  correlations.  Finally,  a  last 
step  is  possible  where  the  approximate  second 
hierarchy  equation  is  solved  formally  to  give  the 
two-particle  correlations  in  terms  of  the  one-par¬ 
ticle  density  operator.  Use  of  this  result  in  the 
exact  first  hierarchy  equation  gives  a  closed  kinetic 
equation  for  the  one-particle  density  operator. 
While  the  coupled  pair  of  equations  for  the  corre¬ 
lations  and  one-particle  distribution  is  local  in  time 
(Markovian),  the  reduced  kinetic  theory  descrip¬ 
tion  in  terms  of  the  one-particle  distribution  alone 
is  necessarily  nonlocal  in  time.  This  latter  feature  is 
essential  for  a  proper  treatment  of  relaxation  and 
transport  at  short  times. 

The  transformation  of  the  hierarchy  and  result¬ 
ing  explicit  representation  of  the  most  important 
physical  mechanisms  simplifies  the  choice  of  ap¬ 
proximations  considerably.  Additional  constraints 
are  associated  with  the  exact  relationship  on  one- 
and  two-particle  reduced  density  operators,  the 
exact  conservation  laws,  and  the  existence  of  sta¬ 
tionary  states  (equilibrium,  in  the  absence  of  driv¬ 
ing  fields).  It  is  proposed  in  the  fifth  section  that 
these  constraints  are  satisfied  by  an  approximation 
obtained  by  neglecting  ^(1,2, 3;  0.  The  resulting 
approximate  equations  retain  all  of  the  desired 
physical  mechanisms  and  time  scales,  excluding 
only  conditions  of  strong  Coulomb  coupling.  The 
detailed  solution  to  the  equation  for  the  pair  corre¬ 
lations  is  essentially  an  effective  two-body  prob¬ 
lem  and  will  be  discussed  elsewhere. 

The  analysis  of  the  hierarchy  and  considerations 
leading  to  this  approximation  occurs  at  the  ab¬ 
stract  operator  level  and  does  not  require  explicit 
attention  to  the  detailed  structure  of  the  physical 
system  considered  (e.g.,  the  number  of  bands)  and 
additional  processes  are  easily  accounted  for  at 
this  abstract  level.  Having  completed  the  many- 
body  analysis,  the  physical  properties  of  interest 
follow  directly  from  an  appropriate  matrix  repre¬ 
sentation  of  the  operator  equations,  so  that  manip¬ 
ulation  of  complex  notation  is  deferred  to  this  late 
stage.  Of  course,  the  final  measure  of  any  ap¬ 
proach  is  the  ability  to  describe  interesting  new 
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phenomena  in  quantitative  applications.  We  hope 
to  report  on  such  results  in  the  near  future. 
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Appendix:  Transformation  of  the 
Hierarchy  Equations 

Define  a  superoperator,  0^^!^  ^hat  exchanges  the 
labels  of  particles  1  and  2  in  all  operators  to  its 
right: 

^i2X(1,2)  =  X(2,1).  (48) 

This  operator  should  not  be  confused  with  the 
permutation  operator  defined  over  quantum  states, 
Pi2/  that  permutes  the  quantum  labels  of  the  states. 
The  second  equation  of  the  BBGKY  hierarchy  then 
can  be  written  in  the  compact  form: 

+  (1  +^i2)Tr3/[y(l,3)/(3>(l,2,3;i)  -  he] 

=  0,  (49) 

where  he  indicates  the  Hermitian  conjugate  of  the 
preceding  term  in  the  brackets.  The  objective  of 
this  Appendix  is  to  show  how  the  exchange  effects 
in  the  definitions  of  /^^Kl/2;0  and  2,3;f) 

can  be  evaluated  exactly  to  give  the  Hartree-Fock 

energies,  the  blocking  factors  in  the  pair  potential, 
and  the  linear  operators  responsible  for  polariza¬ 
tion  effects.  Substitution  of  (14)  into  (49)  and  use  of 
the  first  hierarchy  equation  gives 

S^a,2){d,g^^Kl,2;t)  +  z[H(l,2;0,f‘^>(l,2;f)] 
+  i[V(l,2-,t),f^Kl-,t)f^^K2;t)]} 

-^(1, 2)(1  +  ^i2)Tr3z[  Va,  3) 3;  f ) 
X/W(2;f)^(l,3)  -  he] 

+  (1  +^i2)Tr3j 

x[Va,3)pKl,2,3;t) 

X^(l,2,3)  -  he]  =  0.  (50) 

First,  note  that  the  last  term  can  be  written  with  an 
overall  prefactor  of  ^(1, 2)  as  well  using  the  iden¬ 


tity  ^(1,2,3)  =  (1  -  Pi3  -  P23)c5^(1;2)  and  the  fact 
that  c5^(l,2)  commutes  with  any  symmetric  two- 
particle  operator.  Consequently,  the  overall  opera¬ 
tor  c5^(l,  2)  in  (51)  can  be  factored  out,  leaving  the 
simpler  result: 

d,f^Kl,2;t)  +  i[H(l,2-,t)g^^Kl,2;t)] 

+  i[V(l,2)t),f^Kl;t)f^^K2;t)] 

-a  +^i2)Tr3j[ni,3)/<2>(l,3;0 
X/‘^>(2;t)^(l,3)  -  he] 

+  (1  +^-i2)Tr^i 

x[v(l,3)/®(l,2,3;f)(l  -Pi3  -F23)  “H 

=  0.  (51) 

Next,  represent  2, 3;  f )  in  the  last  term  using 
(15)  in  the  equivalent  form 

/‘3>(l,2,3;f) 

=  /(2)(1, 3;  t)/(i>(2;  f )  +  f «(!;  t)g(2, 3;  t) 

+  /«(3;t)g(l,2;t)  +g(l,2,3;f) 

to  obtain 

Vil,3)f^^Kl,2,3;t)(l  -  Pi3  -  P23) 

=  y(l,3)/(2)(l,3;  Of^K2;  f)(^(l,3)  -  P23) 
+  y(l,3){g(2,3;t)/('^(l;f) 
+g(l,2;t)/W(3) 

+g(l,2,3;f)}(l -Pi3  -  P23).  (52) 

The  contribution  proportional  to  .5^(1, 3)  cancels 
the  terms  on  the  second  line  of  (51)  and  the  equa¬ 
tion  for  g^^Kl,  2;  f )  becomes 

d,gm,2;t)  +  i[H(l,2;t),gm,2;0] 

+  i[y(l,2;t),/<^>(l;f)/(’>(2;f)] 

-a  +,^i2)rr3/[m,3)/<2>(l,3;f) 

X/<i>(2;  t)P23  -  he] 

+  (1  +^,2^Tr3i[Vil,3) 

X{g(2,3;  f)/W(l;  t)  +  gil,2;  f)/(i>(3)} 

X  (1  —  P13  —  P 23)  ~  he] 

+  (1  +^,2^Tr,i[V(l,3)g^^Kl,2,3-,  t) 

X(1  -  Pi3  -  P23)  -  te]  =  0.  (53) 
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The  fourth  term  on  the  left  side  can  be  simplified 
using  the  identities 

^23^23  ^  1'  ^^3^23  ^  1/  ,  . 

P23X(1,2,3)P23  =  X(1,3,2)  ^  ^ 

to  write 

Tr3ni,3)/(2>(1,3;  0/^’>(2; 

=  Tr3P23P23y(l,3)/®(l,3;  t)f^K2;  OP23 

=  Tr^  P23V(1,  2)/<2)(1,  2;  0 

=  Tr3/(')(3;  0P23^(1,2)/PH1,2;  t) 

=  Tr2P23P2sf^^(^->  0P23V(1,2)/(2>(1,2;  t) 

=  Tr3P23/W(2;  f)V(l,2)/(^>(l,2;  f) 

=  0V(1,2)/(2>(1,2;  f).  (55) 

The  second  equation  of  the  BBGKY  hierarchy  now 
becomes 

<?,g(l,2;0  +  i[(H(l;f)  +  H(2;0),g<^>(l,2;f)] 

+  f[v(l,2;f)g(l,2;t)  -/ic] 

+  (1  +^32)Tr3i[ni,3){^(2,3;  t)/«(l;  t) 
+ga,2;  f)/(’>(3)}(l  -  Pi3  -  P23)  -  te] 

+  i[y(l,2;f)/(i)(l;f)/('>(2;f) 
-/W(l;f)/<i>(2;f)yt(l,2;f)] 

+  (1  +  ^12 )rr3i[  y(l,  3)f <=*>(!,  2, 3;  f ) 

X(1  -  Pi3  -  P23)  - /ic]  =  0,  (56) 

with  the  definition 

y(l,2;f)  ^  [1  -/<!>(!;  0  --/(^>(2;  0] V(l,2). 

(57) 

This  shows  clearly  the  origin  of  the  Pauli  blocking 
factor  in  the  pair  potential. 

The  remaining  exchange  terms  in  the  second 
line  of  (56)  can  be  analyzed  in  a  similar  way: 

Tr3y(l,3)[g(2,3;f)/W(l;0 

+|(l,2;f)/(i>(3)](l  -Pi3  -P23) 

=  Tr3y(l,3)[|(l,2;f)/(^>(3) 
-^(2,3;f)/«(l;f)Pi3] 


+  Tr3y(l,3)[f(2,3;f)/<‘>(l;f)(l  -  P23) 

-|(l,2;f)/(i)(3)(Pi3  +  P23)] 

=  Tr3y(l,3)[f(l,2;t)/(i>(3) 

-Pi3Pi3|(2,3;f)/W(l;f)Pi3] 

+  Tr3y(l,3)[g(2, 3;  f  )/(!>(!;  f)(l  -  P23) 

-Pi3Pi3§(l,2;f)/(b(3)P23(l  +  Pi3P,3)] 

=  Tr3y,(l,3)/i>(3)^(l,2;f) 

PTrjya,  3)|(2,3;  f)/®(l;  f)(l  -  P23) 
-Pi3g(2,3;  f)/('>(l)(l  +  Pi3P23)-  (58) 

The  first  term  on  the  right  side  gives  the 
Hartree-Fock  single-particle  energy  shift: 

y,/l;f)  =  Tr3y,(y3)/«(3), 
y,(i,3)  =  y(i,3)^(i,3). 

The  second  term  can  be  transformed  by  writing 
the  factor  (1  +  P13P23)  =  (1  -  P23)  +  (1  +  Pi3)P23 
=  (1  -  P23)  +  P23(l  +  Pu)-  Recall  that  an  overall 
factor  of  2)  has  been  canceled  out  in  writing 
(56).  Since  (1  +  Pi2)^(l,  2)  =  0,  the  term  P23(l  + 
P12)  vanishes  in  this  context  and  Eq.  (58)  simplifies 
to 

Tr3y(l,3)[g(2,3;f)/«(l;f) 

+g(l,2;f)/®(3)](l  -Pi3  -P23) 

=  y,/l,2;f)g(l,2;f) 

+  Tr3y,(l,3)^(2,3;  f)/'i>(l;  f)(l  -  P23). 

(59) 


The  second  term  gives  the  polarization  effects  with 
exchange.  The  exact  equation  for  g(l,2;0  now 
takes  the  final  form  of  (19). 
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ABSTRACT:  Using  a  molecular  cluster  with  71  atoms,  we  studied  the  electronic  and 
structural  properties  of  nitrogen  vacancy,  boron  antisite,  and  oxygen  in  cubic  boron 
nitride  through  an  ab  initio  Hartree-Fock  calculation.  We  found  that  all  defects  introduce 
a  deep  state  above  the  middle-energy  gap.  These  defects  present  a  local  symmetry.  In 
the  case  of  nitrogen  vacancy,  the  possibility  of  the  F-center  formation  is  discussed.  All  the 
calculations  were  performed  using  the  program  code  GAMESS.  ©  1997  John  Wiley  &  Sons, 
Inc.  Int  J  Quant  Chem  65:  941-946,  1997 


Introduction 

Boron  nitrides  exist  in  both  high-density 
(tetrahedral  diamondlike)  and  low-density 
(graphitelike)  forms  or  modifications.  Of  the  high- 
density  forms,  cubic  boron  (c-BN)  is  unique  among 
the  group  III  nitrides  in  that  its  zinc-blende  struc¬ 
ture  is  the  most  stable  and  best  characterized  form. 
Then,  c-BN,  the  lightest  of  III-V  semiconductors, 
is  a  large  band-gap  material  (-6.4  eV)  [1]  which 
has  several  similarities  compared  with  diamond, 
such  as  crystal  structure,  wide  energy  band  gap, 
extreme  hardness,  high  thermal  conductivity,  high 
melting  temperature,  and  low  reactivity  [2].  But 
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while  diamond  can  be  readily  doped  only  with  the 
p-type,  c-BN  can  be  doped  p-  and  n-types  when 
suitable  impurities  are  added  [3].  Although  it  could 
be  an  important  material  for  devices  of  high  tem¬ 
perature  and  high  pressure,  with  many  applica¬ 
tions  in  industries  in  the  future,  the  quality  of  the 
samples  remains  as  a  major  problem.  If  we  com¬ 
pare  it  with  other  III-V  materials,  c-BN  has  a  very 
high  concentration  of  native  defects. 

In  this  article,  we  report  a  theoretical  calculation 
of  the  behavior  of  native  defects  in  c-BN.  The 
electronic  and  structural  properties  of  N-vacancy 
(Vac^^p,  boron  antisite  (B^^^j),  and  oxygen  substi¬ 
tutional  (0[^])  are  calculated.  Although  several 
calculations  are  available  for  pure  materials  [4], 
there  are  only  a  few  works  concerning  the  theoreti¬ 
cal  study  of  the  defects  in  c-BN  [5, 6].  Particularly, 
an  interesting  work  was  performed  by  Gubanov  et 
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aL  [6]  using  LAPW-ASA-TB  for  vacancies.  How¬ 
ever,  in  that  calculation,  the  authors  did  not  in¬ 
clude  relaxation  or  asymmetric  distortions,  basi¬ 
cally  because  they  used  the  muffin-tin  approach 
for  the  potential.  These  effects,  as  we  will  show, 
are  important  to  interpret  the  experimental  results. 
As  far  as  we  know,  this  is  the  first  work  to  con¬ 
sider  distortions  induced  by  the  defects  in  the 
calculation. 


Theoretical  Procedure 

We  simulated  the  pure  crystal  through  the 
molecular  cluster  with  71  atoms 

initially  in  Td  symmetry,  as  seen  in  Figure  1.  The 
hydrogen  atoms  are  introduced  to  saturate  the 
dangling  bonds  at  the  cluster  surface,  as  usual 
[7, 8].  The  net  charge  of  3  +  at  the  cluster  is  neces¬ 
sary  to  get  a  neutral  charge  condition  as  in  a 
crystal.  Having  defined  the  molecular  cluster,  the 
total  energies  are  calculated  using  the  all-electron 
Hartree-Fock  method  in  the  usual  linear  combina¬ 
tion  of  atomic  orbitals  procedure.  The  atomic  basis 


FIGURE  1.  Molecularcluster  with  71  atoms 
(NB4N^2^i2N6H36)^^>  Initially  in  Td  symmetry,  with  the 
hydrogen  atoms  introduced  to  saturate  the  dangling 
bonds  at  the  surface. 


adopted  here  is  the  minimal  split- valence  3-21 G 
basis  set  [9],  which  consists  of  6s3p  (3s)  primitive 
Gaussian  functions  contracted  in  3s2p  (2s)  basis 
functions  on  each  B  and  N  (H)  atoms.  The  size  of 
the  set  thus  ranges  from  the  smallest  242  (for  the 
Vacj^j)  to  247  (for  and  B^^p  contracted 

Gaussian-type  functions.  The  calculations  were 
performed  using  the  program  code  GAMESS  [10]. 
Before  calculating  the  distortions  on  the  geometric 
structure,  when  defects  substitute  the  N  central 
atom  in  the  cluster,  we  first  optimized  the  defect 
nearest-neighbor  bond  distances  with  respect  to 
the  Aj  vibrational  mode.  This  is  necessary  to  avoid 
spurious  forces  in  the  defect  cluster.  As  we  know 
from  the  literature  [8],  this  cluster  size  is  appropri¬ 
ate  to  study  single-impurity  distortions. 

For  the  pure  c-BN,  the  calculated  eigenvalue 
separation  between  the  highest  occupied  molecu¬ 
lar  orbital  (t2“HOMO)  and  the  lowest  unoccupied 
molecular  orbital  (a^-LUMO)  obtained  is  8.07  eV. 
A  better  result  for  the  band  gap  is  obtained  using 
the  Mulliken  correction  [11]:  AEiHOMO  ~ 
LUM.O)  =  ^lUMo  ^HOMO  “  7  +  where  /  and 
K  are  the  Coulomb  and  exchange  integrals,  respec¬ 
tively.  Using  this  correction,  the  result  for  the 
energy  gap  drops  to  6.53  eV,  in  excellent  agree¬ 
ment  with  the  experimental  results  [1]. 


Results  and  Discussion 

We  show  schematically  in  Figure  2(a),  (b),  and 
(c)  the  electronic  energy  levels  in  the  gap  region 
for  the  Bj^j,  Vac^^j,  and  respectively,  using 
a  defect  molecule  model.  On  the  left  side  is  always 
the  pure  system  with  an  ajt^  configuration.  Our 
procedure  to  estimate  the  energy  levels  in  the  gap 
was  to  compare  the  pure  cluster  and  the  defect 
clusters.  The  electronic  energy  levels  showed  in 
Figure  2  were  obtained  as  differences  between  the 
calculated  Hartree-Fock  eigenvalues.  Fortunately, 
in  all  defects  studied  here,  the  density  of  states  at 
the  top  of  the  valence  band  practically  does  not 
change.  In  this  sense,  the  top  of  the  valence,  used 
as  a  reference  for  the  calculated  electronic  levels,  is 
fairly  well  defined. 

As  one  can  see  in  the  middle  of  Figure  2(a),  the 
electronic  consequence  of  B^^j,  before  structural 
minimization,  is  to  introduce  a  t2-level  occupied 
with  four  electrons  in  the  gap  region.  This  configu¬ 
ration  presents  a  degenerated  ground  state  and, 
consequently,  is  favorable  for  a  Jahn-Teller  distor- 
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FIGURE  2.  Schematic  representation  of  the  energy  levels  in  the  gap  region  of  c-BN  for  (a)  boron  antisite,  (b)  nitrogen 
vacancy,  and  (c)  oxygen  substitutional. 


tion.  The  optimized  geometric  structure  has  a  C3y 
symmetry,  where  the  orbital  is  split  in  a  twofold 
degenerate  fully  occupied  e  orbital  and  an 
empty  orbital,  as  shown  on  the  right  side  of  Figure 
2(a).  The  fully  occupied  e-level,  with  a  ligand 
character  (7r-type),  is  located  4.81  eV  above  the  top 
of  valence  band,  resulting  in  a  deep  donor  level.  In 
Figure  3(a)  and  (b),  we  show  the  total  charge 
densities  in  the  (110)  plane  for  the  c-BN  and  c- 
systems,  respectively.  From  Figure  3(b), 
we  see  that  the  B  antisite  atom  makes  bonds  with 
the  B  atoms  in  the  plane  of  the  symmetry. 


Concerning  the  structural  properties  for  the  B^^^ 
defect,  our  results  show  that  the  antisite  B  atom  is 
found  in  an  off-center  position.  We  first  performed 
a  breathing  mode  total  energy  minimization  which 
leads  to  a  small  inward  relaxation  of  5%  in  the 
bond  length.  Starting  from  this  relaxed  configura¬ 
tion,  we  calculated  the  total  energy  changes  of  the 
Bjjyj  atom  along  the  (111)  and  (100)  directions,  as 
shown  in  Figure  4.  For  the  distortion  in  the  (111) 
direction,  toward  an  intersticial  position  in  a 
configuration,  we  find  a  stable  position  with  an 
energy  gain  of  1.34  eV  and  a  displacement  of  the  B 


FIGURE  3.  Contour  plot  of  the  total  curves  for  displacements  of  B  antisite  in  (1 1 0)  plane  for  (a)  pure  c-BN  and  (b)  B 
antisite. 
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atom  of  0.28  A.  In  addition,  the  distortion  in  the 
(110)  direction  gives  a  metastable  configuration 
with  C2J,  symmetry,  where  the  energy  gain  is 
0.65  eV  and  the  displacement  of  the  antisite  B 
atom  is  0.16  A. 

With  respect  to  the  Vac^^^j,  the  calculated 
breathing  mode  relaxation  of  the  first  neighbor 
boron  atoms  gives  an  outward  displacement  of 
0.31  A  for  these  atoms.  A  subsequent  mode 
distortion  was  observed  with  one  boron  atom, 
following  a  (111)  direction  and  the  three  others, 
B2,  B3,  and  B4,  moving  closer  to  form  a  triangle,  as 
seen  in  Figure  5. 

As  shown  in  the  middle  of  Figure  2(b),  the 
removal  of  an  N  atom,  Vac^^j,  results  in  a  strong 
perturbation  in  the  electronic  configuration  of  the 
system  with  two  orbitals,  an  fully  occupied  in 
the  gap  region  and  a  ^2  singly  occupied  resonant 
in  the  conduction  band,  in  agreement  with 
Gubanov  et  al.  [6].  After  the  total  energy  mini¬ 
mization,  a  configuration  is  obtained.  The  f2 
orbital  splits  in  an  plus  an  e  orbital,  as  shown 
on  the  right  side  of  Figure  2(b).  In  the  gap  region, 
there  are  two  levels  with  character  (la^  2a\) 
located  at  1.76  and  4.90  eV  above  the  top  of 
valence  band,  respectively. 

In  Figure  6,  the  wave  functions  for  the  two 
highest  occupied  orbitals  (la^la^)  of  the  neutral 


B2 


FIGURE  5.  VaC[/v]  the  mode  distortion,  with  one 
boron  (B1)  following  a  (111)  direction  and  the  three 
others  (B2,  B3,  and  B4)  moving  closer  to  form  a  triangle. 

vacancy  are  presented.  In  the  figure,  the  (a)  and  (c) 
planes  contain  the  Bi-Vac-B3  centers,  while  the  (b) 
and  (d)  planes  the  B2-Vac-B4  ones.  The  fully  occu¬ 
pied  la^  defect  orbital  in  the  gap  region  has  a 
TT-character  with  a  charge  density  shared  among 
the  three  equivalent  (B2,  B3,  and  B4)  boron  atoms 
near  the  vacancy.  This  trapped  charge  resembles 
the  F  centers  that  arise  in  anion  vacancies  in 
alkali  “halide  crystals  [12].  Particularly,  the  forma¬ 
tion  of  F  centers  in  hexagonal  BN  (h-BN),  due  to 
nitrogen  vacancy,  was  proposed  by  Zunger  and 
Katzir  using  an  extended  Huckel  method  [13].  The 


FIGURE  4.  Calculated  total  energy  curves  for  displacement  of  B  antisite  in  (100)  (03^)  and  (111)  (C3^,)  directions 
(see  text). 
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FIGURE  6.  Contour  plot  of  the  (a,b)  la^  and  (c,d)  2a^ 
orbitals.  The  (a)  and  (c)  planes  contain  the  B^-Vac-Bg 
centers,  and  the  (b)  and  (d)  planes,  the  B2-VaC'B4 
centers  (see  text). 


singly  occupied  orbital  above  the  middle  gap 
has  an  electronic  distribution  strongly  localized  on 
the  boron  with  a  7r-character.  Although  there 
are  some  similarities  between  the  highest  occupied 
orbitals  in  c-BN  and  h-BN,  like  the  7r-character  and 
charge-confinement,  a  very  important  difference  in 
the  energetic  ordering  of  the  electronic  levels  is 
remarkable.  Differently  from  h-BN,  the  c-BN  could 
present  an  F  center  only  for  the  doubly  ionized 
nitrogen  vacancy  (Vac”^  ).  The  essential  role  played 
by  carbon  doping  for  the  formation  of  F  centers  in 
h-BN  [14]  could  also  be  seen  in  the  case  of  c-BN,  at 
least  for  high  concentrations  of  these  impurities 
when  a  recombination  occurs  between  the  electron 
localized  at  the  la^  orbital  and  the  hole  introduced 
by  the  carbon  atoms.  It  is  in  agreement  with  exper¬ 
imental  EPR  results  by  Fanciulli  and  Moustakas 
[15], 

The  electronic  energy  levels  for  the  relaxed 
(right)  and  unrelaxed  (left)  configurations  at 
symmetry  for  the  system  are  shown  schemat¬ 
ically  in  Figure  2(c).  We  obtained  a  singly  occu¬ 
pied  level  with  antibonding  character  that  comes 
from  the  conduction  band  4.70  eV  above  the  top  of 
the  valence  band.  Also,  a  hyper-deep  state  be¬ 
low  the  top  of  valence  band  and  a  resonant  p-type 
state  in  the  valence  band  are  obtained.  By  this 
analysis,  we  see  that  the  substitutional  oxygen 
impurity  introduces  an  antibonding  state  in  the 
gap  region. 

We  obtained  the  energy  minimum  when  the 
first  neighbors  relax  —  0.16  A  outward  (10%  in 
the  bond  length).  But  if  we  look  carefully  at  the 


total  energy  surface,  we  observe  that  the  system 
distorts  from  the  symmetry  toward  con¬ 

figuration,  with  the  oxygen  and  the  boron  atoms 
moving  oppositely  in  the  (111)  direction.  The  oxy- 

o 

gen  and  boron  displacements  are  0.03  and  0.18  A, 
respectively. 


Final  Remarks 

In  conclusion,  we  have  discussed  the  role  played 
by  the  native  defects  in  c-BN  using  an  ab  initio 
Hartree-Fock  method  within  the  cluster  approach. 
For  the  neutral  isolated  boron  antisite,  a 
Jahn-Teller  distortion  in  the  (111)  direction  is 
observed.  For  the  neutral  N  vacancy,  a  strong 
relaxation  is  found:  highly  localized  orbital  in 
the  vacancy  region  (F-center  precursor)  and  a  ^2 
orbital  resonant  in  the  conduction  band  appear 
before  the  lattice  relaxation.  The  relaxation  (-f-  dis- 
toration)  splits  the  ^2  orbital,  leading  to  a  half- 
occupied  level,  close  to  the  conduction  band. 
Our  calculations  enable  us  to  suggest  that  the 
strong  EPR  signal  seen  in  c-BN  [15]  could  be 
provided  by  the  trapping  of  two  electrons  by 
acceptor  levels  introduced  by  carbon  doping.  For 
the  neutral  we  found  that  an  a-^  orbital, 

half-occupied,  is  introduced  above  the  middle  of 
the  valence  band  after  lattice  relaxation. 
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ABSTRACT:  Magnetism,  conductivity,  and  superconductivity  of  molecular  materials 
are  discussed  on  the  basis  of  extended  Hubbard-Peierls  Hamiltonians,  which  involve 
four  parameters;  transfer  integral  (T),  electron-lattice  interaction  (W),  and  on-site  and 
intersite  Coulomb  interactions  (U,  V).  The  four-component  spinor  is  introduced  to  define 
order  parameters  which  characterize  four  different  electronic  states  (charge  density  wave, 
spin  density  wave,  singlet  and  triplet  superconductivities)  in  a  unified  fashion.  Several 
intermediate  regions  in  these  parameters  are  interesting  and  important  from  current 
theoretical  reasons.  Many  experimental  results  clearly  show  the  importance  of 
intermediate  correlation  regimes  for  active  controls  of  these  states  by  chemical 
modifications  and  by  external  fields  such  as  high  pressure.  Several  model  Hamiltonians 
such  as  CT  and  t-J  models  are  derived  for  elucidation  of  potential  electronic  properties 
of  molecule-based  materials.  Recent  computational  results  based  on  the  t-J  model  are 
utilized  to  rationalize  our  /  model  for  spin-mediated  superconductivity.  Possibilities  of 
magnetic  conductors,  charge-  and/or  spin-mediated  superconductors,  and  photoinduced 
superconductors  instead  of  the  charge-mediated  Little  model  are  examined  in  the 
intermediate  region  of  several  metal-insulator  transitions.  Possible  candidates  are  also 
proposed  on  the  basis  of  various  circumstantial  experimental  results.  Implications  of  the 
calculated  results  are  finally  discussed  in  relation  to  the  recent  development  of  the 
intersection  area  among  conducting,  magnetic,  and  optical  molecular  materials.  ©  1997 
John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  947-964,  1997 
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Introduction 

In  the  past  decade,  magnetism  and  electron 
correlation  have  attracted  much  interest  in  re¬ 
lation  to  high-T^  superconductivity  in  copper  ox¬ 
ides  [1].  Many  experimental  and  theoretical  efforts 
have  been  made  for  the  elucidation  of  possible 
interplay  between  p-  and  d-electrons  of  transition- 
metal  oxides  in  the  intermediate  or  strong  elec¬ 
tron-correlation  regime  [2].  In  previous  articles 
[3-5],  we  examined  model  Hamiltonians  for  cop¬ 
per  oxides  to  elucidate  two  key  points:  (1)  How 
many  active  orbitals  are  necessary  for  modelings 
of  electronic  structures  of  copper  oxides?  and  (2) 
What  kind  of  model  Hamiltonians  are  derived  on 
the  basis  of  the  strength  of  electron  correlations? 
Concerning  the  first  question,  one-  to  six-band 
models  (N  =  1-6)  were  found  to  be  plausible  for 
copper  oxides  depending  on  the  purposes  and 
problems  under  investigation.  For  the  latter  ques¬ 
tion,  there  were  at  least  four  different  classes:  (1) 
the  t-J  model,  (2)  the  spin-fluctuation  model,  (3) 
the  spin-  and  charge-fluctuation  model,  which  is 
referred  to  as  an  electron  correlation  (EC)  model 
[5],  and  (4)  the  charge-fluctuation  or  CT  model. 
Many  charge-  and/or  spin-mediated  models  are 
feasible.  For  example,  models  (1)  and  (2)  provide  a 
simple  equation  to  estimate  the  transition  tempera¬ 
ture  (T^)  for  spin-mediated  superconductivity 
[3-5]: 

Tc--CU,  (1) 

where  C  and  are,  respectively,  a  phenomeno¬ 
logical  constant  (C  =  0.1)  and  the  effective  ex¬ 
change  integral  for  the  superexchange-coupled 
transition-metal  ions:  M(II)0^“M(II)  (1)  (M  =  Cu, 
Ni,  etc).  Our  ab  initio  computations  [4]  showed 
that  the  value  for  the  Cu(II)0^~Cu(II)  unit  (la) 
is  over  1000  cm“\  and,  therefore,  is  higher  than 
100  K  in  copper  oxide  superconductors  [1,  2]. 
However,  they  indicated  that  copper  oxides  belong 
to  the  intermediate  regime  (see  circle  4  in  Fig.  5), 
and,  therefore,  charge  and  phonon  degrees  of  free¬ 
dom  also  contribute  to  the  high-T^  superconductiv¬ 
ity,  although  only  the  spin  degrees  of  freedom  is 
explicitly  considered  in  Eq.  (1). 

A  BRIEF  REVIEW  OF  OUR  PREVIOUS  WORK 

During  the  past  decade,  we  have  been  inter¬ 
ested  in  the  design  and  investigation  of  possible 


organic,  organometallic,  and  inorganic  analogs  to 
copper  oxides  on  the  basis  of  an  assumption  that 
these  analogs  should  have  low  CT  excitation  ener¬ 
gies  and  large  |/^^|  values  before  doping,  although 
many  physicists  have  focused  attention  on  the 
investigation  of  possible  microscopic  mechanisms 
of  the  high-T^  superconductivity  [2].  Previously 
[3-5],  the  possible  mechanisms  have  been  exam¬ 
ined  from  the  viewpoint  of  charge  and  spin  de¬ 
grees  of  freedom,  and  several  model  Hamiltonians 
such  as  the  Hubbard  model  [6]  have  been  dis¬ 
cussed  in  relation  to  magnetism,  conductivity,  and 
superconductivity.  Since  undoped  copper  oxide 
was  known  to  be  an  antiferromagnetic  insulator, 
molecular  magnetic  compounds  thus  attracted  our 
special  interest  [3-5,  7].  Therefore,  we  may  have 
the  first  hypothesis  for  molecular  design  1: 

(HI)  The  first  step  to  molecular  design  is  to  find 
possible  isoelectronic  units  to  the  Cu(ll)0^  ~  Cu(Il)  (la) 
system. 

From  the  above  hypothesis,  we  have  first  been 
interested  in  1,3-dipolar  compounds  [8]  RYR  (R  = 
CH2,  CH,  etc;  Y  =  O,  S,  etc.  (2)  because  of  their 
isoelectronic  structures  with  1  as  illustrated  in 
Figure  1.  Figure  2  illustrates  the  orbital  energy 
diagrams  for  1  and  2  to  clarify  their  interrelation¬ 
ship.  Isoelectronic  d-ir  systems  were  also  de¬ 
signed  by  replacing  the  inorganic  anion  X  in  metal 
oxides  with  an  organic  77-ligand  [9],  as  illustrated 
in  Figure  2,  and  isoelectronic  77-radical  (R)  sys¬ 
tems  were  theoretically  synthesized  by  substitut¬ 
ing  divalent  magnetic  ions  with  radical  groups  (jR) 
under  the  assumption  that  the  p-d,  ir-d,  and  77- jR 
energy  gaps  should  be  small  so  as  to  reproduce  the 
large  antiferromagnetic  (negative)  exchange  inte¬ 
gral  of  la.  To  confirm  the  assumption,  the  com¬ 
plete  active  space  (CAS)  three-orbital  four-electron 
{4, 3}  configuration  interaction  (Cl)  calculations  [10] 
were  carried  out  as  illustrated  in  Figure  3.  Since 
the  direct  d-d  (R-R)  interaction  is  small  because 
of  no  metal-metal  orbital  overlap,  the  p~d,  ir-d, 
and  7T-R  orbital  interactions  play  crucial  roles  in 
the  configuration  mixing.  The  CAS  Cl  calculations, 
i.e.,  exact  diagonalization  of  the  extended  Hubbard 
model  [6],  revealed  a  characteristic  feature  for 
the  Cu(II)0^"Cu(II)  unit  (la):  low  ligand-metal 
charge-transfer  (LMCT)  excitation  energy  (AE^) 
and  strong  mixing  between  the  metal  biradical 
(MBR)  and  ligand-metal  charge-transfer  (LMCT) 
configurations  because  of  the  large  p-d  orbital 
overlap  integral  and  small  p-d  charge-transfer  en¬ 
ergy  (Ap^)  as  illustrated  in  Figure  4.  The  strong 
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M, - X - Mj 

1 

a:  Mi  =  M2=Cu(n) 
b:  Mi  =  M2=Ni(n) 
x=  o^-,  s^-,  f'- 

etc. 


R - Y - R 

2 


a:R=CH2.Y=NH,0,S 
b:R=CH,Y=NH,  0,S 
etc. 


Fe 


X=O^S^  F^- 


a:Mi  =  M2=Cu(II)  '  a:  Mi=Fean),  M2=Cu(II)' 

b;  Mi=Fean),M2=Cr(III) 

FIGURE  1.  The  component  units  for  construction  of  chain,  ladder,  and  sheet  with  intermediate  or  strong  electron 
correlations. 


mixing  between  MBR  and  LMCT  also  resulted  in 
the  cases  of  several  organometallic  systems,  e.g., 
the  binuclear  iron-sulfur  system  Fe(II)S2~Fe(III) 
(3)  [11]  and  the  transition-metal  tetrathiolate  sys¬ 
tem  (4)  in  Figure  1.  In  fact,  the  |  value  has  been 
estimated  from  experiment  to  be  over  800  cm“^ 
for  the  binuclear  copper-tetrathiolate  complex  (4a) 
[12,  13],  and  this  value  is  comparable  to  the 
observed  value  (-1000  cm“^)  for  copper 
oxide.  Therefore,  we  have  tried  to  synthesize  one¬ 
dimensional  (ID)  and  2D  complexes  (5)  using  the 
tetrathiolate  ligand  (4)  as  shown  in  Figure  1.  The 
intrachain  values  observed  by  the  magnetic 
measurements  were  indeed  very  large;  for  exam¬ 
ple,  they  were  100  and  —63  cm“^  respectively,  for 
5a  {Ml  -  Fe(III)  and  M2  =  Cu(II)}  and  5b  (Ml  = 
Fe(III)  and  M2  =  Cr(III)}.  However,  the  intercol¬ 
umn  interaction  for  4  could  not  be  controlled  so  as 


to  obtain  proper  crystals  since  appropriate  ligands 
(L2)  have  not  been  discovered  yet  [12,  14]. 

The  CAS  Cl  calculations  have  shown  that  elec¬ 
tronic  properties  of  excited  states  of  the  p-d,  rr-d, 
and  tt-R  systems  in  Figure  4  are  sensitive  to  the 
interaction  parameters  [10]  and  that  CT  excited 
states  such  as  LMCT  are  generated  by  low-energy 
photoexcitation  or  even  by  thermal  excitation  in 
special  cases.  This,  in  turn,  indicates  that  photoex¬ 
citations  may  induce  dramatic  changes  of  elec¬ 
tronic,  magnetic,  and  optical  properties  of  these 
p-d,  TT-d,  and  tt-R  interaction  systems  under  con¬ 
sideration.  In  the  past  decades,  we  have  thor¬ 
oughly  investigated  dimer  and  clusters  of  typical 
model  systems  to  elucidate  the  nature  of  these 
chemical  bonds.  The  calculated  results  showed  that 
active  controls  of  charge,  spin,  and  photon  proper¬ 
ties  in  the  systems  are  feasible  because  of  strong 
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FIGURE  2.  Schematic  illustrations  of  the  orbital  energy 
levels  of  isoelectronic  p-d,  ir-d,  p-R,  and  tt-R  systems, 
which  are  schematically  explained  by  A,  B,  C,  and  D, 
respectively. 

coupling  of  three  degrees  of  freedom  and  nonlin¬ 
ear  responses  to  external  fields  as  has  been  empha¬ 
sized  in  previous  articles  [15,  16]. 

PRESENT  PURPOSE  AND  INTEREST 

As  a  continuation  of  this  previous  work,  we 
summarize  here  and  extend  our  basic  ideas  for 
molecular  design  and  fully  examine  theoretical 
models  for  conducting,  magnetic,  and  optical  phe¬ 
nomena  in  molecular  materials.  Our  previous  / 
model  [4],  which  was  presented  before  the  pro¬ 
posal  of  the  so-called  t-J  model,  is  extended  on 
the  basis  of  recent  numerical  results  based  on  the 


latter  model.  Many  experimental  results  are  also 
examined  systematically  as  circumstantial  evi¬ 
dence  from  our  theoretical  viewpoints  presented 
previously  [3,  5].  Several  theoretical  models  or 
hypotheses  will  be  derived  from  these  investi¬ 
gations  for  active  controls  of  charge  and/or 
spin-mediated  and  photoinduced  properties  of 
molecular  materials.  Probably,  such  a  semiempiri- 
cal  approach  would  be  permitted  as  a  chemist's 
approach  at  the  current  stage  of  both  theoretical 
and  experimental  studies  on  molecule-based  mate¬ 
rials  in  the  intermediate  and  strong  correlation 
regime.  This  article  contributes  to  the  recent  devel¬ 
opment  of  the  intersection  area  among  conducting, 
magnetic,  and  optical  molecular  materials. 


Theoretical  Backgrounds 

MODEL  HAMILTONIANS  FOR  EXTENDED 
SYSTEMS 

Several  microscopic  theoretical  models  [5,  17] 
are  first  considered  in  order  to  elucidate  the  nature 
of  itinerant  ferromagnetism  via  electron  correla¬ 
tion,  spin-mediated  high-T^  superconductivity, 
fractional  quantum  Hall  effect,  etc.,  which  are  our 
main  concerns  in  the  field  of  the  solid-state  quan¬ 
tum  chemistry  [3,  18].  From  the  viewpoint  of  the 
nature  of  chemical  bonds,  conductive,  magnetic, 
optical,  and  other  properties  of  molecular  solids 
under  investigation  are  governed  by  three  impor¬ 
tant  microscopic  variables:  (1)  the  transfer  integral 


(A) 


FIGURE  3.  Schematic  illustrations  of  six  configurations  constructed  of  three  active  orbitals  and  four  active  electrons 
{4,  3}.  Notations  are  given  in  the  text. 
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MMCT(R) 

MMCT(L) 

DCT 


MMCT“ 
MMCT  + 


DCT 


MLCT(R) 

MLCT(L) 

MDR 


Apd 


MLCT" 
MLCT  + 


AEct 


G 


before  Cl  after  Cl 

(A)  (B) 

FIGURE  4.  The  energy  levels  of  the  six  states 
(A)  before  and  (B)  after  full  Cl  calculation  of  the 
Hubbard  model. 


(r)  between  d  (or  radical)  and  p  (or  77)  orbitals, 

(2)  electron-phonon  (lattice)  interaction  (W),  and 

(3)  electron-electron  interaction  [10],  although 
other  factors  such  as  the  chemical  potential  and 
crystal  structure  and  disorder  are  also  important 
for  molecular  materials.  T  describes  the  itinerant 
character  of  electron  or  hole,  namely,  the  charge  or 
electron  transfer  in  molecule-based  solids.  The  last 
term  is  usually  divided  into  on-site  (U)  and  inter¬ 
site  iV)  repulsion  integrals.  Therefore,  the  proper¬ 
ties  (P)  of  TT-radical  (jR)  and  7r-d  interaction  sys¬ 
tems  are,  in  principle,  described  by  these  variables: 
P  =  functions  (T,  W,  Li,  1/ )  as  illustrated  in  Figure 
5,  and  the  extended  Hubbard-Peierls  Hamiltonian 
involving  these  parameters  (T,  ]N,U,V)  should 
have  a  general  applicability  to  theoretical  treat¬ 
ments  of  possible  new  molecule-based  materials 
examined  later.  This  Hamiltonian  is  often  reduced 
to  several  simple  model  Hamiltonians  such  as  the 
t-J  model  [5],  for  which  effective  variables  are 
derived  from  the  four  parameters  in  Figure  5. 

There  have  been  many  theoretical  studies  in  the 
extreme  cases  illustrated  in  Figure  5:  (i)  large  T- 
limit  (delocalized  limit),  large  LL-limit  (localized 
limit),  and  large  W-limit  (Jahn-Teller  limit).  The 
behaviors  of  molecule-based  materials  in  these 
limits  are  rather  simple,  and  the  nature  of  them  is 
understood  in  many  cases.  On  the  other  hand, 
several  intermediate  cases  shown  in  circles  1-4  in 
Figure  5  are  not  well  known  yet  because  of  diffi¬ 
culty  of  theoretical  treatments;  it  is  noteworthy 
that  conventional  perturbation  approaches  break 


(Electron  lattice) 


FIGURE  5.  Variations  of  electronic,  magnetic,  and 
optical  properties  with  four  microscopic  molecular 
parameters:  transfer  integral  (T),  electron-phonon 
coupling  (1/1/),  and  on-site  (U)  and  intersite  (V) 
electron-repulsion  integrals.  Circles  (1-4)  denote  the 
intermediate  cases  (see  text). 


down  [5].  However,  the  characteristics  of 
molecule-based  materials  are  often  regarded  as  the 
intermediate  strengths  of  four  parameters  in  Fig¬ 
ure  5  [3,  4,  18].  This  is,  in  turn,  one  of  the  reasons 
why  the  molecular  materials  of  our  concern  are 
important  and  interesting  from  a  theoretical  view¬ 
point  [5,  18].  Thus,  we  have  the  second  hypo¬ 
thesis  2: 

(H2)  New  phenomena  and  science  may  be  expected  from 
thorough  and  careful  investigations  of  low-dimensional 
molecular  systems  in  the  intermediate  regime  in  Fig¬ 
ure  5. 


ORDER  PARAMETERS  FOR  MAGIVETIC  AND 
SUPERCONDUCTING  STATES 

The  magnetism  and  superconductivity  are  typi¬ 
cal  macroscopic  quantum  phenomena  which  can 
be  described  by  characteristic  order  parameters. 
For  the  purpose,  field  theoretical  formulations  are 
briefly  summarized  here  [17,  19,  20].  The  four- 
component  spinor  ^(r)  is  introduced  to  character¬ 
ize  possible  electronic  phases  for  molecular  mater¬ 
ials — (1)  charge  density  wave  (CDW),  (2)  spin 
density  wave  (SDW),  (3)  singlet  superconductivity 
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(SSC),  and  (4)  triplet  superconductivity  (TSC): 


’P(r)  = 


I  'AtW  1 

- 1/'1,  (r) 


(2) 


^■^(r)  =  (i//1^(r)t//'l(r)ip^(t)  -  lAfCr)). 


•/'a  (r)(  i/'„(r))  is  the  fermion  creation  (annihilation) 
field  operator  with  spin  a  at  point  r,  and  it  satis¬ 
fies  the  anticommutation  relations.  The  subscripts 
t  and  i  mean  up  and  down  spins,  respectively. 
The  electronic  phase  is  characterized  by  its  order 
parameter,  which  is  given  by  the  product  of  spinor 
fields;  '4'(r)T^'^(r').  The  product  is  composed  of  16 
components  and  it  is  factorized  into  the  following 
forms  as 


/  F\r,r')  D(r,r')]  (3) 

\DMr,r')  F(r,r')/' 


where  each  block  component  is  given  by 


F(r,r') 

/ 1//^  (r)tp^  (r')  (r)i//'l  (r')  \ 

i//^(r))/>|(r')  (r)i/r^  (r')  I 


1 

2 


C(r,r')  +  S^(r,r')  S^(r,r')  -  iSy(r,r')^ 

S^(r,r')  +  fSy(r,r')  C(r,r')  -  Sjfr,^)  ^ 


(4) 


F'(r,r') 

I  -(/f’!|:(r)i/f^(r')iA^(r)i/>.|.(r')  ^ 


C'(r, r')  +  S^r,  r')  S;(r, r')  -  iS'/r,  r') ' 
S'^(r,r')  +  iSy(r,r')  C'(r,r')  -  S^fr,!')  ^ 


(5) 


and 


D(r,r') 

-  «/</r)iA^(r') ^ 

1  A(r,r')  +  Aj^Fr')  A^(r,r')  -  zAy(r,r') 

2  Aj,(r,r')  +  zA  (r,r')  A(r,r')  -  A2(r,r') 


(6) 


They  can  be  also  rewritten  by  the  field  product 
elements  summarized  in  Table  L  The  number- 
density  field  operator  C(r,  r')(C'(r,  r'))  is  the  anti¬ 
symmetric  spinor  and  means  an  order  parameter 
for  CDW.  The  spin-density  field  operators,  S^(r,  r'X 
Sy(r,  rO,  S^Cr,  r')(S'^(r,  r'),  and  so  on,  are  the  sym¬ 
metric  spinors  and  correspond  to  operators  for 
SDW.  The  order  parameters  for  SSC  and  TSC  are 
generated  from  the  off-diagonal  field  operator 
D(r,  r')  [19].  Thus,  superconductivity^  is  regarded 
as  an  off-diagonal  long-range  order.  A(r,  rO  is  the 
antisymmetric  spinor  and  ^generates  the  order  pa¬ 
rameter  for  SSC.  A  ,^(r,  rO,  ^lad  A^Cr,  rO  are 

the  symmetric  spinors  and  yield  order  parameters 
for  TSC.  The  SSC  state,  whose  order  parameter 
does  not  depend  on  the  Fermi  momentum,  is 
isotropic;  it  is  called  the  s-wave.  However,  the  SSC 


TABLE  I _ 

Order  parameters  for  CDW,  SDW,  SSC,  and  TSC. 

6{r,r')  =  +  ij/ ^(r)il/^(r') 

S^(r,rO  =  (rli/f +  (r') 

S^(r,r')  = 

C(r,r')  =  ipl(r)ip^(r')  + 


Sy(r,r')  =  i{il^^(r)il/^(r')  - 

S;(r,r')  =  (C)  - 

A(r,  r')  =  (At  (C)  -  jA^  ('')<At 

A^(r,  r')  =  (A|  (r)iAi  ir')  -  ^Ar 

Ay(r,rO  =  -/{iA|(r)iAi(C)  +  lA-,^  (r)iAt  (^')} 

A^(r, r')  =  (At  +  lAi  (/')^At  (/*') 


952 


VOL  65,  NO.  5 


SUPERCONDUCTORS  IN  METAL-INSULATOR  TRANSITIONS 


state  often  depends  upon  the  direction  of  the  mo¬ 
mentum  whose  magnitude  equals  the  Fermi  mo¬ 
mentum.  The  SSC  state  which  exhibits  dependence 
on  direction  is  an  anisotropic  SSC  state  such  as  the 
d-wave.  On  the  other  hand,  we  must  consider  both 
orbital  and  spin  degrees  of  freedom  to  characterize 
the  TSC  state.  The  order  parameters  of  spin  S  and 
angular  momentum  L  are  composed  of  2S  +  1 
spin  elements  times  2L  -f-  1  orbital  elements. 
Therefore,  TSC  have  many  possibilities  such  as  the 
p-wave,  showing  many  interesting  characteristics. 
These  SSC  and  TSC  states  can  be  characterized  by 
the  group-theoretical  methods  [21]. 

MODIFIED  PHASE  DIAGRAMS 
FOR  t-J  MODELS 

As  a  typical  example  for  explanation  of  long- 
range  orders  discussed  above,  let  us  consider  phase 
diagrams  for  doped  copper  oxides.  Previous  ab 
initio  UHF  [4]  and  CASSCF  [22]  calculations  indi¬ 
cated  that  extra  holes  are  introduced  mainly  into 
the  oxygen  site  for  copper  oxides  [2],  being  consis¬ 
tent  with  the  so-called  t-}  model  derived  from  the 
d-p  two-  or  three-band  Hubbard  model  of  the 
species  [23,  24].  Therefore,  a  reasonable  under¬ 
standing  of  our  /  model  [4]  can  be  obtained  from 
exact  and  numerical  results  obtained  by  recent 
exact  diagonalizations  [25-30]  of  t-]  models  for 
copper  oxides.  The  ID  t-J  model  was  solved  ex¬ 
actly  in  the  special  case  of  /  =  2f  [25],  and  its 
numerical  solution  was  also  obtained  to  depict  the 
phase  diagram  [26].  The  t-J  models  for  the  2D 
lattice  were  also  solved  numerically  to  describe  the 
high-T^  superconductivity  for  copper  oxides 
[27-30].  All  these  calculations  indicated  that  the 
large  /  value  is  responsible  for  the  formation  of  a 
Cooper  pair,  supporting  the  /  model  in  Eq.  (1). 
Here,  the  phase  diagrams  obtained  by  them  are 
utilized  for  the  extension  of  our  model  to  repro¬ 
duce  experimental  results  [2]. 

The  t-J  model  Hamiltonian  can  be  defined  as 

H(f-/)=  -t  £  il  -  -  nj  _„) 

irj,<T 

+  /LSrSy,  (7a) 

irj 

where  t  denotes  the  hopping  integral  for  the  sin¬ 
glet  pair  between  the  Cu-  and  O-hole  spins  [23], 
and  the  spin  operator  and  number  density  are 
defined  by  S,  =  and  n,-  =  re- 


spectively.  The  phase  diagram  for  the  t-J  model 
was  obtained  using  the  slave-boson  method  [25], 
where  is  defined  by 

(8) 

is  a  fermion  operator  that  carries  the  spin  and 
is  a  boson  operator  creating  a  vacancy.  The 
constraint  of  no  double  occupancy  at  the  site  i  is 
expressed  by 

+  (9) 

a 

Then,  the  t-J  model  Hamiltonian  can  be  rewritten 
as 

H(f-/)=  -f  L 

i,j,a  i,j 

-L4Lfrji.  +  Kb,-i),  (7b) 

i  '  or 

where  the  Lagrange  multiplier  A,  represents  the 
constraint  to  exclude  the  double  occupancy  at  each 
site.  The  singlet  spin-pairing  parameter  (S^p  is 
given  by 

-/u/yt)-  (10) 

The  2D  t-J  model  was  solved  under  the  mean- 
field  approximation  [27-30]  and  two  different 
transition  temperatures,  T(SG)  and  T(BE),  were 
obtained;  T(SG)  is  the  transition  temperature  for 
the  pairing  of  the  spinons  below  which  the  fermion 
response  function  shows  the  Meissner  effect,  while 
r(BE)  is  the  Bose-Einstein  condensation  tempera¬ 
ture  of  the  boson.  T(SG)  is  a  decreasing  function  of 
the  hole  concentration  x  and  becomes  zero  when 
f^max  ^  //  where  is  the  maximum  concentra¬ 
tion  for  the  appearance  of  superconductivity.  On 
the  other  hand,  r(BE)  is  an  increasing  function 
of  X,  e.g.,  r(BE)  =  47rfx  =  47r/x  (x  = 

The  four  different  phases  were  defined  by  using 
these  two  characteristic  temperatures,  r(SG)  and 
T(BE) — KFL):  the  Fermi  liquid  phase  where  (b^) 
=  0;  II(SG):  the  spin  gap  (SG)  phase  where  <S,y) 

0  but  (bj)  =  0;  III(SSC):  the  singlet  superconduct¬ 
ing  phase  with  the  d-wave  symmetry  where 
#  0  and  <&p  9^  0;  and  IV(EM):  the  exotic  metal 
phase  where  =  0  and  <bp  =  0.  The  spin  gap 
(SG)  appears  in  the  spin  excitation  spectrum  in  the 
phase  II(SG).  This  has  been  confirmed  by  an  NMR 
experiment  [31].  The  superconducting  order  pa- 
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rameter  [27]  in  Eq.  (6)  is  given  by 

=  {bifiSij),  (11a) 

where  is  defined  by 

A,.^.  =  j  dr  dr'cpf(r)A(r,  r')<p/r').  (11b) 

cpi  denotes  the  site  wave  function.  This  is  not  zero 
in  the  phase  III(SSC),  accompanied  by  the  appear¬ 
ance  of  the  mass  of  the  gauge  field  (the  so-called 
Anderson-Higgs  mechanism)  [19,  20];  the  boson 
(holon)  and  fermion  (spinon)  interact  through  the 
gauge  field  as  in  the  case  of  QCD  theory  [17]. 
Although  the  slave-boson  model  explains,  indeed, 
the  appearance  of  a  spin  gap  [31]  in  the  under¬ 
doped  copper  oxide,  its  behavior  near  the  thresh¬ 
old  X  =  0  is  not  clear,  although  the  experiment 
clearly  shows  the  existence  of  an  antiferromagnetic 
(AF)  phase  [2]. 

Therefore,  our  /  model  in  Eq.  (1)  is  extended  so 
as  to  reproduce  experimental  results  [2]  on  the 
basis  of  the  Ginzburg-Landau  (GL)  model  which 
has  been  developed  for  the  RVB  state  described  by 
the  t-J  model  [32].  From  the  mean-field  results 
[32],  two  order  parameters,  (pQip  and  corre¬ 
sponding  to  fermion  pairing  and  Bose  condensa¬ 
tion,  respectively,  were  introduced  to  define  the 
free  energy: 

^GLxt^GLX/^/ A] 

Kx  r  A  \  ^ 

=  d  ^  -  zma  -  i—nAj(l>cix 

+  2sgn(T  -  T(X)Mc,x\"  +  \<I>glx\'  .  (12) 

where  and  denote  the  coherence  length 
and  critical  field,  and  m  =  2  and  n  =  1  for  the 
fermion  (X  =  F)  and  m  =  1  and  n  =  0  for  the 
boson  (X  =  B).  a  and  A  denote  the  Chern-Simons 
gauge  field  [33]  and  vector  potential,  respectively. 
Here,  we  modify  the  GL  model  so  as  to  approxi¬ 
mately  describe  the  experimental  results  [2]  in 
combination  with  our  /  model,  which  was  derived 
from  the  ab  initio  molecular  orbital  results  for 
transition-metal  oxides  [34].  The  three  transition 
temperatures,  r(AF),  T(SG),  and  T(BE),  are  as¬ 
sumed  to  be  proportional  to  the  hole  concentra¬ 
tion,  while  the  transition  temperature  r(SSC)  for 
singlet  superconductivity  is  approximately  given 


by  the  quadratic  form  of  x: 

T(AF)  =  c(AF)/(x^,, -x)  (13a) 

T(BE)  =  c(BE)/(x-x^iJ  (13b) 

r(SG)  =  c(SG)/(x„,,  -xXx>  x^J  and 
T(SG)  =  ciSG)Jx(x<x^J  ^ 

T(SSC)  =  ciSSOfix  -  x„i„)(x^,,  -  x).  (13d) 

c(X)  are  the  constants  in  Eq.  (1),  and  the  existing 
regions  of  these  phases  are  defined  as  follows: 
antiferromagnetic  phase  (AF),  0  <  x  <  singlet 

superconductivity  phase  (SSC),  <  x  < 
and  Fermi  liquid  phase,  x^^^^  <  ^  (^^max  =  /)•  Fig- 
ure  6  illustrates  the  phase  diagram  for  five  differ¬ 
ent  (AF,  SG,  SSC,  FL,  EM)  phases.  The  maximum 
temperature  for  SSC  is  suppressed  in  our  model, 
in  conformity  with  the  gauge  field  fluctuation,  etc. 

The  SSC  in  the  Fermi  liquid  phase  I  can  be 
described  by  the  spin  fluctuation  model  [35,  36]. 
The  transition  temperature  for  SSC  is  expressed  by 
the  conventional  BCS  formula  [35-37] 

T(SSC)  ==  ~  x)exp(-l/A),  (14) 

where  the  cutoff  energy  is  given  by  the  character¬ 
istic  energy  Fq  of  spin  fluctuation  (SF)  and  is 
the  magnetic  coherence  length  [37].  The  SF  model 
at  least  works  well  in  the  overdoped  region  I  in 
Figure  6,  although  its  behavior  in  the  underdoped 
region  II  is  not  clear  yet.  If  the  true  order  parame¬ 
ter  for  the  high-T^  superconductivity  is  discovered 
in  the  future,  the  coherence  length,  penetration 


T 


FIGURE  6.  Phase  diagram  for  the  Fermi  liquid  I  (FL), 
spin  gap  II  (SG),  singlet  superconductive  III  (SSC),  exotic 
metallic  IV  (EM),  and  antiferromagnetic  V  (AF)  phases  for 
copper  oxide  superconductors  by  the  modified 
slave-boson  model. 
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depth,  and  transition  temperature  can  be  obtained 
by  the  same  formalism  in  Eq.  (12).  The  previous  / 
and  present  modified  GL  models  can  be  utilized 
for  molecular  design  of  spin-mediated  supercon¬ 
ductors,  although  they  are  phenomenological.  In 
fact,  several  experimental  groups  are  now  conduct¬ 
ing  synthetic  efforts  for  them  as  discussed  below. 

ACTIVE  CONTROLS  OF  PROPERTIES 

BY  EXTERNAL  FIELDS 

The  phase  diagram  in  Figure  6  clearly  shows 
that  electronic  phases  of  copper  oxides  are  variable 
with  hole  concentration  (x).  This,  in  turn,  indicates 
that  active  control  of  electronic,  conductive,  mag¬ 
netic,  and  optical  properties  by  chemical  modifica¬ 
tions  and  external  fields  are  feasible  for  potential 
applications  of  molecule-based  materials  to  ad¬ 
vanced  technologies.  Concerning  these  applica¬ 
tions,  there  are  several  experimental  and  theoreti¬ 
cal  reasons  why  the  intermediate  regions  in  Fig¬ 
ures  5  and  6  are  so  interesting: 

1.  Electronic  structures  in  this  regime  are  often 
sensitive  to  external  variables  (doping  of  hole 
or  electron,  pressure,  temperature,  external 
electronic  field,  photoexcitation,  etc.),  and  the 
variations  with  such  parameters  are  nonlin¬ 
ear  in  many  cases. 

2.  Controls  of  electronic,  magnetic,  optical,  and 
other  properties  with  the  above  variables  are 
feasible  in  appropriate  experimental  condi¬ 
tions  because  of  the  strong  coupling  of  charge, 
spin,  and  phonon  degrees  of  freedom. 

3.  Mean-field  theories  often  break  down  be¬ 
cause  of  strong  fluctuations,  and,  therefore, 
new  theories  such  as  gauge  field  theories  in 
quantum  chromodynamics  (QCD)  [17,  19,  20] 
would  be  useful  even  in  the  field  of  solid-state 
quantum  chemistry. 

Thus,  the  switching  of  conductive,  magnetic,  and 
optical  properties  by  the  external  fields  is  a  key 
issue  for  molecule-based  materials  in  the  interme¬ 
diate  regime  [3,  18].  Figure  7  illustrates  the  bista¬ 
bility  expected  in  several  natural  and  artificial  sys¬ 
tems  [14,  18].  Although  the  bistability  is  indeed 
one  of  the  guiding  principles  for  the  construction 
of  electronic  devices,  it  is  equally  applicable  to  the 
design  of  new  functional  materials  responsible  for 
active  controls  [3].  It  would  be  one  of  the  chemist's 
strategies  to  obtain  molecule-based  materials  with 


Electronic 

properties 


FIGURE  7.  Controls  of  electronic,  magnetic,  and  optical 
properties  of  molecule-based  materials  by  external 
parameters  (see  text).  Switchings  by  the  external  fields 
are  feasible  in  the  intermediate  regime. 


such  novel  properties,  although  powerful  metal 
magnets  and  high-T^  copper  oxides  have  already 
been  discovered.  Probably,  molecule-based  func¬ 
tional  systems  have  common  characteristic  fea¬ 
tures  with  biological  systems  [18]. 


Possible  Strategies  for  Charge-  and/ 
or  Spin-mediated  Superconductors 

GEIVERATIONS  OF  NEW  ELECTRONIC 
PHASES  BY  CARRIER  DOPING 

The  discovery  of  high-T^  copper  oxide  [1]  sug- 
gested  the  idea  that  doping  in  antiferro-  or  ferro¬ 
magnetic  systems,  more  generally  molecular 
charge-transfer  (CT)  insulators,  may  provide  sev¬ 
eral  exotic  electronic  phases  as  illustrated  in  Fig¬ 
ure  8  [3-5].  The  antiferromagnetic  insulators  may 
be  converted  upon  doping  into  six  different  phases: 
(1)  ferromagnetic  metal  or  insulator;  (2)  spin  glass; 
(3)  paramagnetic  metal;  (4)  antiferromagnetic 
metal;  (5)  ferrimagnetic  metal  or  insulator;  and  (6) 
charge-  or  spin-mediated  superconductor.  Recent 
field  theoretical  approaches  can  describe  these 
electronic  phases  in  a  systematic  manner  as  shown 
in  Eqs.  (2)-(6),  Unfortunately,  to  our  knowledge, 
theoretical  determinations  ojf  possible  phase  dia¬ 
grams  for  molecular  materials  [17,  22]  are  difficult 
tasks  except  for  a  special  case  such  as  the  Nagaoka 
theorem  [38],  because  the  perturbation  theory 
breaks  down  for  molecular  materials  under  inves¬ 
tigation.  However,  accumulating  experimental  re¬ 
sults  suggest  that  carrier  doping  into  intermediate 
or  strong  correlation  systems  exhibit  several  inter¬ 
esting  phenomena  in  Figure  8  and  other  quantum 
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carrier 

Mott-Hubbard 
and  CT  Insulator 


_ ^  ferromagnetic  metal  (FRM) 

or  insulator  (FRI) 

— spin-glass  (SG) 

— ►  paramagnetic  metal  (PMM) 

— ►  antiferromagnetic  metal  (AFM) 

_ ^  ferrimagnetic  insulator  (FRII) 

or  metal  (FRIM) 

_ ^  charge-  or  spin-mediated 

superconductor  (SC) 

FIGURE  8.  Possible  phases  obtained  by  carrier  doping 
into  Mott-Hubbard  and  CT  insulators. 


(A) 


spin  phenomena  [17]  expected  for  the  low-dimen¬ 
sional  systems  in  Figures  5  and  6,  Therefore,  a 
chemist's  approach  from  the  theoretical  side  would 
search  isoelectronic  series  to  copper  oxides  as  em¬ 
ployed  previously  [3],  while  that  of  the  experimen¬ 
tal  side  is  to  synthesize  new  isoelectronic  com¬ 
pounds  followed  by  physical  observations  [14b]. 
Collaborations  from  both  sides  would  open  the 
new  intersection  field  in  molecular  electronic  ma¬ 
terials.  For  the  theoretical  side,  the  following  hy¬ 
pothesis  3  is  assumed  as  [3-5] 


FIGURE  9.  Four  different  approaches  to  obtain  high-T^ 
superconductors  in  the  Intermediate  regime  of  the 
metal-insulator  transitions:  (A)  charge-mediated 
superconductor;  (B)  spin-mediated  superconductor. 


(H3)  Doping  of  hole  or  electron  into  Mott-Hubbard  or 
CT  insulators  may  provide  exotic  electronic  materials 
summarized  in  Figure  8. 

CHARGE-  AI\D/OR  SPIN-MEDIATED 

SUPERCONDUCTOR 

Although  many  theoretical  studies  of  doped 
copper  oxides  were  initiated  just  after  the  discov¬ 
ery  of  the  high-T^  superconductivity  [1],  theoreti¬ 
cal  attempts  to  examine  possible  molecular  analogs 
to  them  were  scarce  in  1986  [3-5,  7]  although  such 
attempts  are  now  popular  in  chemistry  and  mate¬ 
rial  science.  As  mentioned  previously  [5],  we  have 
therefore  started  our  theoretical  attempts  to  search 
possible  models  for  charge-  and/or  spin-mediated 
superconductors.  For  the  purpose,  we  assumed  a 
working  hypothesis  (H4): 

(H4)  Charge-  and/or  spin-mediated  superconductors 
may  appear  in  the  intermediate  region  of  metal- 
insulator  transitions  as  illustrated  in  Figure  9. 


Figure  9  illustrates  possible  approaches  based 
on  the  hypothesis  H4.  Concerning  charge-medi¬ 
ated  superconductivity,  the  introduction  of  charge 
fluctuation  from  the  metallic  side  was  already  ex¬ 
amined  by  Little  [39]  in  the  case  of  conductive 
polymers  and  related  species  in  Figure  9(A).  On 
the  other  hand,  the  possible  role  of  CT  excitation 
from  the  nonmagnetic  insulator  side  in  the  inter¬ 
mediate  region  of  metal-insulator  transitions  is 
not  elucidated  yet.  Therefore,  this  moot  point 
should  be  examined  later.  The  spin-fluctuation 
model  is  also  conceivable  from  the  metallic  side  in 
the  intermediate  regime  of  antiferro  (or  ferro)  mag¬ 
netic  insulator-to-metal  transitions  as  illustrated  in 
Figure  9(B).  On  the  other  hand,  the  t-}  model  or 
RVB  model  can  be  used  to  formulate  possible 
spin-mediated  superconductivities  from  the  mag¬ 
netic  insulator  side.  In  the  later  sections,  we  there¬ 
fore  examine  possible  candidates  from  these  theo¬ 
retical  viewpoints. 
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Models  for  Exotic  Spin-mediated 
Conductors  and  Superconductors 

ORGANIC  AND  ORGANOMETALLIC  RVB 
SYSTEMS 

First,  let  us  consider  the  spin-mediated  super¬ 
conductor  in  relation  to  copper  oxides.  The  intro¬ 
duction  of  electron  carriers  into  magnetic  insula¬ 
tors  is  one  of  the  possible  ways  to  obtain 
spin-mediated  superconductors  as  shown  in  Fig¬ 
ure  9(B).  The  RVB  state  with  spin  frustration  is 
expected  to  be  a  possible  precursor  for  the  pur¬ 
pose,  although  it  has  been  interesting  from  the 
mechanism  of  free-radical  reactions.  For  example, 
the  Heisenberg  model  for  organic  radicals  and 
radical  clusters  permit  general  spin  structures  such 
as  triangular  and  cone  ones  [40,  41].  In  fact,  these 
general  spin  structures  as  well  as  spin-frustrated 
configurations  appear  even  in  the  three  hydrogen 
system  and  four  hydrogen  system  which 
are  models  of  transition  structures  in  exchange-for¬ 
bidden  radical  reactions.  We  have  thoroughly  in¬ 


vestigated  these  systems  to  elucidate  group-theo¬ 
retical  interrelationships  between  several  theoreti¬ 
cal  models  [40]. 

After  the  discovery  of  the  high-T^  superconduc¬ 
tivity  [1],  triangular  spin  systems  have  attracted 
great  interest  because  of  the  Andersons's  resonat¬ 
ing  valence  bond  (RVB)  theory  [42]  for  magnetism 
and  superconductivity.  Many  field  theoretical 
models  [17]  have  been  presented  for  the  high-T^ 
superconductivity  in  conformity  with  Anderson's 
proposal.  On  the  other  hand,  we  considered  possi¬ 
ble  organic  [10]  and  organometallic  [11]  spin-frus¬ 
tration  systems  such  as  the  triangular  ladder  (IV) 
and  Kagome  ladder  (V)  in  relation  to  RVB  model 
for  copper  oxides  as  illustrated  in  Figure  10.  Both 
through-bond  and  intermolecular  approaches  are 
possible  to  construct  such  spin  networks  or  topol¬ 
ogy.  The  former  approach  enables  us  to  propose 
several  organic  [8]  and  organometallic  [11]  candi¬ 
dates  composed  of  the  bond  units  in  Figure  4.  We 
also  tried  to  synthesize  low-dimensional  systems 
composed  of  4Fe — 4S  clusters  with  spin  frustra¬ 
tions,  but  amorphous  oligomers  resulted  to  pre¬ 
vent  further  experimental  treatments  [43].  Instead 


(A)  (B)  (C) 

FIGURE  10.  1-D,  ladder,  and  2-D  spin  systems  with  and  without  hoies  (or  electrons)  which  are  attractive  from 
magnetism  and  superconductivity. 
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of  the  former  (through-bond)  approach,  the  latter 
(through-space)  approach  is  rather  simple  since 
many  radicals  and  ion-radicals  can  be  used  for 
component  units.  For  example,  three-center  units 
(2),  nitronyl  nitroxide  (NN)  derivatives,  doped  TTF 
and  BEDT-TTF  molecules,  etc.,  would  be  building 
blocks  for  molecular  ladder  compounds  in  Figure 
10. 

Awaga  et  al.  [44]  discovered  the  organic  Kagome 
system  composed  of  p-MPNN,  which  does  not 
exhibit  the  magnetic  ordering  even  at  low  temper¬ 
ature  because  of  strong  spin  frustration.  But  intro¬ 
duction  of  the  carrier  into  this  system  seems  diffi¬ 
cult.  Enoki  [45]  also  found  the  triangular  lattice 
composed  of  the  BEDT-TTF  plus  Br  complex.  Their 
preliminary  results  for  the  complex  showed  pecu¬ 
liar  magnetic  behaviors.  Apparently,  much  more 
experimental  effort  is  desirable  to  open  a  new  field 
in  this  direction.  Then,  we  have  the  following 
h5^othesis  5: 

(H5)  Exotic  electronic  phenomena  may  appear  upon 
doping  of  organic  and  organometallic  resonating  va¬ 
lence  bond  (RVB)  systems. 

iV-BAND  MODELS  FOR  ORGANIC 

FERROMAGNETS,  FERROMAGNETIC 

CONDUCTORS,  AND 
p-WAVE  SUPERCONDUCTORS 

The  ferromagnetism  for  electron  correlated  sys¬ 
tems  with  N-tuple  degenerate  orbitals  is  a  classic 
problem  in  magnetism  because  of  the  potential 
exchange  interaction;  the  parameters  arising  from 
the  Coulomb  interaction  (U,  V)  in  Figure  5  should 
be  considered  [46,  47].  If  an  electron  carrier  is 
doped  into  the  systems,  the  interaction  between 
the  conduction  electron  and  the  localized  spin  is 
realized,  showing  common  features  for  other  sys¬ 
tems  examined  in  this  article.  Recently,  revisited 
interest  for  degenerated  N-band  systems  has  arisen 
from  several  discoveries.  The  orbital  degeneracy 
(freedom)  such  as  the  three  degenerate  ^2^  orbitals 
of  Cgo  (7)  plays  an  important  role  for  orbital  ferro¬ 
magnetism  [46].  The  Hund  rule  for  the  orthogonal 
^2^  orbitals  favors  the  high  spin  state  of  the  dian¬ 
ion  and  trianion  of  7,  but  the  ground  state  of  the 
dianion  is  a  singlet,  showing  the  important  role  of 
an  electron  correlation  effect  as  in  the  case  of 
square  planar  cyclobutadiene.  Thus,  the  interplay 
or  competition  among  ferromagnetic  exchange 


from  orbital  degeneracy,  electron  correlation  (Lf), 
and  the  Jahn-Teller  distortion  ( W  )  is  one  of  funda¬ 
mental  problems  in  molecular  magnets  with  N- 
band  degeneracy.  The  TMAE'C^o  CT  complex  ex¬ 
hibits  complex  magnetic  behaviors  because  of  these 
three  factors;  many  spectroscopic  experiments  are 
not  conclusive  and  are  not  settled  yet  [46]. 

The  backbone  and  pendant-type  polycarbene 
polymers  investigated  by  the  Ito-Takui  [48]  and 
Iwamura  [49]  groups  are  particularly  interesting 
since  the  polycarbene  species  are  regarded  as  the 
orthogonal  two-band  (a  and  tt)  systems.  Theoreti¬ 
cal  calculations  [7,  50]  were  carried  out  to  eluci¬ 
date  possible  spin  states  of  these  species  after 
doping  and  it  was  concluded  that  the  doping  of 
carriers  into  these  polymers  would  become  a  model 
for  the  ferromagnetic  conductor  and  the  molecular 
triplet  superconductor.  Recently,  the  Shida  and 
Ito-Takui  groups  initiated  doping  experiments  into 
polycarbenes  [51]. 

Recently,  Maeno  et  al.  [52]  discovered  the  su¬ 
perconductivity  for  non-copper  oxides:  Sr2Ru04.  A 
theoretical  possibility  of  triplet  superconductivity 
(TSC)  in  Table  I  was  pointed  out  for  Sr2Ru04 
[53].  According  to  the  band  calculation  [54],  the 
ds(7r)-p7r  interaction  is  essential  in  this  com¬ 
pound  in  contrast  to  the  dy(a)-p(T  interaction  in 
copper  oxide.  In  our  /  model  in  Eq.  (1),  the  for 
triplet  superconductivity  should  be  rather  low  be¬ 
cause  the  ferromagnetic  spin  fluctuation  is  weak  in 
Sr2Ru04.  If  Sr2Ru04  is  really  the  triplet  supercon¬ 
ductor,  it  supports  the  isoelectronic  analogies  in 
Figure  2.  Several  recent  findings  for  the  inorganic 
systems  may  provide  the  following  hypothesis  6 
for  molecular  materials: 

(H6)  Carrier-doped  molecule-based  magnets  with  N- 
band  degeneracy  are  interesting  for  ferromagnetism, 
magnetic  conductivity,  and  triplet  superconductivity. 


Models  for  Charge-mediated 
Superconductors 

The  transition  temperature  for  spin-mediated 
superconductors  are  estimated  [5]  to  be  150  K  even 
at  the  upper  limit,  since  the  largest  effective  ex¬ 
change  integrals  are  in  the  order  1000  cm“h  There¬ 
fore,  we  may  change  the  strategy  to  obtain  the 
room-temperature  superconductor.  Several  ap¬ 
proaches  [55]  from  the  metal  side  in  Figure  9(A) 
were  performed.  Probably,  new  approaches  differ- 
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ent  from  them  are  necessary  for  the  purpose.  Here, 
let  us  consider  the  CT  model  from  the  insulator 
side  in  Figure  9(A)  instead  of  charge  fluctuation 
(exciton)  model  by  Little  [39]. 


ALKALI-,  ALKALI-EARTH-,  AND 

HYDROGEN-TERNARY  COMPLEXES 

The  chemical  synthesis  of  high-T^.  superconduc¬ 
tors  without  copper  ion  is  one  of  a  chemist's 
dreams.  According  to  the  working  hypothesis  H5, 
let  us  consider  possible  model  systems  in  the  inter¬ 
mediate  region  of  metal-insulator  transitions  [56, 
57].  Building  blocks  composed  of  hydrogen  (H) 
and  alkali  earth  atoms  are  regarded  as  the  Is-  and 
2  s  orbital  pairs  having  the  isoelectronic  configura¬ 
tions  to  those  in  Figure  2.  The  alkali  (A)-  and 
alkali-earth  (B)-atom  combinations  also  provide  the 
same  electronic  configurations.  These  simple  com¬ 
binations  are  interesting  because  high  pressures 
can  be  used  to  obtain  solids.  The  ternary  com¬ 
plexes,  HBH,  HBA,  and  ABA,  are  isoelectronic  to 
the  three-center  four-electron  systems  depicted  in 
Figures  3  and  4.  On  the  other  hand,  HHH,  HAH, 
AAH,  and  AAA  components  are  formally  isoelec¬ 
tronic  to  the  one-extra  hole-doped  three-electron 
states  of  the  three-center  systems  in  Figure  4.  The 
CT  excitations  (A)  can  be  controlled  so  as  to  couple 
the  Cooper  (2  e)  pair  in  the  intermediate  regime  of 
the  metal-insulator  transitions.  Therefore,  three- 
component  solids  expressed  by  (H)^(A)y(B)2  would 
be  particularly  interesting  from  the  CT  mechanism 
of  the  high-T^  superconductivity  in  Figure  9(A). 
The  partial  addition  of  alkali  and  alkali-earth  atoms 
into  metallic  hydrogens  at  high  pressure  would  be 
one  of  the  charge-fluctuation  approach  in  Figure 
9(A).  On  the  other  hand,  let  us  consider  a  model 
system  of  the  CT  mechanism  of  the  partial  addi¬ 
tion  of  hydrogen  atoms  into  an  insulator,  e.g., 
which  is  consisted  of  LiH  and  BeH2,  at  high  pres¬ 
sure  as  shown  in  Figure  11. 


A  FORMULATION  OF  OLR  CT  MODEL 

To  clarify  the  difference  between  Little  [39]  and 
our  models,  we  describe  here  the  model  Hamilto¬ 
nians  explicitly.  For  the  underdoped  state,  a  model 
Hamiltonian  is  written  as 
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FIGURE  1 1 .  A  model  system  of  the  CT  mechanism  of 
the  partial  addition  of  hydrogen  atom  into  an  insulator 
which  consists  of  LiH  and  BeH2  and  some  possible 
transition  states. 


where 
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In  Eq.  (15a),  the  first  and  second  terms  are  kinetic 
and  electron  interaction  parts,  respectively. 
means  electron-lattice  (positive  charge)  interac¬ 
tion.  Hi  is  the  Coulomb  interaction  between  lat¬ 
tices  (positive  charges).  and  indicate 

electron-phonon  and  lattice  (positive  charge)- 
phonon  interactions,  respectively.  is  the 

creation  (annihilation)  operator  for  an  electron  at 
the  r-th  site  with  a  spin,  is  the  creation  opera¬ 
tor  for  a  state  (positive  charge)  at  the  z-th  site, 
which  is  shown  in  Figure  11(b)  and  (d).  is  the 
creation  operator  for  a  state,  which  is  different 
from  the  above  state,  as  shown  in  Figure  11(c),  The 
operators  are  satisfied  with  the  anticommutation 
relation,  ,  which  is  a  boson,  is  the  phonon  cre¬ 
ation  operator. 

In  particular,  in  the  underdoped  system  as 
shown  in  Figure  12,  the  above  Hamiltonian  is 
approximated  as 

H  =  E  -  V’  E  E 

r,s  or  r,s,t,u  a ,  p 

(16) 


where 


the  following: 

(18) 

where  A  is  the  CT  excitation  energy  and  C  is  a 
constant.  We  expect  that  increases  above  300  K 
(see  Table  II),  because  the  CT  excitation  energy 
equals  to  about  1-3  eV. 

On  the  other  hand,  for  the  overdoped  state, 
another  model  Hamiltonian  is  considered: 

H  =  +  H;,  +H,_^  (19) 

where 
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Tr^s  =  trs  -  E  +  (KK>)-  (17) 
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E 
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The  second  term  in  Eq.  (15a)  and  H;  of  Eq.  (15c) 
are  negligible.  Therefore,  the  transition  tempera¬ 
ture  for  this  approximation  may  be  estimated  as 
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FIGURE  12.  Underdoped  model  system  constructed 
of  alkali-  and  alkali-earth  metals. 


where  is  the  creation  (annihilation)  opera¬ 

tor  for  an  electron  at  the  r-th  site,  the  beryllium 
site,  with  a  spin,  b^^  is  the  creation  operator  for  a 
positive  charge  (hole)  at  the  f-th  site  with  a  spin, 
where  the  label  i  is  the  lithium  site.  The  operators 
are  satisfied  with  the  anticommutation  relation. 
The  first  and  second  terms  in  Eq.  (19a)  mean  the 
kinetic  and  electron  interaction  parts,  respectively. 
In  Eq.  (19b),  the  first  and  second  are  the  kinetic 
and  Coulomb  interactions  for  the  hole.  is  the 
Coulomb  interaction  between  the  electron  and  hole. 

indicates  that  an  electron  (negative  charge)  and 
a  hole  vanish.  This  means  that  the  charged  state 
becomes  neutral  as  shown  in  Figure  11(a),  This 

TABLE  II _ 


Three  mediums  and  7^  for  superconductors. 


Medium 

Maximum  (K) 

Phonon 

30 

Spin 

100 

Charge 

300 
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model  will  reduce  to  a  metal  model,  which  may  be 
a  metallic  hydrogen  model,  at  the  limit  of  a  large 
amount  of  a  doping  hydrogen  atom. 

DESIGN  OF  NONMETAL  TERNARY  SYSTEMS 

The  magnitude  of  the  CT  excitation  energy  ( 
is  one  of  the  possible  parameters  to  elucidate  pos¬ 
sible  organic  systems  which  are  isoelectronic  to 
copper  oxides.  Previously  [5],  we  already  exam¬ 
ined  the  CT  excitation  energies  between  nonmetal 
second  and  third  periodic  elements.  The  A  value 
for  the  sulfur  (S)  and  nitrogen  (N)  pair,  namely, 
transition  energy  from  SN  to  S'^N”,  is  only  0.1  eV, 
and  therefore  the  (SN)^  polymer  corresponds  to 
the  overdoped  copper  oxide  [2],  which  is  a  normal 
metal  without  spin  fluctuation.  The  normal  BCS- 
type  superconductivity  is  expected  for  the  species, 
in  accord  with  the  experiment.  On  the  other  hand, 
the  CT  energies  (A^^)  for  the  CS,  SiS,  PS,  and  CN 
pairs  are  0.6- 1.0  eV,  compatible  with  the  undoped 
copper  oxide  [58].  Therefore  the  ID  polymers  (I), 
ladders  (II),  and  sheet  (III)  consisted  of  these  pairs 
in  Figure  10  could  be  CT  semiconductors/or  insu¬ 
lators.  The  introduction  of  holes  or  electrons  into 
these  systems  by  doping  may  induce  the  metal- 
insulator  transitions  as  in  the  case  of  copper  ox¬ 
ides.  For  example,  the  synthesis  of  ternary  solids, 
(X)^(C)y(N)2,  (X)^(Si)y(S)2,  and  so  on  (X  =  halogen 
atoms,  etc.)  would  be  interesting  targets  since  the 


solid-state  synthesis  under  the  high  pressure  was 
developed  recently  [59].  Then,  we  may  have  the 
following  hypothesis  H7: 

(H7)  Charge-transfer  models  for  the  high-T^  supercon¬ 
ductivity  in  Figure  9(A)  are  conceivable  in  the  ternary 
solids  such  as  (H)jA)jB)^  and  (X)JC)y(N)^. 


Models  for  Photo-induced 
Molecular  Materials 

PHOTOINDUCED  ORGANIC  AND 
ORGANOMETALLIC  CT  FERRO- 
AND  FERRIMAGNETS 

Photo-induced  phase  transitions  have  been  con¬ 
sidered  as  an  advantage  of  molecule-based  materi¬ 
als.  From  this  viewpoint,  the  donor-acceptor  com¬ 
plexes  would  be  interesting  targets  for  magnetic 
modification  [16].  Possibilities  of  organic  charge- 
transfer  (CT)  ferro-  and/or  ferrimagnets  were  ex¬ 
amined  based  on  the  ab  initio  computations  of 
donor-acceptor  (D-A)  complexes  with  radical 
groups  [16].  The  intra-  and  intermolecular  effective 
exchange  couplings  between  ion  radicals  were  ex¬ 
amined  theoretically  as  illustrated  in  Figure  5  in 
[16].  Possibilities  of  photo-induced  or  photo- 
switched  organic  magnets  were  also  proposed  as 
illustrated  in  Figure  13  [16].  To  this  end,  spin 
sources  were  introduced  into  compounds  exhibit- 


potential  energy  Potential  energy 


FIGURE  13.  Photo-induced  or  photo-switched  organic  magnetic  materials. 
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ing  the  neutral-ionic  transitions  by  photoexcita¬ 
tion:  The  TTF-chloranil  system  having  a  radical 
substituent  is  illustrated  as  an  example  in  [18]. 
Similarly,  magnetic  oligomers  such  as  poly(phen- 
ylene  vinylene)  [60],  etc.,  can  be  used  as  a  donor  or 
acceptor  for  photo-induced  CT  magnets  [18]. 

Very  recently,  Hashimoto  et  al.  [61]  demon¬ 
strated  that  the  magnetic  properties  of  Prussian 
blue  analogs  (MXM',  X  =  CN^")  are  controllable 
by  the  electrochemical,  photochemical,  and  other 
techniques,  namely,  magnetic  phase  transitions 
were  induced  by  the  external  perturbations.  Their 
results  lend  support  our  previous  attempt,  e.g.,  see 
[15].  The  success  of  the  photo-induced  magnetism 
[61]  in  the  Prussian  blue  analogs  may  support 
possibilities  of  photo-induced  organic  magnets 
as  well. 

The  key  issues  in  these  p-d,  77-d,  and  tt-R 
systems  are  small  metal-ligand  CT  (MLCT)  and 
metal-metal  CT  (MMCT)  excitations  in  Figure  13. 
Thus,  we  may  have  the  following  hypothesis  8: 

(H8)  Photo-induced  molecular  magnetism  and  tunable 
magnets  by  external  fields  are  expected  in  p-d,  ir-d, 
and  tt-R  systems  with  small  MLCT  and  MMCT  ener¬ 
gies  in  Figure  3. 

POSSIBILITIES  OF  PHOTO-INDUCED 

SUPERCONDUCTORS 

It  is  significant  to  consider  possibilities  of  the 
photo-induced  superconductors  in  an  intermediate 
regime  for  the  metal-insulator  transition.  Let  us 
consider  a  model  system  as  shown  in  Figure  14(a). 
The  phase  of  the  ground  state,  which  is  described 
as  region  I  as  shown  in  Figure  14(b),  is  an  insula¬ 
tor.  By  irradiating  the  photon,  electrons  transit  to 
the  excited  states  which  are  in  region  II  or  III  as 
shown  in  Figure  14(b).  Region  III,  which  is  a  con¬ 
tinuous  level  region,  becomes  the  metal  phase, 
and,  on  the  other  hand,  region  II,  which  has  a  very 
small  energy  gap  between  levels,  will  be  chaotic. 
We  point  out  that  analogously  to  the  above  discus¬ 
sions  of  superconductivity  in  the  intermediate  re¬ 
gion  of  metal-insulator  transitions  the  photo- 
induced  superconductive  phase  may  be  found  in 
region  II  but  for  solving  the  mechanism  of  the 
coupling  Cooper  pair.  Thus,  we  may  have  the 
following  hypothesis  9. 

(H9)  Photo-induced  superconductivity  by  external  fields 
are  also  expected  in  a  semi-continuous  level  band  in 
Figure  14. 


FIGURE  1 4.  A  model  of  photo-induced  superconductor 
and  three  regions  of  excited  levels:  (a)  a  lattice  model  of 
an  insulator;  (b)  energy  levels.  Discrete  energy  levels  are 
in  region  I.  Regions  II  and  III  describe  semicontinuous 
and  continuous  levels,  respectively. 


Discussions  and  Concluding  Remarks 

The  discovery  of  high-T^  superconductors  [1] 
enable  us  to  investigate  possible  organic  and 
organometallic  analogs  to  copper  oxides.  Previ¬ 
ously  [3-5],  we  thoroughly  examined  some  possi¬ 
ble  model  Hamiltonians  for  copper  oxides.  N-band 
(from  two  to  six  bands)  models  for  copper  oxides 
were  examined  by  exact  diagonalization  of  the 
cluster  models.  From  these  computational  results, 
CT  insulators  with  strong  superexchange  interac¬ 
tions  were  proposed  as  possible  precursors  for 
high-T^  superconductors  and  exotic  molecular 
magnets.  Some  typical  examples  described  by  the 
p-d,  TT-d,  and  tt-R  Hamiltonians  were  also  dis¬ 
cussed  from  both  charge  and  spin  fluctuation 
mechanisms  for  the  high-T^  superconductivity. 
Judging  from  the  cutoff  energies,  the  maximum 
would  be  30,  100,  and  300  K  for  phonon-,  spin-, 
and  charge-mediated  superconductors  as  shown  in 
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Table  II.  During  the  past  7  years,  we  have  discov¬ 
ered  and  investigated  many  experimental  conclu¬ 
sions  on  the  basis  of  the  working  hypotheses  pre¬ 
sented  previously  [3-5],  In  this  article,  we  have 
tried  to  explain  these  newly  obtained  experimental 
results  from  theoretical  viewpoints.  The  /  model 
[5]  was  extended  so  as  to  reproduce  many  experi¬ 
mental  results  [2].  As  cited  in  many  articles  in 
the  references,  some  theoretical  computations 
were  carried  out  to  examine  guiding  principles  for 
constructions  of  functional  materials  and  active 
controls  of  various  properties  of  molecule-based 
materials.  Since  theoretical  predictions  of  crystal 
structures  for  complex  systems  discussed  here 
were  difficult  tasks,  numerical  calculations  of  real¬ 
istic  models  were  performed  assuming  possible 
lattices  in  Figure  10.  These  computational  results 
and  available  experiments  cited  in  this  article  en¬ 
able  us  to  propose  several  working  hypotheses 
(H1)-(H9).  For  the  purpose,  a  theoretical  formula¬ 
tion  for  the  CT  model  was  presented.  More  de¬ 
tailed  theoretical  formulations  for  these  hypothe¬ 
ses  will  be  given  elsewhere. 
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ABSTRACT:  We  present  potential  energy  surfaces  for  two  four-atom  systems,  HOCO 
and  Ar-HCO.  In  the  former  system  a  previous,  reduced  dimensionality  2  degree-of- 
freedom  model  of  the  OH  -I-  CO  ^  H  -t-  CO2  reaction  is  extended  by  inclusion  of  the 
nonreactive  CO  stretch.  In  the  latter  system,  extensive  ab  initio  calculations  are  done  for 
19  HCO  geometries  and  120-150  Ar-HCO  planar  configurations,  for  a  fixed  HCO 
geometry.  In  both  systems  a  Shepard-type  representation  of  the  global  potential  is 
presented  in  the  former  system  in  3  degrees  of  freedom,  and  in  the  latter  system  in  the 
full  6  degrees  of  freedom.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  965-973, 1997 


Introduction 

It  is  perhaps  too  obvious  to  state  the  importance 
of  potential  energy  surfaces  for  dynamic  stud¬ 
ies  of  molecular  interactions.  Not  only  are  poten¬ 
tials  essential,  but  in  many  instances  they  are  more 
important  than  the  choice  of  dynamical  approach, 
i.e.,  classical  or  quantum.  During  the  past  20  years, 
and  especially  in  the  past  decade,  a  number  of  ab 
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initio  potential  energy  surfaces  have  been  devel¬ 
oped  for  triatomic  systems.  It  is  fairly  routine  now 
to  perform  high-quality  ab  initio  calculations  at 
sufficiently  many  nuclear  geometries,  of  the  order 
of  1000  (corresponding  to  roughly  10  points  per 
degree  of  freedom),  to  determine  a  global  potential 
for  triatomics.  The  potential  is  typically  obtained 
by  representing  the  full  potential  by  a  functional 
form  with  a  number  of  linear  and  nonlinear  pa¬ 
rameters  which  are  determined  by  a  least-squares 
procedure. 

This  highly  successful  approach  for  triatomic 
systems  has  obvious  and  severe  limitations  for 
larger  systems.  For  example,  for  a  four-atom  sys- 
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tern  the  grid  of  geometries  corresponding  to  the 
modest  number  of  10  points  per  degree  of  freedom 
would  result  in  10^  geometries  at  which  electronic 
energies  would  be  calculated.  This  is  clearly  pro¬ 
hibitive  for  current  ab  initio  methods.  Even  if  one 
thinks  about  bypassing  a  potential  function  and  is 
willing  to  attempt  electronic  structure  calculations 
"on-the-fly/'  i.e.,  exactly  at  grid  points  where  they 
are  needed  by  the  dynamics,  the  problems  are  at 
best  very  severe  for  classical  dynamics  and  again 
prohibitive  for  quantum  dynamics. 

Thus,  other  strategies  are  needed,  and  have 
been  developed,  for  polyatomic  systems  (where 
henceforth  '"polyatomic"  refers  to  systems  of  four 
or  more  atoms).  In  this  study  we  describe  novel 
applications  of  a  Shepard-type  interpolation  meth¬ 
od  to  two  quite  different  systems  that  illustrate 
different  objectives.  One  is  a  reduced  dimensional¬ 
ity  model  of  the  OH  -h  CO  ->  H  +  CO2  reaction, 
and  the  other  is  a  nonreactive  four-atom  system, 
Ar-HCO. 

In  this  type  of  interpolation  a  data  base  of 
localized  and  widely  distributed  (in  configuration 
space)  representations  of  the  potential  energy  sur¬ 
face  is  used  to  obtain  the  surface  globally,  by 
means  of  switching  functions.  This  approach  ap¬ 
pears  to  have  been  used  first  to  obtain  a  global 
representation  of  the  HCO  potential  [1].  In  that 
approach  ab  initio  calculations  of  the  electronic 
energy  were  done  for  30  widely  different  geome¬ 
tries.  Sufficient  calculations  were  done  at  each  ge¬ 
ometry  to  obtain  a  local  fourth-order  force  field, 
and  the  collection  of  force  fields  was  then  used  to 
interpolate  via  switching  functions  to  obtain  the 
full  potential  globally.  (That  representation  was 
then  used  to  obtain  an  efficient  three-dimensional 
spline  representation  of  the  suface.)  More  recently, 
this  general  approach  was  adopted  for  a  study  of 
the  OHH2  potential  surface  [2].  In  this  case,  Hes¬ 
sians  were  obtained  at  several  hundred  geometries 
and  used  as  a  database  to  obtain  a  potential  cen¬ 
tered  around  the  minimum  energy  path. 

A  potentially  serious  problem  with  a  database 
of  force  fields  in  a  Shepard-type  interpolation  is 
with  extrapolation.  That  is,  if  one  evaluates  the 
potential  at  points  in  configuration  space  that  are 
distant  from  any  of  the  force  fields,  the  result  will 
be  the  extrapolated  value  of  a  force  field,  which 
may  contain  spurious  oscillations  or  singularities. 
(By  definition,  force  fields  are  local  representations 
of  the  potential,  and  their  behavior  beyond  their 
radius  of  convergence  may  be  quite  unphysical.)  In 


the  present  work  we  will  specifically  address  this 
problem  and  show  how  it  can  be  overcome. 

In  the  next  section  we  describe  the  new  applica¬ 
tion  of  Shepard-type  interpolation  to  a  reduced 
dimensionality  model  of  the  reaction  OH  +  CO 
H  +  CO2.  Following  that  we  present  a  new  6  de- 
gree-of-freedom  potential  surface  for  Ar-HCO, 
based  on  high-quality  ab  initio  electronic  energies. 
A  summary  and  conclusions  are  given  at  the  end. 


Reduced  Dimensionality  Potential  for 
OH  +  CO  ^  H  +  CO2 

The  goal  here  is  to  construct  a  reduced  dimen¬ 
sionality,  3  degree-of-freedom  (3-DOF)  potential 
energy  surface  for  the  OH  -h  CO  ^  H  +  CO2  re¬ 
action  suitable  for  subsequent  quantum  calcula¬ 
tions.  The  coordinates  we  consider  are  the  OH 
internuclear  distance,  the  CO  internuclear 

distance,  Tqq,  and  K,  the  distance  between  the 
centers  of  mass  of  the  OH  and  CO  fragments.  All 
previous  quantum  calculations  of  the  reaction  dy¬ 
namics  separated  the  nonreactive  CO  bond  from 
the  dynamics  [3-9],  and  our  goal  is  to  insert  that 
mode  back  into  the  dynamics,  although  still  within 
a  reduced  dimensionality  framework.  Our  motiva¬ 
tion  for  doing  this  is  twofold.  First,  resonances 
(due  to  complex  HOCO  formation),  which  domi¬ 
nate  the  reaction  dynamics,  in  which  the  nonreac¬ 
tive  CO  bond  is  temporarily  excited  were  missing 
in  the  previous  calculations.  Second,  in  these  calcu¬ 
lations  the  description  of  the  vibrational  motion  of 
CO2  is  highly  approximate  because,  as  is  well 
known,  the  two  CO  bonds  in  CO2  mix  strongly  to 
form  normal  modes  of  vibration. 

The  first  quantum  studies  of  this  reaction  (in 
which  the  nonreactive  CO  mode  was  uncoupled) 
were  done  using  a  2-DOF  model  in  which  the 
potential  was  given  in  terms  of  the  coordinates 
Tqh  arid  R  [3].  In  that  model  the  'full  6-DOF  poten¬ 
tial  [10]  was  minimized  for  fixed  values  of  Tqh 
and  K,  and  with  r^o  adiabatically  separated  so 
that  the  zero-point  motion  corresponding  to  the 
nonreactive  CO  bond  was  added  to  the  minimized 
2-DOF  potential. 

Initially,  our  idea  was  to  incorporate  the  CO 
bond  by  minimizing  the  same  6-DOF  with  respect 
to  the  3  degrees  of  freedom,  jR,  Toh/  and  Tqq) 
however,  we  encountered  isolated  regions  in  con¬ 
figuration  space  where  the  minimization  proce- 
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dure  gave  unstable  results  for  extended  values  of 
Vqq.  To  overcome  this  problem,  we  adopted  the 
following  strategy.  In  regions  in  R  and  Tqh  where 
the  potential  behaves  smoothly  with  respect  to  the 
variation  of  the  CO  distance  we  minimized  the 
b'DOF  potential  for  a  given  R  and  and  for 
Tqq  =  2.2  a.u.  (its  value  at  the  HOCO  minimum). 
We  then  varied  r^o  in  the  range  1. 8-3.2  a.u  and  fit 
the  resulting  potential  to  a  Morse  functional  form, 
i.e.. 


^irse(^Co)  =  ,  (1) 


where  i  refers  to  a  specific  point  with  coordinates 
(R\  ^oh)-  did  106  such  fits  at  points  shown  in 
Figure  1.  These  points  are  centered  around  the 
reaction  path  for  OH  +  CO  ^  H  +  CO2,  but  avoid 
the  regions  were  the  potential  did  not  give  satis¬ 
factory  results.  Then  we  represented  the  CO-stretch 
at  any  value  of  R  and  by  the  following  Shep¬ 
ard-like  interpolation: 


Hrco,  ,  (2) 

E,ty;(roH,  R) 

where  is  a  weight  given  by 

(3) 


with 


We  varied  the  range  parameter  over  a  wide 
range  [subject  to  the  condition  that  Eq.  (2)  reduces 
to  within  a  few  percent  of  V^^orse^^co)  the  ith 
point],  and  found  that  our  fit  is  insensitive  to  its 
exact  value.  This  robustness  of  the  interpolation  is 
due  at  least  in  part  to  the  fact  that  the  Morse 
potentials  do  not  vary  greatly  over  the  grid  of 
points  in  R-Vq^i  space. 

We  should  note  that  our  objective  is  not  to 
reproduce  the  potential  at  selected  points.  Clearly, 
by  using  a  Morse  to  fit  the  potential  and  then 
using  the  Shepard-like  switching  among  the  Morse 
fits,  the  final  result  is  in  a  sense  twice  removed 
from  the  original  data.  In  general,  the  errors  in  the 
Morse  fits  are  much  greater  than  the  errors  intro¬ 
duced  by  switching  method.  With  this  in  mind  we 
examined  the  precision  of  Eq.  (2)  by  comparing  it 
to  the  actual  potential  in  the  CO  stretch  at  a  value 
of  R  and  not  among  the  fits.  The  result  is 
shown  in  Figure  2  for  one  example.  As  seen,  the 
interpolation  does  provide  a  realistic  representa¬ 
tion  of  the  CO  stretch  potential. 

The  final  3-DOF  potential  was  then  constructed 
by  combining  the  previous  2-DOF  potential  [3] 
with  roH/  However,  since  that  2-DOF 

potential  contained  the  local  zero-point  energy  of 
the  CO,  we  first  had  to  subtract  that  energy  using 
our  Tqh,  R)  potential,  and  then  the  new 

3-DOF  potential  is  given  by 

^CO/  ^OH/  ^OH/  f 

(5) 


SiiR.r, 


OH 


OH 


^OH 


f  +  (R-R‘)\  (4) 


where  ¥2^  is  the  previous  2-DOF-dimensional  po¬ 
tential  of  Schatz  and  Dyck  [3]  minus  the  local  CO 
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FIGURE  1.  Points  where  the  CO  stretch  potential  was 
fit  to  a  Morse  function. 


FIGURE  2.  Comparison  of  the  actual  CO  potential 
(circles)  at  =  2.0  a.u.  and  R  =  4.8  a.u.  and  the  CO 
potential  from  the  Shepard  interpolation. 
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zero-point  energy.  Figure  3  shows  a  2-DOF  con¬ 
tour  plot  of  the  new  surface  at  Tqq  =  2.5  a.u.  The 
potential  is  smooth  and  shows  a  single  well  corre¬ 
sponding  to  the  cis-HOCO  minimum  as  well  as  a 
small  barrier  in  coordinate  which  is  consistent 
with  the  2-DOF  surface  of  Schatz  and  Dyck.  Figure 
4  shows  a  2-DOF  contour  plot  at  fixed  R  =  3.73 
a.u  (the  value  of  R  at  the  cis-HOCO  minimum). 
The  minimum  around  r^o  =  2.2  a.u.  is  very  dis¬ 
tinct  and  corresponds  to  the  stable  cis-HOCO  con¬ 
figuration. 

Finally,  we  note  that  the  zero-point  energy  of 
the  CO  vibration  extracted  from  the  2-DOF  effec¬ 
tive  potential  of  Schatz  and  Dyck  [3]  is  signifi¬ 
cantly  lower  than  the  experimental  results  for  the 
asymptotic  OH  -h  CO  channel,  roughly  900  cm"^ 
compared  to  1100  cm"^  experimentally.  We  have 
calculated  the  zero-point  energy  of  CO  vibration 
using  the  new  3-DOF  potential  of  Eq.  (5),  and  it  is 
in  within  50  cm"^  of  the  experimental  value. 

The  present  application  of  a  Shepard-like  inter¬ 
polation  to  generate  a  3-DOF  potential  from  a 
2-DOF  one  has  potentially  wide  applicability  to 
polyatomic  systems  in  which  there  are  many  non¬ 
reactive  degrees  of  freedom.  If  the  potential  in 
these  degrees  of  freedom  can  be  represented  as 
sums  of  functions  such  as  Morse  functions,  then 
perhaps  a  relatively  small  number  of  determina¬ 
tions  of  these  functions  would  suffice  to  obtain  a 
realistic  global  representation  based  on  the  Shep¬ 
ard-like  interpolation  in  these  degrees  of  freedom. 


FIGURE  3.  Equipotential  contour  plot  in  R  and  Toh  for 
r^o  =  2.5  a.u.  Contour  values  are  in  electron  volts. 


r  (CO),  a.11. 

FIGURE  4.  Equipotential  contour  plot  of  the  new 
potential  for  R  =  3.73  a.u.  Contour  values  are  in  electron 
volts. 


Next  we  consider  another,  quite  different  appli¬ 
cation  of  a  Shepard-like  interpolation  to  the  Ar- 
HCO  system  in  all  6  degrees  of  freedom. 


Six  Degree-of-Freedom  Interaction 
Potential  Surface  for  Ar-HCO 

As  noted  in  the  Introduction  it  is  impossible  to 
do  ab  initio  calculations  in  a  straightforward  six¬ 
dimensional  scan  for  a  four-atom  potential  surface, 
so  some  other  strategy  is  needed  to  construct  a 
reasonable  potential  surface.  In  the  present  case, 
the  system  of  interest  is  Ar-HCO,  and  the  potential 
will  be  used  in  scattering  study  of  collisional  vi¬ 
bration  energy  transfer  and  collision-induced  dis¬ 
sociation/recombination  of  HCO.  Thus,  the  poten¬ 
tial  surface  needs  to  include  a  fairly  wide  variation 
in  the  HCO  geometry.  At  the  very  least,  it  must 
sample  the  geometry  around  the  HCO  minimum, 
and  the  most  important  energetic  pathway  to  dis¬ 
sociation.  In  addition,  since  the  interaction  poten¬ 
tial  is  of  the  van  der  Waals  type,  many  geometries 
of  the  Ar  atom  relative  to  a  given  HCO  geometry 
must  be  considered. 

With  the  above  requirements  in  mind,  we 
adopted  the  following  strategy  to  construct  a  rea¬ 
sonable  six  degree-of-freedom  potential.  For  a 
given  HCO  geometry  we  calculated  the  electronic 
interaction  energy  for  120-150  Ar  geometries  in 
the  HCO  plane.  Those  energies  were  then  fit  to  a 
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functional  form,  and  then  the  various  fits,  corre¬ 
sponding  to  19  HCO  geometries,  were  used  in  a 
Shepard-type  interpolation  to  obtain  a  representa¬ 
tion  for  the  global  potential.  As  above,  the  objec¬ 
tive  is  not  to  reproduce  the  calculated  ab  initio 
points.  The  Shepard-type  interpolation  is  done  on 
the  fits  to  points,  and  in  general  the  errors  in  these 
fits  are  much  greater  than  the  errors  introduced  by 
the  interpolation  of  the  fits. 


Ab  initio  Calculations 

The  basis  set  and  the  level  of  correlation  and 
tests  of  both  have  been  described  in  detail  in  Ref. 
[11].  Briefly,  single  reference,  configuration  interac¬ 
tion  (CISD)  calculations,  using  double-zeta  and 
triple-zeta  basis  sets,  and  counterpoise  correction 
were  done  using  the  code  MOLPRO  [12].  The 
coordinates  we  used  are  body  fixed,  spherical  co¬ 
ordinates  with  the  origin  at  the  center  of  mass  of 
CO,  with  the  z  axis  along  CO,  and  H  in  the  xz 
plane,  as  shown  in  Figure  5.  The  CO  distance  is 
denoted  by  r,  the  position  of  H  to  the  center  of 
mass  of  CO  is  (R,  y),  and  in  HCO  plane  the 
position  of  Ar  is  (Q,  0). 

For  each  HCO  geometry,  6  varied  from  0  to 
180°  in  equal  intervals  of  18°  for  </>  =  0  and  also  for 
(j)  =  180°.  For  each  0,  6  to  7  Q's  were  selected 
according  to  the  shape  of  the  potential  cuts.  Thus, 
ab  initio  calculations  were  done  at  between  120 
and  150  points  per  HCO  geometry.  The  19  HCO 
geometries  considered  are  given  in  Table  I. 


TABLE  I _ 

HCO  geometries  chosen  for  the 
ab  initio  calculation. 


Geom.  No. 

R  (bohr) 

r  (bohr) 

y  (degree) 

1 

3.043 

2.288 

34.773 

2 

4.213 

2.153 

47.261 

3 

5.669 

2.288 

34.773 

4 

3.043 

2.288 

64.773 

5 

3.043 

2.288 

49.773 

6 

3.043 

2.288 

19.773 

7 

3.043 

2.488 

34.773 

8 

3.043 

2.088 

34.773 

9 

3.043 

3.088 

34.773 

10 

7.559 

2.143 

34.773 

11 

4.213 

2.153 

57.261 

12 

4.213 

2.153 

37.261 

13 

4.213 

2.303 

47.261 

14 

4.213 

2.003 

47.261 

15 

3.590 

2.173 

40.000 

16 

3.590 

2.173 

55.000 

17 

3.590 

2.173 

25.000 

18 

3.590 

2.373 

40.000 

19 

3.590 

1.973 

40.000 

In  order  to  make  some  global  comparisons  be¬ 
tween  the  ab  initio  electronic  energies  and  the  fits 
to  them,  described  below,  we  fit  several  sets  of 
Ar-HCO  energies  very  precisely  by  a  somewhat 
complex  functional  form  described  earlier  [11].  Po¬ 
lar  contour  plots  of  the  interaction  potential  (based 
on  these  precise  fits)  are  given  in  Figures  6”8  for 
the  HCO  minimum  (geometry  1),  the  H — CO  tran¬ 
sition  state  (geometry  2),  and  nearly  dissociated  H 
— CO  (geometry  3),  respectively.  As  seen,  there  is 
a  van  der  Waals  minimum  at  a  cis  T-shaped  geom¬ 
etry  that  is  nearly  independent  of  the  H  atom 
position. 


Fits  of  the  Planar  Ar-HCO 
Interaction  Potentials 

A  major  factor  in  considering  a  functional  form 
to  fit  the  ab  initio  energies  for  a  fixed  HCO  geome¬ 
try  is  the  subsequent  use  of  the  fits  in  a  Shepard- 
type  representation  of  the  global  surface.  As  men¬ 
tioned  in  the  Introduction,  a  danger  in  using  this 
type  of  representation  is  possible  unphysical  be¬ 
havior  of  the  potential  in  regions  away  from  any  of 
the  fits.  Thus,  it  is  important  to  choose  a  physically 
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FIGURE  6.  Equipotential  polar  contour  plot  of  ab  initio 
interaction  potential  of  Ar  with  equilibrium  HCO.  Contour 
values  are  in  cm“\ 


reasonable  functional  form  for  the  fits,  even  if  this 
entails  some  sacrifice  in  the  precision  of  the  fit. 
With  this  in  mind  we  chose  to  represent  the  Ar- 
HCO  interaction  potential  as  the  following  sum- 
of-pairs  form  for  a  fixed  HCO  geometry,  r„,  %) 
and  for  planar  Ar-HCO  configurations: 

^Ar-H  /  ^Ar-C  /  ^Ar-O ^  ^  ^A f-X  ^  ^Ar-X  ^ 

X=H,C,0 


FIGURE  7.  Same  as  Fig.  6,  but  for  with  HCO  at 
transition  state  geometry. 


FIGURE  8.  Same  as  Fig.  6,  but  for  Ar  with  HCO  at  a 
nearly  dissociated  geometry. 


where 

^A%(^Ar-x)  =  C{%exp(-Ci%R^,.^) 

Ci%[l  -  exp  - 

Rir-X 

(7) 

The  superscript  corresponds  to  the  nth  HCO  ge¬ 
ometry.  As  seen,  there  are  two  linear  and  one 
nonlinear  parameters  per  two- 

body  potential  for  a  total  of  nine  parameters.  They 
were  determined  by  nonlinear  least  square  fitting 
to  the  ab  initio  energies  separately  for  each  HCO 
geomtery. 

The  average  fit  error  ranges  from  60  to  150 
cm“\  depending  on  the  HCO  geometry,  and  over 
a  range  of  interaction  energies  that  varies  from 
roughly  - 100  to  7000  cm"h  The  fitting  error  scales 
roughly  with  the  value  of  the  interaction  energy. 
Polar  contour  plots  of  the  fitted  interaction  poten¬ 
tials  for  three  HCO  geometries,  corresponding  to 
the  ab  initio  ones  shown  in  Figures  6-8,  are  shown 
in  Figures  9-11.  Comparing  these,  one  sees  that 
the  fitted  potentials  are  reasonable  representations 
of  the  ab  initio  ones.  Further  examination  of  the 
fits  are  shown  in  Figure  12  for  the  cis  planar 
Ar-HCO  geometry,  and  for  an  out-of-plane  config¬ 
uration,  for  HCO  at  its  equilibrium  geometry.  As 
seen,  the  fit  for  the  in-plane  geometry  is  quite 
good,  and  the  one  for  the  out-of-plane  one  is  not  as 
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FIGURE  9.  Same  as  Fig.  6  but  for  the  fitted  sum-of-pairs 
potential. 

good.  This  is  not  surprising  given  that  the  fit  was 
made  to  the  ab  initio  energies  only  for  the  in-plane 
Ar-HCO  configuration. 


Global  Surface 

The  global  6-DOF  potential  surface  is  given  by 
the  following  Shepard-type  representation: 

^Ar-C/  ^Ar-O'  T) 

=  12  ^Ar-H  '  -^Ar-C  /  ^Ar-O^ 

n  I  n 

where  the  weight  function  is,  as  above,  a  simple 
Gaussian 

w„  =  exp[-ai(R  -  -  a2(r  -  r„f 

-cL^iy-y^f].  (9) 

The  three  range  parameters  were  chosen  so  that 
the  global  potential  is  smooth  and  close  to  the  fit 
when  the  HCO  geometry  is  close  to  nth  geometry. 
After  some  trial  and  error  we  determined  that  the 
values,  =  2.0  bohrs“^,  a2  =  25.0  bohrs”^,  a3  = 
17.5  rads~^,  satisfied  the  above  criteria  well.  (These 
orders  of  magnitude  of  the  range  parameters  were 
mainly  determined  by  the  spacing  in  fits  in  the 
corresponding  coordinates.) 


-15 

-15  -IB  -5  B  5  IB  15 
K(bohr) 

FIGURE  1 0.  Same  as  Fig.  7  but  for  the  fitted 
sum-of-palrs  potential. 

In  order  to  test  the  global  potential  surface,  a 
number  of  comparisons  of  the  surface  were  made 
with  additional  ab  initio  calculations.  In  Figure  13 
we  compare  the  global  surface  with  ab  initio  ener¬ 
gies  for  two  planar  Ar-HCO  configurations,  as  R 
is  varied  over  a  large  range,  with  r  and  y  at  their 
values  for  HCO  at  equilibrium.  As  seen,  the  agree¬ 
ment  is  reasonable  and  physically  correct,  i.e.,  the 


Hr-(H— CO) 


FIGURE  1 1 .  Same  as  Fig.  8  but  for  the  fitted 
sum-of-pairs  potential. 
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Q  (bohr) 

FIGURE  12.  Comparison  of  the  interpolated  potential 
(curve)  with  ab  initio  (circles)  energies  as  R  is  varied  for 
=  2.15  bohrs  and  y  =  40°,  for  the  two  indicated  Ar 
geometries. 


FIGURE  13.  Comparison  of  the  interpolated  potential 
(curve)  with  ab  initio  (circles)  energies  for  HCO  at 
equilibrium  for  in-plane  and  out-of-plane  Ar-HCO 
geometries.  The  in-plane  Interpolated  potential  is  the 
sum-of-two-body  fit  to  the  ab  initio  energies. 


interaction  is  more  repulsive  in  the  cis  Ar-HCO 
geometry  (0  =  0)  than  in  the  trans  one  (0  =  180°). 
Another  test  is  for  the  out-of-plane  interaction. 
This  is  shown  in  Figure  14  for  HCO  at  equilibrium, 
and  two  values  of  ^  for  Q  =  5.67  a.u.  As  seen,  the 
behavior  of  the  global  surface  is  quite  reasonable. 

This  potential  surface  is  available  upon  request 
to  JMB. 


Summary  and  Conclusions 

Shepard-type  interpolation  has  been  used  in  two 
four-atom  systems.  In  the  first,  a  3  degree-of-free- 
dom  reduced  dimensionality  potential  was  de¬ 
scribed  for  the  OH  -h  CO  ^  H  +  CO2  reaction. 
The  surface  was  built  up  from  an  existing  2  de¬ 


gree-of-freedom  potential  by  the  addition  of  the 
nonreactive  CO  stretch.  We  minimized  the  6  de- 
gree-of-freedom  potential  for  fixed  values  of  Tqh 
and  R  and  then  fit  the  CO  dependence  of  the 
potential  to  a  Morse  function.  The  CO  dependence 
of  the  potential  at  an  arbitrary  value  of  R  and  Tqh 
was  determined  by  Shepard  interpolation.  This 
surface  is  currently  being  used  to  explore  the  role 
of  this  stretch  in  HOCO  resonances  [13]. 

The  second  four-atom  system  considered  was 
Ar-HCO.  In  this  application,  high-quality  ab  initio 
electronic  energies  were  calculated  for  19  HCO 
geometries  and  120-150  planar  Ar-HCO  geome¬ 
tries  for  each  fixed  HCO  geometry.  The  energies 
were  fit  using  a  physically  motivated  sum  of  two- 
body  potentials.  The  global  surface  was  given  again 
by  a  Shepard  interpolation  in  the  HCO  coordi- 
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^  (deg) 

FIGURE  14.  Comparison  of  the  Interpolated  potential 
(curve)  with  ab  initio  (circles)  energies  for  HCO  at 
equilibrium  as  a  function  of  0,  and  for  the  two  Ar-HCO 
geometries  indicated. 

nates.  Several  tests  of  the  interpolated  surface  were 
presented  and  demonstrated  that  the  global  sur¬ 
face  is  a  good  representation  of  the  ab  initio  ener¬ 
gies.  This  potential  is  currently  being  used  in 
quantum  calculations  of  energy  transfer,  dissocia¬ 
tion,  and  recombination  in  Ar-HCO  collisions  [14]. 

In  both  examples  above,  we  chose  physically 
motivated  representations  of  fits  to  the  energetics, 
for  HOCO  to  an  existing  6  degree-of-freedom  po¬ 


tential,  and  for  Ar-HCO  to  ab  initio  calculations. 
These  are  in  general  better  behaved  than  fits  that 
have  been  used  in  previous  Shepard-type  interpo¬ 
lations;  however,  at  the  expense  of  not  giving  very 
precise  fits  to  the  original  potential  points. 
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ABSTRACT:  Ionic  materials  can  be  conveniently  described  by  quantum  mechanical 
cluster  models.  In  these  models  the  electrostatic  field  of  the  crystal  surrounding  is  often 
taken  into  account  by  embedding  the  cluster  in  an  array  of  point  charges  (PCs).  However, 
anions  at  the  cluster  boundary  can  be  strongly  polarized  by  neighboring  positive  PCs. 
These  artifacts  can  be  significantly  reduced  if  a  model  potential  description  is  employed 
for  the  nearest-neighbor  cations  of  the  surrounding.  We  have  implemented  a  scheme  of 
total  ion  model  potentials  (TIMP)  in  the  density  functional  code  LCGTO-DF.  In  the 
present  work  we  compare  a  number  of  electronic  properties  of  MgO(OOl)  models,  like 
core-level  splittings,  electron  affinity,  and  electrostatic  potential,  computed  with  the  help 
of  TIMP  embedding  to  results  obtained  for  the  same  clusters  surrounded  by  PCs  only. 
The  cluster  models  range  from  MgO|“  to  Mg25025.  Adsorption  of  a  CO  molecule  on 
TIMP  embedded  cluster  models  of  a  regular  surface  site  and  the  three-coordinated  corner 
Mg^^  site  is  also  considered.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  975-986, 
1997 
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Introduction 

Chemical  substances  in  the  solid  state  can  of¬ 
ten  be  efficiently  described  by  quantum  me¬ 
chanical  cluster  models.  Systems  with  strongly 
ionic  bonds,  like  alkali  halides  and  alkaline-earth 
oxides,  the  properties  of  which  are  dominated  by 
long-range  Coulomb  interaction,  are  also  favorably 
studied  by  cluster  models.  A  simple  but  widely 
used  method  to  account  for  electrostatic  effects  of 
the  crystal  surrounding  is  to  embed  a  model  clus¬ 
ter  in  an  array  of  point  charges  (PCs).  This  ap¬ 
proach  also  allows  one  to  reproduce  the  electro¬ 
static  potential  outside  the  cluster  in  an  accurate, 
yet  economic  fashion.  This  is  of  particular  impor¬ 
tance  for  studies  of  adsorption  complexes  on 
strongly  ionic  substrates  which  are  commonly 
formed  mainly  due  to  electrostatic  interactions. 
However,  a  disadvantage  of  the  point  charge  em¬ 
bedding  is  that  PCs  on  the  cluster  boundaries  can 
significantly  distort  the  electron  density  of  ions  at 
the  cluster  edges.  Therefore,  the  accuracy  of  the 
calculations  depends  on  the  distance  between  the 
"chemically  interesting"  central  region  of  the  clus¬ 
ter  and  its  boundaries.  In  order  to  shift  the  bound¬ 
aries  far  enough  one  may  be  forced  to  employ 
extended,  i.e.,  rather  expensive  cluster  models. 
Clearly,  anions  with  their  diffuse  and  easily  polar¬ 
izable  electronic  shells  will  be  distorted  much 
stronger  by  neighboring  positive  PCs  at  the  cluster 
boundary  than  the  more  compact  and  less  polariz¬ 
able  cations  by  negative  PC  neighbors.  Thus,  eval¬ 
uation  and  subsequent  rectification  of  spurious 
anion-PC  effects  on  calculated  observables  repre¬ 
sent  key  problems  in  cluster  modeling  of  ionic 
metal  oxides. 

The  outlined  artifacts  can  be  diminished  by 
substituting  model  potential  (or  pseudopotential) 
cations  for  positive  PCs  closest  to  cluster  anions.  In 
this  way,  it  is  possible  to  restore  the  repulsive 
contribution  of  electron  shells  of  the  anions  and 
cations  which  is  missed  by  an  embedding  in  a 
simple  array  of  PCs.  Less  efficient  from  the  com¬ 
putational  viewpoint  is  the  "brute  force"  alterna¬ 
tive  whereby  all  positive  PCs  near  the  cluster 
borders  are  replaced  by  all-electron  cations. 

The  first  applications  of  pseudopotentials  in 
cluster  models  [1,  2]  were  devoted  to  the  descrip¬ 
tion  of  impurities  in  alkali  halide  crystals  where 
Hartree-Fock  (HF)  based  model  potentials  by 


Huzinaga  et  al.  [3]  and  complete-cation  normcon- 
serving  pseudopotentials  [4]  were  used.  A  similar 
technique  was  found  valuable  to  describe  bulk 
properties  of  alkaline-earth  oxides  [5,  6].  Recently, 
the  general  procedure  on  how  to  perform  calcula¬ 
tions  on  embedded  clusters  under  the  assumption 
of  a  "frozen"  environment  and  the  orthogonality 
requirements  for  the  wave  function  of  the  cluster 
and  the  environment  have  been  discussed  in  detail 
[7,  8]. 

CO  molecules  adsorbed  on  MgO  have  fre¬ 
quently  been  considered  as  a  prototype  system  for 
representing  metal  oxide  surface  complexes  [9]. 
The  properties  of  CO/MgO  cluster  models  embed¬ 
ded  in  an  array  of  PCs  have  been  studied  exten¬ 
sively,  at  both  the  HF  and  post-HF  levels  [10,  11] 
as  well  as  by  density  functional  (DF)  methods 
[12-14].  There  is  now  general  agreement  that  CO 
binds  to  the  MgO  surface  mainly  due  to  electro¬ 
static  interaction.  In  the  case  of  PC  embedding,  the 
calculated  MgO-CO  binding  energy  varies  no¬ 
tably  with  the  size  of  the  model  cluster  [14].  Also, 
the  electrostatic  potential  above  in  MgO  surface  in 
the  region  of  the  adsorbate  has  been  found  to 
depend  strongly  on  shape  and  size  of  the  model 
cluster;  this  effect  was  traced  to  the  polarization  of 
anions  at  the  cluster  borders  [15].  Recently  pub¬ 
lished  cluster  model  investigations  of  MgO  based 
on  ab  initio  model  potentials  are  free  of  this  defi¬ 
ciency  [8,  16,  17].  However,  no  such  model  poten¬ 
tial  study  has  been  reported  so  far  within  the 
framework  of  DF  methods. 

The  present  work  describes  an  implementation 
of  the  pseudopotential  (PP)  technique  in  the 
LCGTO-DF  package  (next  section).  This  method 
has  been  applied  to  improve  cluster  models  of 
MgO  by  using  total  ion  model  potentials  (TIMP)  to 
substitute  positive  PCs  at  the  cluster  boundaries. 
The  models  and  calculational  details  are  outlined 
in  the  third  section.  The  present  study  aims  at 
testing  the  TIMP  technique  within  a  DF  implemen¬ 
tation  rather  than  at  carrying  out  a  comprehensive 
investigation  on  a  physical  or  chemical  problem  of 
an  ionic  material.  Therefore,  MgO  has  been  chosen 
as  a  substrate  which  has  been  well  studied  theoret¬ 
ically  and  experimentally.  The  calculational  results 
presented  demonstrate  how  significantly  features 
of  the  cluster  models  can  change  due  to  TIMP 
improved  boundary  conditions.  One-electron 
properties  of  model  clusters  and  the  electrostatic 
potential  near  an  ideal  MgO(OOl)  surface  have 
been  calculated  with  the  help  of  TIMP  and  PC 
embedded  models.  Adsorption  of  CO  molecules 
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on  cations  on  the  regular  MgO(001)  surface 

and  on  three-coordinated  defect  sites  has  also  been 
considered;  Results  of  DF  TIMP  calculations  will 
be  compared  to  the  corresponding  ab  initio  data 


OUTLINE  OF  THE  COMPUTATIONAL 
TECHNIQUE:  TOTAL  ION 
MODEL  POTENTIALS 

TIMP  embedding  aims  at  overcoming  the  limi¬ 
tations  of  a  simple  PC  embedding  scheme  for 
cluster  models  of  ionic  substrates  by  approxi¬ 
mately  accounting  for  orthogonality  effects  of  clus¬ 
ter  electronic  states  with  respect  to  states  of  ions  in 
the  environment.  PC  embedding  accounts  only  for 
the  electrostatic  field  of  ions  which,  at  large  dis¬ 
tances,  can  be  to  good  accuracy  treated  as  the  field 
of  point  charges.  However,  if  one  does  not  take  the 
finite  size  of  the  cations  into  account,  the  electron 
density  of  the  cluster  will  spread  to  neighboring 
positive  point  charges.  The  '"frozen"  electron  den¬ 
sity  of  these  cations  is  included  in  the  TIMP 
description,  its  Pauli  repulsion  counteracts  the  at¬ 
tractive  potential  of  the  cation  charge  within  the 
effective  ionic  radius.  For  the  practical  realization 
of  TIMPs  used  here,  the  PP  approach  of  replacing 
the  core  electrons  in  calculations  involving  heavier 
atoms  is  modified.  All  electrons  of  the  model  ions 
are  included  in  the  pseudopotential  and  no  basis 
functions  are  placed  on  these  centers.  In  the  fol¬ 
lowing,  a  brief  review  of  angular  specified  norm- 
conserving  PPs  [4,  18,  19]  used  in  this  work  is 
given  to  introduce  their  application  for  construct¬ 
ing  TIMPs. 

In  the  all-electron  linear  combination  of  Gauss- 
ian-type  orbitals  DF  (LCGTO-DF)  method  [20]  the 
nonrelativistic  Kohn-Sham  equation  can  be  writ¬ 
ten  as 

+  Kx.  +  p]  +  Kc[  P,Vp]}.A,  = 

(1) 

where  is  the  nuclear  potential,  is  the  Hartree 
(classical  Coulomb)  potential  of  the  electron  den¬ 
sity  p,  and  is  the  exchange-correlation  (XC) 
potential.  The  one-particle  Kohn-Sham  (KS)  or¬ 
bitals  are  expanded  in  a  basis  of  Gaussian-type 
functions 

(2) 

k 

and  these  basis  functions  are  set  up  as  sym¬ 


metrized  contractions  (linear  combinations)  of 
primitive  functions 

xl  =  Cl,(r,-d,<p)e--^^\  (3) 

where  C',(  r,  -d,  <p)  is  a  real  solid  harmonic  (a  har- 
monic  polynomial). 

In  a  density  functional  PP  calculation,  the  inter¬ 
action  of  electrons 

=  V,[  p]  +  p]  (4) 

is  replaced  by  the  interaction  of  valence  electrons 
and  a  pseudopotential  which  accounts  for  the 
core  electrons, 

V/e  =  V,[  pj  +  VJ  pj  +  Vp(r),  (5) 

where  p^  denotes  the  valence  electron  density.  The 
pseudopotential  in  a  molecular  calculation  is  set 
up  as  a  superposition  of  individual  atomic  contri¬ 
butions 

■  y/r)  = 

A 


According  to  the  angular-momentum-specified 
normconserving  PP  approach  [14,  18],  an  atomic 
pseudopotential  can  be  written  as 

L-l  I 

=  +  E  \lm)Om\.  (7) 

/  =  0  m=-l  ■ 

Here  L  is  chosen  so  that  L-l  equals  the  maxi¬ 
mum  angular  momentum  of  the  atomic  one-elec¬ 
tron  orbitals  which  belong  to  the  frozen  core.  \lm} 
denotes  the  spherical  harmonic  In  a  DF  for¬ 
malism  the  first  term  of  Eq.  (7),  is  the  same  for 
all  electrons  with  I  >  L;  it  originates  from  the 
Coulomb  and  exchange-correlation  interaction  of 
the  valence  electrons  with  the  core  shells.  We  note 
that  this  term  is  a  local  one  in  contrast  to  the 
angular  momentum  specified  terms  which  contain 
the  projection  operators  |Zm>(/m|.  These  latter  terms 
account  for  the  orthogonality  of  the  valence  or¬ 
bitals  with  Z  <  L  to  the  core  orbitals  and  contain 
the  proper  PPs  1/;^,  I  <  L. 

The  PPs  are  determined  by  solving  the  atomic 
radial  equations 


/  1  ^  Z(Z  +  1) 

\  2  ^  ’’’  Ir^ 


—  +  Kt  Pv^ 


+  V,,[p,  +  pf]+V,^y,'’(r) 

=  (8) 
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where  tj/fir)  is  the  radial  part  of  a  nodeless  pseudo 
valence  orbitals,  constructed  from  the  exact  va¬ 
lence  orbital  following  Ref.  [18].  ej  denotes  the 
atomic  one-electron  eigenvalue  of  a  valence  orbital 
of  angular  momentum,  /,  and  T  is  the  nuclear 
charge  decreased  by  the  charge  of  the  core  elec¬ 
trons.  Thus,  is  constructed  to  reproduce  the 
exact  valence  one-electron  energies  for  electrons  in 
the  field  of  a  shielded  nuclear  charge,  but  interact¬ 
ing  with  the  valence  electron  density.  The  frozen 
core  electron  density  p/  is  included  in  the  XC 
potential  to  account  for  the  nonlinear  character  of 
this  quantity  [18]. 

To  apply  the  formalism  sketched  above  to 
TIMPs,  all  electrons  of  a  model  center  are  included 
in  the  PP  and  L  -  1  is  chosen  equal  to  the  highest 
angular  momentum  of  the  occupied  orbitals.  To 
determine  Y^ ,  pseudo  orbitals  of  unoccupied  lev¬ 
els  are  employed  for  each  angular  momentum  1 
which  is  present  in  the  occupied  orbital  manifold. 

In  the  LCGTO-DF  implementation,  the  poten¬ 
tials  Yi^  are  generated  in  a  numerical  atomic  DF 
calculation  and  are  used  in  tabular  form.  The  core 
density  p^  is  determined  as  the  difference  of  the 
total  density  from  an  all-electron  atomic  calcula¬ 
tion  and  the  density  due  to  the  pseudo  valence 
orbitals.  To  describe  the  Coulomb  part  of  the  local 
term  1//,  it  is  represented  by  a  set  of  s-type  Gauss¬ 
ian  functions.  The  coefficients  as  well  as  the  Gauss¬ 
ian  exponents  are  determined  by  means  of  a  non¬ 
linear  least-squares  fit  according  to  the  gradient 
extremum  search  method  [21]. 

To  evaluate  matrix  elements  (  A'/c/  l^/^l  Y?  )  of  fho 
nonlocal  terms  of  Y^^  basis  functions  Xk 
centered  on  atom  B  are  projected  onto  spherical 
harmonics  centered  on  the  PP  center  A  [22]. 
The  projection  of  a  simple  Gaussian  function  is 
given  by 


The  projection  onto  complex  solid  harmonics  cen¬ 
tered  at  A  is  given  by  [24] 

[Cii(r-b)]„„ 

=  I  /L  +  M\(L-M\ 

^  \  21  + l[  I +  tnj\  I- mj 

1<L  (11) 

where  |  ^  j  is  a  binomial  coefficient.  The  resulting 

products  of  spherical  harmonics  centered  at  the 
same  atom  [Eqs.  (9)  and  (11)  substituted  in  Eq.  (3)] 
are  coupled  according  to  [24] 

-  (12) 

LM 

with  being  a  Gaunt  coefficient. 

To  determine  forces  in  a  calculation  employing 
PPs  or  TIMPs  displacement  gradients  of  the  matrix 
elements  of  with  respect  to  atomic  centers  are 
required;  they  yield  contributions  which  parallel 
those  of  the  nuclear  potential.  The  differentiation 
of  the  projected  basis  functions  with  respect  to 
positions  of  all-electron  centers  is  straightforward 
and  analytical  expressions  have  been  derived.  Par¬ 
tial  derivatives  with  respect  to  the  coordinates  of 
pseudo  centers  can  be  obtained  by  exploring  the 
translational  invariance  of  the  matrix  elements. 

Although  it  does  not  constitute  a  special  prob¬ 
lem  to  construct  TIMPs  for  anions  employing  the 
DF  representation  of  the  exchange-correlation  op¬ 
erator  [6],  in  this  work  we  use  only  TIMPs  for 
Mg^”^  cations  and  represent  the  oxygen  anions  by 
PCs.  There  are  two  reasons  for  this  decision:  (i)  the 
deficiency  of  conventional  models  with  embed¬ 
ding  of  the  all-electron  (AE)  cluster  into  a  PC  array 
has  been  attributed  mainly  to  the  distortion  of 
anions  at  the  cluster  borders  by  positive  PCs  at¬ 
tracting  electron  density  [16],  while  the  effect  of 


where  r  =  (r,  fi);  vector  b  denotes  the  position  of  negative  PCs  on  relatively  compact  AE  cation 

center  B  in  the  coordinate  system  of  a  PP  center  neighbors  is  of  subordinate  importance;  (ii)  since 

A,  and  ij  is  a  modified  spherical  Bessel  function  free  O^"  anions  do  not  exist,  additional  approxi- 

[23].  The  real  solid  harmonic  C^,  can  be  easily  mations,  like  application  of  a  Watson  sphere  [6]  or 

represented  as  a  linear  combination  of  complex  embedding  into  a  bulk  crystal  [25],  are  required 

solid  harmonics  [24]  for  generating  such  a  TIMP.  We  have  generated  a 

PP  for  the  neutral  Mg  atom  (3s^3p^  electron  con- 
1/2  figuration).  Thus,  the  frozen  density  of  the  Mg^^ 

qI  =  I  ^  I  yiy  (10)  TIMP  corresponds  to  the  shells  Is^,  2s^,  and  2p^ 

'”  \  2/  +  1  /  of  the  neutral  Mg  atom. 
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Computational  Details 

In  the  present  study  AE  cluster  models  of  regu¬ 
lar  surface  sits  of  MgCXOOl)  range  from  MgOf"  to 
Mg25  025;  smaller  models  are  all  subsystems  of  the 
latter  cluster.  The  Mg-O  distance  in  all  models 
was  taken  from  the  experimental  MgO  bulk  geom¬ 
etry  (2.104  A).  To  simulate  the  Madelung  field  of 
the  crystal  environment,  the  clusters  were  embed¬ 
ded  in  PC  arrays  of  dimension  17  X  17  X  2  which 
allows  us  to  consider  both  Mg  and  O  adsorption 
sites  within  the  same  model.  If  the  PC  array  is 
extended  to  a  thicker  slab  (17  X  17  X  6),  only  neg¬ 
ligible  alterations  of  calculated  adsorption  proper¬ 
ties  are  observed.  For  these  PCs  the  charge  value 
for  an  ideally  ionic  crystal,  ±2.0  a.u.,  was  as¬ 
sumed.  A  more  strict  approach  would  involve 
computing  the  lattice  potential  through  an  explicit 
Ewald  summation  over  the  infinite  crystal  [26-28]. 
Fortunately,  the  Madelung  field  can  be  very  satis¬ 
factorily  approximated  by  the  field  of  a  finite  set  of 
PCs  [29].  In  case  of  low-coordinated  defect  sites 
(e.g.,  three-coordinated  corner  sites  of  MgO)  the 
PC  representation  of  the  crystal  field  is  actually  the 
only  reasonable  choice. 

In  the  new,  more  sophisticated  model  variant 
all  cations  that  are  either  nearest  or  next-nearest 
neighbors  of  the  outer  AE  cluster  atoms  (at  the 
distance  of  2.104  and  2.976  A,  respectively)  were 
described  by  TIMPs.  Cations  situated  at  larger 
distances  were  still  modeled  by  PCs.  Calculations 
of  the  Mg909  cluster  with  AE  Mg^"^  cations  lo¬ 
cated  in  the  positions  previously  occupied  by 
TIMPs  in  the  corresponding  model  (the  cluster 
formula  then  is  Mg909Mgi6'^)  were  also  carried 
out  for  comparison.  To  model  three-coordinated 
Mg^"^  sites  of  MgO,  a  cubic  array  10  X  10  X  10 
point  charges  was  used.  The  clusters  Mg404  and 
Mg03“  at  the  corner  of  this  cube  were  treated  as 
AE  parts  which  were  surrounded  by  TIMPs  ac¬ 
cording  to  the  strategy  just  outlined  for  models  of 
regular  sites. 

Orbital  basis  sets  and  contraction  coefficients, 
(15sl0pld)  [6s5pld]  for  Mg  (the  numbers  of 
primitive  Gaussians  are  given  in  parenthesis,  the 
numbers  of  contracted  basis  functions  in  brackets) 
and  (13sSpld)  ->  [6s5pld]  for  O,  have  been  taken 
from  our  previous  study  of  transition-metal  atom 
adsorption  on  a  Mg(lOO)  surface  [30].  For  the  atoms 
of  the  CO  adsorbate  molecule,  (9s5p2d) 
[SsApld]  basis  sets  have  been  employed  [14].  The 


orbital  bases  were  contracted  using  atomic  eigen¬ 
vectors  obtained  in  the  local  density  approxima¬ 
tion  (EDA). 

To  optimize  the  position  of  the  CO  molecule 
over  the  MgO(OOl)  surface,  calculations  for  differ¬ 
ent  adsorption  heights  were  carried  out,  keeping 
the  C-O  distance  fixed  at  the  value  optimized  for 
the  free  molecule:  The  equilibrium  height  was  then 
determined  by  fitting  these  energies  to  a  polyno¬ 
mial.  Then,  to  find  the  harmonic  frequency  of  the 
C-O  vibration  and  the  equilibrium  C-O  bond 
length,  the  position  of  the  CO  center  of  mass  was 
fixed  and  five  points  for  different  C-O  distances 
were  calculated  (A^  =  0.0;  ±0.05;  ±0.1  Bohr, 
where  is  the  displacement  from  equilibrium 
bond  length  of  free  CO)  and  fitted  by  polynomials. 
To  test  the  implementation  of  the  analytical  energy 
gradients  for  the  pseudopotential  technique,  the 
geometry  optimization  and  the  frequency  calcula¬ 
tions  were  repeated  with  analytical  gradients.  Both 
methods  gave  the  same  results  within  0.001  A  and 
1-2  cm“^  for  geometrical  parameters  and  har¬ 
monic  frqeuencies,  respectively. 

The  auxiliary  fitting  basis  sets  used  in  the 
LCGTO-DF  method  to  represent  the  electron 
charge  density  and  the  exchange-correlation  po¬ 
tential  during  the  self-consistency  cycles  in  single¬ 
point  calculations  were  constructed  from  the  or¬ 
bital  exponents  in  a  standard  fashion  [20].  For  the 
"'post-SCF"  (self-consistent  field)  evaluation  of  the 
total  energy,  as  well  as  during  the  self-consistency 
cycles  of  geometry  optimiation  and  frequency  cal¬ 
culations  employing  analytical  energy  gradients, 
the  exchange-correlation  term  was  calculated  us¬ 
ing  an  accurate  numerical  integration  procedure 
[31]. 

The  local  density  approximation  to  the  ex¬ 
change-correlation  functional  as  suggested  by 
Vosko,  Wilk,  and  Nusair  (VWN)  [32]  was  used  to 
obtain  one-electron  energies  of  the  cluster  models 
presented  below.  To  calculate  electron  affinities 
and  ionization  potentials,  the  total  energy  was 
determined  evaluating  the  XC  contribution  with 
the  help  of  the  generalized  gradient  approximation 
(GGA)  to  the  exchange  energy  as  suggested  by 
Becke  [33]  and  to  the  correlation  energy  as  sug¬ 
gested  by  Lee,  Yang,  and  Parr  (BLYP)  [34],  em¬ 
ploying  the  self-consistent  LDA  density.  For  the 
automatic  geometry  optimizations,  the  total  en¬ 
ergy  and  the  analytical  energy  gradients  were  cal¬ 
culated  within  the  self-consistently  employed 
GGA;  two  variants  were  applied,  BLYP  and  the 
Becke-Perdew  form  (BP,  Becke  exchange  [33], 
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Perdew  correlation  [35]).  The  standard  counter¬ 
poise  technique  [36]  was  applied  to  correct  binding 
energies  for  the  basis  set  superposition  error 
(BSSE). 


Results  and  Discussion 

TIMP  VERSUS  PC  EMBEDDING: 

ONE-ELECTRON  PROPERTIES  OF 

CLUSTER  MODELS 

As  mentioned  in  the  introduction,  the  oxygen 
anions  in  cluster  models  of  MgO  are  strongly  po¬ 
larized  by  neighboring  PCs.  In  Figure  1  we  display 
charge  density  plots  for  TIMP  [Fig.  1(a)]  and  PC 
embedded  [Fig.  1(b)]  Mg909  clusters  along  with 
the  corresponding  charge  difference  plot  [Fig.  1(c)]. 
A  density  decrease  in  the  TIMP  model  with  re¬ 
spect  to  the  PC  case  is  marked  by  dashed  lines. 
One  can  clearly  see  that  all  oxygen  anions  in  the 
TIMP  embedded  model  exhibit  a  regular  spherical 
shape,  while  in  the  PC  embedded  model  the  an¬ 
ions  at  the  cluster  border  are  of  ellipsoid-like  shape 
where  the  charge  density  extends  toward  neigh¬ 
boring  positive  PCs.  The  charge  difference  plot 
[Fig.  1(c)]  clearly  identifies  a  nonzero  electron  den¬ 
sity  around  the  nearest-neighbor  positive  point 
charges  in  the  PC  embedded  model.  This  charge 
density  is  transferred  from  the  oxygen  anion  due 
to  an  artificially  strong  attraction  of  the  naked 
positive  charge  which  is  not  balanced  by  repulsion 
from  core  electrons  around  the  cation. 

Let  us  now  consider  how  this  distorted  electron 
density  affects  one-electron  properties  of  the  mod¬ 
els.  In  the  PC  embedded  models  the  one-electron 
levels  lie  lower  than  those  in  the  models  with 
TIMP  embedding  (Table  I)  because  the  attractive 
potential  due  to  the  surrounding  positive  PCs  is 
overestimated.  Both  core  and  valence  levels  are 
affected.  The  band  gap  of  3.8“4.4  eV  as  measured 
by  the  highest  occupied  molecular  orbital-lowest 
unoccupied  molecular  orbital  (HOMO-LUMO) 
splitting  calculated  for  the  models  with  TIMP  em¬ 
bedding  is  in  reasonable  agreement  with  the  result 
(3.0  eV  at  the  T  point)  of  an  X  a  band  structure 
calculation  on  a  two-layer  MgO  slab  [37].  How¬ 
ever,  for  well-understood  reasons  [38],  the  present 
LDA  value  is  significantly  smaller  than  the  experi¬ 
mental  value  of  7.8  eV  [39].  The  energies  of  the 
LUMO  (-1.2  or  -1.5  eV)  and  HOMO  (-5.4  to 
—  5.6  eV;  Table  I)  are  also  quite  close  to  the  corre¬ 
sponding  values  at  the  T  point  of  a  two-layer  slab 


FIGURE  1 .  Electron  density  contour  maps  for  the 
cluster  model  MggOg  in  the  plane  containing  the  fourfold 
symmetry  axis  and  the  O  —  Mg  —  O  unit:  (a)  TIMP 
embedding;  (b)  PC  embedding;  (c)  density  difference 
Ap  =  p(Mg909,TIMP)  -  pCMggOg,  PC).  The  contour 
values  in  panels  (a)  and  (b)  are  0.64,  0.16,  0.045,  0.01, 
and  0.0025,  an  additional  contour  line  of  0.0008  a.u.  Is 
used  in  panel  (c);  dashed  lines  indicate  negative  values. 

band  structure  (LUMO  -1.5  eV,  HOMO  -4.5  eV 
[37]).  The  shift  of  the  cluster  LUMO  to  low  ener¬ 
gies  in  PC  embedded  models  can  be  rationalized 
by  the  finding  that  this  orbital  is  mainly  localized 
on  the  most  perturbed  border  atoms.  The  large 
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TABLE  I _ 

Properties  of  PC,  TIMP,  and  all-electron  embedded  cluster  models  of  the  MgO(001)  surface/ 


Type  of  embedding 

PC 

TIMP 

Mg2+  AE 

Property 

MggOs 

MggOg 

^925^25 

MggOg 

MggOg 

M925O25 

e(LUMO) 

-8.78 

-6.82 

-7.40 

-1.17 

-1.15 

-1.54 

-1.46 

€(H0M0) 

-9.02 

-9.69 

-9.51 

-5.55 

-5.43 

-5.37 

-5.68 

Ae(HOMO-LUMO) 

0.24 

2.87 

2.11 

4.38 

4.28 

3.83 

4.22 

EA 

6.54 

5.05 

6.00 

-0.47 

-0.33 

0.18 

1.62 

IP 

11.33 

11.42 

10.73 

7.77 

6.93 

6.44 

7.31 

01s  levels 

6  (central  atom) 

-506.86 

-507.99 

-507.64 

-503.42 

-503.48 

-503.59 

-503.70 

e(edge  atoms) 

-508.76 

-508.46 

-507.76 

-503.37 

-503.48 

-503.58 

-503.72 

€  (corner  atoms) 

— 

-508.75 

-507.84 

— 

-503.44 

-503.56 

-53.72 

A6 

1.90 

0.76 

0.20 

0.05 

0.04 

0.03 

0.02 

Mg  Is  levels 

e  (central  atom) 

-1252.12 

-1253.12 

-1252.35 

-1247.96 

-1248.11 

-1248.18 

-1248.34 

e(edge  atoms) 

-1250.04 

-1252.06 

-1252.54 

-1247.92 

-1248.04 

-1248.16 

-1248.27 

e  (corner  atoms) 

— 

-1250.89 

1252.71 

— 

-1248.02 

-1248.16 

-1248.22 

Ae 

2.08 

2.23 

0.36 

0.04 

0.09 

0.02 

0.12 

®  HOMO  and  LUMO  one-electron  energies  (e),  electron  affinities  (EA),  ionization  potentials  (IP),  1s  core  level  energies  (e),  and  their 
spread  Ae  (for  the  largest  cluster  restricted  to  the  subsystem  MggOg).  All  energies  in  eV. 


splitting  of  Is  core  levels  in  PC  embedded  models, 
especially  in  the  smaller  clusters  Mg505  (about  1.9 
and  2.1  eV;  Table  I)  and  Mg909  (about  0.8  and  2.2 
eV),  clearly  demonstrates  the  inequivalence  of 
atoms  of  the  same  kind  located  at  different  cluster 
positions.  TIMP  embedding  reduces  this  level 
splitting  to  less  than  0.1  eV  (Table  I).  Ideally,  ions 
of  one  sort  belonging  to  the  same  layer  of  a  regular 
MgO(OOl)  surface  should  have  identical  core  level 
energies. 

As  a  consequence  of  the  low-lying  LUMO,  clus¬ 
ter  models  with  simple  PC  embedding  manifest  an 
artificially  enhanced  electron  affinity  (EA),  e.g., 
5.05  eV  for  Mg909.  Here,  EA  is  determined  as  the 
difference  of  calculated  total  energies  of  the  neu¬ 
tral  embedded  cluster  and  its  anion,  EA  = 

“  E(anion)*  The  EA  values  for  two  additional  elec¬ 
trons  calculated  at  the  BLYP  level  for  the  PC 
embedded  clusters  Mg505  and  Mg909  are  8.9  and 
7.1  eV,  respectively.  This  implies  that  the  border 
oxygen  anions  exhibit  a  marked  oxidative  power. 
We  note  in  passing  that  no  significant  electron 
charge  transfer  has  been  found  in  studies  on  small 
adsorbates  (e.g.,  on  transition  metal  atoms  [30]) 
deposited  above  the  central  0^~  ion  of  PC  embed¬ 
ded  clusters.  Nevertheless,  using  PC  embedded 
models  for  investigations  of  larger  adsorbed 
species,  like  metal  clusters,  can  lead  to  erroneous 
results  if  no  special  care  is  taken  to  keep  the 


cluster  borders  far  enough  from  the  adsorption 
site.  The  EA  overestimation  is  rectified  in  TIMP 
embedded  cluster  models  (Table  I). 

The  calculated  ionization  potentials  (IPs)  of  PC 
embedded  models  of  about  11  eV  are  higher  by  2 
eV  than  the  experimental  value  of  9.1  eV  [39], 
while  the  IPs  in  TIMP  models  of  about  7  eV  are 
lower  than  experiment  by  the  same  amount  (Table 
I).  It  is  unclear  at  present  to  which  extent  the 
inaccuracy  of  the  calculated  IPs  is  caused  by  the 
DF  method  itself  or  by  limitations  of  the  cluster 
models  employed.  Note  that  increasing  the  cluster 
"thickness"  by  additional  PC  layers  does  not  lead 
to  any  significant  changes. 

We  conclude  this  comparison  of  PC  and  TIMP 
embedded  cluster  models  by  noting  that  the  TIMP 
embedding  procedure  very  satisfactorily  repro¬ 
duces  all  cluster  properties  discussed  here  as 
judged  by  the  results  for  the  AE  correspond¬ 
ing  cluster  models  (cf.  TIMP  Mg909  and  AE 
MggO^Mg?^^;  Table  I). 

ELECTROSTATIC  POTENTIAL  ABOVE 

THE  SURFACE 

It  is  generally  accepted  that  the  electrostatic 
potential  plays  a  key  role  in  interactions  of  adsor¬ 
bates  with  ionic  substrates.  Two  recent  theoretical 
investigations  are  devoted  to  the  study  of  the 
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electrostatic  potential  above  the  MgCXOOl)  surface 
in  different  cluster  models.  Ferrari  and  Pacchioni 
[15]  carried  out  HF  calculations  on  PC  embedded 
models  of  five-coordinated  and  0^“  sites  at 

a  regular  surface  and  on  three-coordinated 
corner  sites.  They  found  that  in  PC  embedded 
models  the  electrostatic  potential  above  regular 
sites  depends  significantly  on  the  cluster  size  and 
is  rather  different  from  that  obtained  in  periodic 
slab  HF  calculations.  In  particular,  in  PC  embed¬ 
ded  models  the  electrostatic  potential  above  a 
Mg^^  site  is  much  stronger  and  decays  slower 
with  the  increasing  distance  from  the  surface  than 
in  the  periodic  model.  At  t5^ical  adsorbates  dis¬ 
tances  above  sites  the  potential  is  even  of  the 
wrong  sign.  Nygren  et  al  [8]  studied  models  of 
regular  surface  Mg^*^  and  O^”  sites  with  the  help 
of  both  PC  and  TIMP  embedding  approaches  us¬ 
ing  the  HF  method.  They  found  that  values  of  the 
electrostatic  potential  obtained  in  TIMP  models  of 
various  size  are  rather  similar  and  above  Mg^"^ 
sites  the  electrostatic  potential  is  close  to  that  one 
generated  by  a  single  Mg^^  cation  surrounded  by 
an  infinite  PC  array. 

Here  we  consider  the  electrostatic  potential  only 
above  Mg^^  sites  of  regular  (001)  surface  of  MgO. 
Figure  2  shows  the  dependence  of  the  electrostatic 
potential  felt  by  a  positive  unit  charge  at  variable 


height  above  the  central  Mg^^  cation  of  different 
cluster  models.  The  point  charge  array  of  the  size 
used  in  the  cluster  embedding,  with  all  ions  repre¬ 
sented  by  PCs  of  charge  +  2.0  a.u.,  has  been  taken 
as  a  reference.  The  resulting  electrostatic  potential 
(Fig.  2,  solid  line)  is  quite  similar  to  that  obtained 
from  HF  calculations  on  a  slab  model  [15]  or  a 
Mg^'^  cation  surrounded  by  an  infinite  array  of 
PCs  [8].  Since  magnesium  oxide  is  a  highly  ionic 
system  with  a  small  covalent  contribution  and 
nearly  complete  charge  separation,  a  good  quan¬ 
tum  chemical  model  should  mimic  to  a  significant 
extent  the  electrostatics  of  the  ideal  PC  representa¬ 
tion. 

The  PC  embedded  models  Mg505  and  Mg909 
(diamonds  and  squares,  respectively)  yield  an 
electrostatic  potential  much  larger  than  that  of  a 
PC  array.  Also  the  electrostatic  potential  of  these 
two  cluster  models  extends  to  significantly  larger 
distances,  featuring  a  rather  slow  decay  which  is 
quite  different  from  that  of  the  reference  system. 

At  variance  with  the  results  of  PC  embedded 
models,  the  TIMP  strategy  yields  electrostatic  po¬ 
tential  curves  that  are  very  similar  to  each  other 
and  to  the  reference  curve  of  the  ideal  PC  system, 
but  are  somewhat  lower  in  value  than  the  latter. 
At  distances  typical  for  the  location  of  adsorbed 
molecules  (CO  in  Fig.  2)  the  TIMP  curves  are  even 


FIGURE  2.  Electrostatic  potential  felt  by  a  positive  charge  at  distance  r  above  the  central  cation  in  different 
cluster  models  of  MgO(001).  Typical  values  for  the  calculated  positions  of  the  nuclei  of  adsorbed  CO  are  indicated  by 
arrows.  The  results  for  different  models  are  marked  as  follows:  17  x  17  x  2  PC  array  (solid  line).  MgOl"  TIMP  (a), 
MggOg  TIMP  (O),  MggOgMg?!^  (x),  Mg^O^  PC  (O).  Mg^Og  PC  (□). 
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below  zero.  This  implies  that  the  corresponding 
model  potential  for  electrons  is  slightly  more  re¬ 
pulsive,  although  the  absolute  values  of  the  elec¬ 
trostatic  potential  are  very  small.  The  electrostatic 
potential  curves  for  Mg505  and  Mg909  TIMP  em¬ 
bedded  models  agree  to  graphical  accuracy;  there¬ 
fore,  only  the  curve  for  the  latter  model  is  shown 
in  Figure  2  (circles).  The  lower  electrostatic  poten¬ 
tial  compared  to  the  reference  system  can  be  at¬ 
tributed  to  a  stronger  screening  of  the  central  Mg^"^ 
cation  by  the  electron  density  of  very  diffuse  shells 
of  neighboring  0^“  anions.  The  electrostatic  po¬ 
tential  produced  by  the  model  Mg909Mgi5'^ 
(crosses)  where  all  16  TIMPs  are  substituted  by  AE 
Mg^^  cations  is  slightly  higher  than  the  reference 
potential  due  to  a  small  charge  transfer  from  the 
Mg909  subsystem  into  valence  orbitals  of  the  16 
cations.  Not  unexpectedly,  all  electrostatic  poten¬ 
tials  calculated  at  the  DF  level  are  in  good  agree¬ 
ment  with  the  electrostatic  potentials  obtained  for 
similar  models  with  the  help  of  the  HF  method  [8, 
15]. 

CO  ADSORPTION  ON  REGULAR 

SURFACE  SITES 

The  interaction  of  an  MgO  substrate  with  an 
adsorbate  like  CO  provides  a  good  test  of  the 
TIMP  DF  models.  The  adsorption  of  CO  on  the 
regular  MgO(100)  surface  has  been  frequently 
studied  theoretically  (see,  e.g.,  [9]  and  references 
therein).  Most  of  these  cluster  studies  employed 
the  PC  embedding  approach.  The  TIMP  technique 
was  applied  only  recently  in  modified  coupled 
pair  functional  (MCPF)  calculations  where  the 
electron  correlation  was  treated  with  high  accuracy 
[16,  17]. 

One  of  the  most  important  characteristics  of  the 
MgO“CO  interaction  derived  from  quantum 
chemical  calculations  is  the  adsorption  energy.  In  a 
previous  DF  study  of  CO  adsorption  on  MgO(001) 
[14]  based  on  the  BLYP  functional  we  calculated 
binding  energy  values  of  0.20,  0.21,  0.11,  and  0.09 
eV  for  the  PC  embedded  clusters  Mg505,  MggOg, 
Mg2i02i,  and  Mg25025,  respectively.  (Note  that 
PCs  of  +1.8  a.u.  were  used.)  Based  on  the  analysis 
of  the  electrostatic  potential  in  the  previous  sec¬ 
tion,  one  expects  a  weaker  adsorption  interaction 
in  TIMP  models.  Recently,  Nygren  and  Pettersson 
computed  the  energy  of  CO  interaction  with  a 
TIMP  embedded  cluster  Mg505  to  0.08  eV  at  the 
MCPF  level  [17].  In  our  present  DF  TIMP  calcula- 


FIGURE  3.  Sketch  of  the  cluster  model  MggOg-CO. 

tions  the  interaction  of  CO  with  the  MgOfOOl) 
surface  (Fig.  3)  turned  out  to  be  extremely  weak. 
Although  we  were  able  to  locate  a  potential  energy 
minimum  at  the  BLYP  level,  the  binding  energy 
was  found  to  be  negative  after  the  BSSE  correction 
was  taken  into  account  (Table  II).  This  finding  is  in 
line  with  results  of  previous  DF  studies  where  the 
BLYP  functional  was  found  to  underestimate  lig¬ 
and-metal  binding  energies  [31].  The  BP  ex¬ 
change-correlation  functional  yields  slightly  short¬ 
er  equilibrium  values  for  the  Mg-C  distance  and 

TABLE  II _ 

Observables  for  CO  adsorption  on  Mg  cations  of  a 
regular  MgO(001)  surface  calculated  for  clusters 
embedded  using  the  TIMP  technique  or  AE 
Mg"*"^  cations^ 


/le  (Mg 

l-c) 

De 

Cluster  model 

BLYP 

BP 

BLYP 

BP 

TIMP  embedding 
MgOf- 

2.57 

2.46 

-0.016 

0.021 

MggOg 

2.53 

2.49 

-0.011 

0.044 

MggOg 

Mg^"^  AE  embedding 

2.56 

2.49 

-0.003 

0.054 

MggOgMg^l^ 

2.56 

2.49^ 

0.010 

0.066 

^Surface-CO  distance  ^e(Mg-C)  (in  A)  and  binding  energy 
Dq  (in  eV),  corrected  for  the  BSSE,  both  for  the  BLYP  and 
the  BP  exchange-correlation  energy  functionals. 

^  Height  as  optimized  for  the  TIMP  embedded  cluster  mode! 
MggOg  at  BP  level. 
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binding  energy  values  of  0.02-0.05  eV  for  the  vari¬ 
ous  cluster  models.  These  larger  values  are  compa¬ 
rable  to  the  result  of  the  MCPF  study,  but  are  still 
very  small,  about  1  kcal/mol.  Apparently,  an  ac¬ 
curate  description  of  the  CO-MgO  interaction  at 
cationic  sites  of  an  ideal  MgO(OOl)  surface  requires 
the  proper  evaluation  of  weak  interaction  contri¬ 
butions,  such  as  dipole-dipole  and  van  der  Waals 
interactions. 

This  notable  weakening  of  the  interaction  of  CO 
with  a  MgO  substrate  can  be  attributed  to  two 
features  of  the  TIMP  models.  First,  the  electrostatic 
potential  above  Mg^"^  at  a  distance  of  about  3  A 
where  CO  molecule  is  situated  is  very  small;  it 
almost  vanishes  near  the  oxygen  nucleus  (see  the 
previous  section).  As  a  consequence,  the  attraction 
of  the  CO  molecule  to  the  surface  due  to  the 
adsorbate  polarization  is  less  than  in  PC  embed¬ 
ded  models.  Second,  in  the  TIMP  models  the  0^“ 
substrate  anions  are  more  extended  in  the  direc¬ 
tion  of  the  adsorbate.  This  leads  to  a  stronger 
repulsion  between  the  CO  molecule  and  the  closed 
shells  of  the  substrate  anions.  Thus,  electrostatic 
attraction  and  Pauli  repulsion  cancel  each  other, 
resulting  in  an  almost  vanishing  adsorption  en¬ 
ergy.  Note  that  the  change  of  the  electrostatic  po¬ 
tential  is  also  a  consequence  of  the  modified  shape 
of  the  oxygen  anions  which  in  the  TIMP  models 
screen  the  Mg^^  cations  much  better. 

It  is  worth  mentioning  that  the  most  recent 
experimental  estimations  of  the  adsorption  energy 
of  CO  on  MgO(OOl)  are  above  0.4  eV  [40,  41]  while 
the  most  accurate  quantum  chemical  investiga¬ 
tions  [8]  including  the  present  study  furnish  val¬ 
ues  lower  than  0.1  eV.  This  clear  disagreement  has 
been  not  resolved  although  a  plausible  explanation 
has  been  given  [8].  Our  results  allows  us  to  com¬ 
ment  on  this  discrepancy  in  the  next  section. 

CO  ADSORPTION  ON 

THREE-COORDINATED  SITES 

Finally,  we  apply  the  TIMP  technique  to  surface 
defects  where  the  interaction  of  a  CO  molecule  is 
expected  to  be  stronger.  Advantages  and  disad¬ 
vantages  of  the  various  types  of  embedding  mod¬ 
els  will  be  more  evident.  Also,  low-coordinated 
defect  sites  of  MgO  are  of  primary  interest  for 
chemical  applications  such  as  catalysis.  We  con¬ 
sider  here  only  models  of  the  three-coordinated 
Mg  site  as  it  is  the  most  characteristic  one.  Evi¬ 
dently,  regular  surface  sites  and  three-coordinated 
corner  sites  are  two  limiting  cases,  and  the  results 


for  the  four-coordinated  cationic  site  are  expected 
to  fall  between  these  two  limits. 

The  results  for  CO  adsorption  at  a  three-coordi¬ 
nated  corner  Mg  cationic  site  are  given  in  Table  III. 
On  the  unrelaxed  site  (where  the  substrate  struc¬ 
ture  is  kept  frozen  at  bulk  termination),  the  bind¬ 
ing  energy  for  CO  adsorption  is  calculated  to 
0.47-0.48  eV  for  the  BLYP  exchange-correlation 
functional.  The  fact  that  the  minimal  cluster  model 
Mg03~  and  the  stoichiometric  cluster  Mg  4O4  yield 
almost  the  same  results  is  indicative  for  the  quality 
of  TIMP  embedding.  The  BLYP  binding  energy 
value  for  the  unrelaxed  corner  site  is  essentially 
identical  to  that  computed  by  Nygren  and  Petters- 
son  with  the  MCPF  method  [17].  It  is  interesting 
that  the  CO  vibrational  frequency  shift  of  26-27 
cm”^  is  significantly  smaller  than  the  value  of  56 
cm“^  obtained  by  Nygren  and  Pettersson  [17], 

For  the  cluster  model  MgO^”  we  also  consid¬ 
ered  the  relaxation  of  the  top  magnesium  atom.  It 
moves  down  toward  the  O^"  anions  by  0.32  A, 
when  the  positions  of  all  the  other  atoms  are  kept 
fixed.  This  is  in  line  with  the  HF  result,  0.26  A, 
obtained  with  PC  embedding  [42].  At  a  relaxed 
corner  site,  the  energy  of  CO  adsorption  is  calcu¬ 
lated  to  0.37  eV,  about  0.1  eV  smaller  than  at  the 
unrelaxed  site.  Despite  this  weaker  interaction  en¬ 
ergy,  a  larger  C-O  frequency  shift  of  39-41  cm“^ 
is  obtained.  For  this  model,  both  BLYP  and  BP 
energy  functionals  yield  very  similar  results  (Table 
III). 

TABLE  III _ 

Observables  for  CO  adsorption  on 
three-coordinated  Mg  cations  of  MgO  calcuiated 
for  clusters  embedded  using  the  TiMP  techniques.^ 


Cluster  model 

/Je  (Mg-C) 

Te  (C-O) 

De 

Awg 

Unrelaxed 
MgO|-,  BLYP 

2.31 

1.1290 

0.469 

2168 

26 

Mg404,  BLYP 

2.31 

1.1288 

0.482 

2169 

27 

Relaxed 
MgO^-,  BLYP 

2.36 

1.1287 

0.366 

2183 

41 

MgO^,  BP 

2.36'’ 

1.1284 

0.376 

2192 

39 

CO  molecule 
BLYP 

_ 

1.1345 

_ 

2142 

_ 

BP 

— 

1.1340 

— 

2153 

— 

^  Mg-CO  distance  (Mg-C)  and  C-O  distance  (C-O) 
(in  A),  binding  energy  (in  eV.  corrected  for  BSSE), 
harmonic  frequency  of  the  C-O  vibration  and  the  corre¬ 
sponding  adsorption-induced  frequency  shift  Awg  (in 
cm""'). 

^  Height  as  optimized  at  BLYP  level. 
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Our  data  for  the  relaxed  three-coordinated  site 
are  very  similar  to  experimental  data  for  CO  ad¬ 
sorption  on  MgO  films  grown  over  a  Mo(lOO) 
surface  [41].  In  this  latter  work,  an  infrared  (IR) 
absorption  peak  at  2178  cm"^  (35  cm“^  blue- 
shifted  with  respect  to  gas-phase  CO)  is  observed 
under  ultrahigh  vacuum  conditions  at  a  tempera¬ 
ture  of  90  K.  The  adsorption  energy  has  been 
determined  from  isothermal  adsorption  and  tem¬ 
perature-programmed  desorption  to  0.43  and  0.46 
eV,  respectively  [41].  It  has  been  argued  [41]  that 
the  supported  MgO  film  used  in  the  experimental 
study  exhibits  a  low  concentration  of  surface  de¬ 
fects,  but  there  is  no  direct  information  from  ex¬ 
periment  as  to  which  type  of  sites  are  being  probed 
by  CO  adsorbates.  The  present  DF  results  support 
the  proposition  [17]  that  only  CO  adsorption  at 
low-coordinated  defect  sites  of  MgCKOOl),  but  not 
at  regular  five-coordinated  Mg  sites,  yields  an  ad¬ 
sorption  energy  exceeding  0.4  eV  as  measured  by 
He  et  al.  [41]. 


Summary 

We  have  presented  a  DF  implementation  of  the 
TIMP  embedding  technique  which  opens  opportu¬ 
nities  for  cluster  model  studies  of  ionic  materials 
with  improved  boundary  conditions.  Comparative 
calculations  of  PC  and  TIMP  embedded  cluster 
models  showed  that  the  TIMP  approach  success¬ 
fully  repairs  the  main  deficiency  of  PC  embedded 
models,  namely  the  artificial  polarization  of  oxy¬ 
gen  anions  by  positive  PCs  at  the  cluster  border. 
TIMP  cluster  models  yield  substantially  improved 
one-electron  energies.  The  splitting  of  core  levels 
of  Mg  and  O  atoms  in  various  cluster  positions  is 
reduced  to  less  than  0.1  eV.  HOMO  and  LUMO 
energies  come  substantially  closer  to  the  corre¬ 
sponding  values  calculated  within  periodic  DF  slab 
models;  the  HOMO-LUMO  gap  increases  by  1-2 
eV.  As  a  consequence  of  the  low-lying  LUMO, 
cluster  models  with  simple  PC  embedding  feature 
an  exaggerated  electron  affinity;  this  shortcoming 
is  also  rectified  in  TIMP  embedded  cluster  models. 
For  TIMP  cluster  models,  the  electrostatic  potential 
above  the  central  Mg^'^  cations  in  the  adsorbate 
region  is  much  weaker  than  in  models  with  simple 
PC  embedding,  resulting  in  significantly  reduced 
adsorbate-substrate  interactions.  In  these  novel 
TIMP  models  the  adsorption  energy  of  CO  on  a 
regular  MgO(OOl)  surface  is  calculated  to  be  only 


about  0.05  eV  at  BP  level.  For  the  CO  adsorption  at 
the  partially  relaxed  three-coordinate  corner  site 
the  calculated  binding  energy  is  0.38  eV  and  the 
CO  vibrational  frequency  shift  is  39  cm~^. 
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ABSTRACT:  Transition-metal  (TM)-main-group  multiply  bonded  complexes  have 
been  heavily  studied,  due  to  their  usefulness  in  a  variety  of  applications,  e.g.,  catalysis. 
The  large  majority  of  multiply  bonded  complexes  are  early  TM  systems,  but  late  metal 
complexes  are  becoming  more  common  and  have  been  shown  to  be  potent  catalysts.  An 
investigation  of  ligand  and  substituent  effects  for  representative  late  TM  multiply 
bonded  complexes  with  heavy  (Pt-phosphinidenes)  and  light  (Ru-alkylidenes)  elements 
is  presented.  Fundamental  differences  in  the  response  of  the  metal-ligand  multiple  bond 
are  observed  for  light  vs.  heavy  main-group  elements  for  these  late  TM  systems.  The 
results  are,  in  many  respects,  diametrically  opposed  to  those  seen  for  their  early  TM 
congeners.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem  65:  987-996,  1997 


Introduction 

Transition-metal  (TM)”main-group  (MG)  mul¬ 
tiply  bonded  complexes  have  been  heavily 
studied  since  the  discovery  of  metal  carbenes  in 
1964  [1].  Multiply  bonded  complexes  participate  in 
a  wide  variety  of  important  reactions,  such  as 
methane  activation  by  imidos  (L„M  =  NR)  [2]  and 
alkane  oxidation  by  oxos  (L„M  =  O)  [3].  The  ma¬ 
jority  of  TM  =  MG  complexes  involve  MG  ele¬ 
ments  from  the  first  row  and  TMs  from  the  early 
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to  middle  portions  of  the  transition  series.  Analogs 
with  heavier  MG  elements  are  becoming  more 
common  [4].  Likewise,  multiply  bonded  com¬ 
plexes  of  later  TMs  (TMiatg=MG)  are  also  being 
more  heavily  studied,  due  in  large  part  to  their 
intermediacy  in  different  catalytic  processes  [5]. 

The  seeming  instability  of  com¬ 

plexes  has  been  explained  to  arise  from  the  repul¬ 
sion  between  the  d-orbital  electron  density  on  the 
late  TM  and  MG  orbitals  that  could  be  used  in 
TT-bonding  [6].  As  described  by  Mayer  for  oxos, 
this  leads  to  the  occupation  of  orbitals  with  7r-anti- 
bonding  character  and,  hence,  weakening  and 
polarization  of  the  metal-element  Tr-bond  [6].  In 
valence  bond  terms,  such  a  situation  suggests  a 
balance  between  resonance  structures  such  as  A 
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and  B.  The  qualitative  model  proposed  by  Mayer 
should  apply: 

L„M+— E-— R  L„M=E— R 

/ 

A  B 

whether  the  MG  is  light  or  heavy,  but  the  anti¬ 
bonding  interaction  may  exacerbate  the  already 
weaker  7r-bonding  power  observed  in  heavier  MG 
elements. 

In  the  majority  of  cases,  the  multiple  bond  is  the 
"'active  site"  which  defines  the  reactivity  of  a 
TM=MG  complex.  The  reactivity  of  TM=MG 
complexes  of  early  TMs,  particularly  for  light  MG 
elements,  is  often  limited  by  the  thermodynamic 
and  kinetic  stability  of  the  multiple  bond.  While 
this  is  advantageous  in  terms  of  experimental  iso¬ 
lation  of  such  species,  it  is  an  obvious  limitation  in 
terms  of  utilizing  TM=MG  complexes  for  further 
reactions.  Furthermore,  our  computations  on  mul¬ 
tiply  bonded  complexes  of  early  TMs  suggest  that 
the  strength  of  the  multiple  bond  engenders  a 
degree  of  insensitivity  to  changes  in  the  ligands 
and  substituents,  making  it  difficult  to  substan¬ 
tially  tailor  their  chemistry  through  synthetic  mod¬ 
ification  [7].  Conversely,  given  the  instability  and 
metastability  of  multiple  bonds  involving  later 
TMs,  it  may  be  possible  to  harness  them  for  syn¬ 
thetic  and  catalytic  benefit  if  one  could  assess  the 
sensitivity  of  TMiate=MG  complexes  to  their 
chemical  environment.  As  it  is  often  difficult  to 
obtain  direct,  detailed  experimental  information 
on  TMiate^^MG  complexes,  they  are  ideal  candi¬ 
dates  for  computational  research. 

In  the  present  research,  we  chose  two 
TM late— MG  systems  for  study:  One  involves  a 
lighter  MG  element  (Ru-alkylidenes)  and  the  other 
a  heavier  MG  element  (Pt-phosphinidenes).  Our 
goal  is  to  more  fully  investigate  the  bonding  in 
TMiate=MG  complexes  using  quantum  calcula¬ 
tions.  An  investigation  of  the  electronic  and  molec¬ 
ular  structure  of  platinum-phosphinidene  and 
ruthenium-alkylidenes  complexes  is  presented. 
Platinum  complexes  studied  are  of  the  form 
(PX3)2Pt=PR,  where  (PX3)2  is  two  monodentate 
or  a  single  bidentate  phosphine  and  R  is  an  aryl  or 
alkyl  group.  These  systems  are  models  of  those 
recently  proposed  by  Glueck  et  al.  as  intermediates 
in  metal-catalyzed  olefin  hydrophosphination  [5a]. 
The  d^(X3P)2Pt^  moiety  is  isolobal  to  methylene 
and  other  isovalent  groups.  This  suggests  some 
multiple  bonding  for  the  PtP  bond.  On  the  other 


hand,  the  configuration  of  (X3P)2Pt^  will  result 
in  occupation  of  orbitals  which  are  PtP  Tr-antibond- 
ing.  It  was  a  goal  of  this  research  to  determine  if 
electronic  effects  can  stabilize  a  Pt-phosphinidene. 
All  known  phosphinidenes  contain  very  bulky  lig¬ 
ands  to  prevent  oligomerization  [8].  This  research 
used  small  models  that  allow  electronic  properties 
to  be  studied,  but  kept  the  models  computation¬ 
ally  feasible. 

In  contrast  to  the  instability  of  platinum  phos¬ 
phinidenes,  ruthenium  alkylidenes  have  been  syn¬ 
thesized,  isolated,  and  structurally  characterized. 
Alkylidenes  of  Ru  have  shown  high  activity  in 
ring-opening  metathesis  polymerization  (ROMP), 
ring-closing  metathesis  (RCM),  and  acyclic  diene 
metathesis  polymerization  (ADMET)  [9].  Grubbs 
et  al.  showed  that  Ru  complexes  are  very  tolerant 
of  functional  substrates  and  catalyze  the  ROMP  of 
both  high-  and  low-strain  cyclic  olefins  [9a,b]. 
Ruthenium  systems  have  also  been  studied  as  wa¬ 
ter-soluble  catalysts,  being  stable  in  protic  media 
[5,  10].  Grubbs  et  al.  increased  the  metathesis  ac¬ 
tivity  of  Ru  catalysts  by  replacing  the  triphenyl- 
phosphine  ligands  with  better  cr-donating 
alkylphosphines  [9b],  so,  clearly,  the  electronics  of 
the  metal  are  an  important  parameter  to  be  fine- 
tuned.  It  has  been  suggested  that  the  active  species 
in  the  catalytic  process  have  a  ruthenium- 
carbon  multiple  bond  [9a].  An  investigation  of 
ligand  and  substituent  effects  on  the  Ru=C  bond 
is  presented.  Square  pyramidal  complexes  of  the 
form  Ru(PX3)2(Y)2(=CZ2)  have  been  modeled 
(where  X,Y,Z  =  H,  Cl,  F,  or  Me).  As  with  plat¬ 
inum  phosphinidenes,  the  substituent  size  has  been 
kept  small  to  keep  the  computational  time  reason¬ 
able. 


Computational  Methods 

Calculations  employed  the  sequential  and  paral¬ 
lel  versions  of  the  GAMESS  quantum  chemistry 
program  [11].  The  effective  core  potentials  (ECPs) 
and  valence  basis  sets  of  Stevens  et  al.  [12]  were 
used  for  all  heavy  atoms  and  the  ~31G  basis  set 
for  H.  In  the  Stevens  scheme,  all  ECPs  for  elements 
heavier  than  neon  are  derived  from  Dirac- 
Hartree-Fock  calculations  and  thus  implicitly  in¬ 
clude  Darwin  and  mass-velocity  relativistic  ef¬ 
fects;  spin-orbit  coupling  is  averaged  out,  using  a 
/-weighted  averaging  scheme,  in  potential  genera¬ 
tion.  ECPs  replace  the  innermost  core  orbitals  for 
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platinum  and  all  core  orbitals  for  MG  elements. 
Thus,  the  ns,  np,  nd,  (n  +  l)s,  and  (n  +  l)p  shells 
are  treated  explicitly  for  ruthenium  and  platinum; 
for  the  MG  elements,  ns  and  np  are  treated  explic¬ 
itly.  The  Pt  and  Ru  valence  basis  sets  are  qua¬ 
druple  and  triple  zeta  for  the  sp  and  d  shells, 
respectively,  while  MG  elements  have  a  double- 
zeta-plus-polarization  valence  basis. 

Geometries  were  optimized  using  restricted 
Hartree-Fock  (RHF)  and  wave  functions  for 
closed-shell  singlets.  Bond  lengths  and  angles  for 
TM  complexes  are  typically  predicted  to  within 
1-3%  of  experiment  using  this  computational 
scheme,  termed  RHF /SBK(d),  involving  com¬ 
plexes  in  a  variety  of  geometries,  oxidation  states, 
and  metals  from  the  entire  transition  series  [13], 

Results 

PLATIIVUM  PHOSPHINIDENES 

The  geometry  of  the  Pt-phosphinidenes  is 
shown  in  1.  All  complexes  were  optimized  under 
Cg  symmetry.  Substituents  are  listed  in  Tables  I-IV 
along  with  corresponding  calculated  data.  The  par¬ 
ent  complex,  where  PX3=PH3  and  R  =  H,  is  a  Q 
symmetry  minimum  with  Pt=P  =  2.196  A  and 
Pt=p_H  =  102.7".  When  Pt=P— H  is  fixed 
linear  (Pt=P — H  =  180°),  Pt=P  decreases  by 
0.03-2.166  A,  supporting  the  idea  that  forcing 
phosphorus  lone-pair  donation  to  the  metal  will 
shorten  the  bond  and  strengthen  the  Pt=P  bond. 


The  linear  structure  is  not  a  minimum,  though. 
Experimental  models  are  rare;  Pt — P  bonds  in  the 
phosphido  dimers^  discussed  by  Glueck  et  al.  [5a] 
are  roughly  2.335  A(trans  to  Cl).  Calculated  Pt=P 
bond  lengths  are  several  percent  shorter  ( 2.2  A), 
consistent  with  some  multiple  bonding  character* 
(Tables  I-IV): 

Pt  =  p^R 

X3P 

1 

Alkyl  and  Silyl  Phosphinidenes 

The  aliphatic  and  silyl  derivatives  modeled  are 
listed  in  Table  I.  Upon  changing  the  phosphorus 
substituent  from  H  to  Me,  Et,  ^Pr,  and  ^Bu,  there  is 
very  little  change  in  bond  length  (0.002  A),  al¬ 
though  the  Pt=P — R  angle  increases  (102.7°, 
111.8°,  112.2°,  113.8°,  and  114.6°,  respectively),  due 
to  the  larger  phosphinidene  substituent.  The  geo¬ 
metries  of  the  two  'Pr  conformations  studied  are 
shown  in  2a  and  2b;  there  is  no  difference  in  Pt=P 
bond  lengths  between  2a  and  2b,  and  the 
pt=p — R  angles  differ  by  only  1.4°.  The  con- 
former  with  methyl  groups  pointing  down,  2b,  is 
lower  in  energy  by  «  1  kcal  mol“^  than  that  of  2a 
at  the  RHF/SBK(rf)  level  of  theory.  Replacing  the 
phosphinidene  substituent  in  the  parent  complex 
with  SiH3  and  CH2SiH3  has  little  effect  on  the 

^Throughout  the  article,  the  Pt — phosphinidene  bond  will 
be  indicated  as  Pt=P  to  distinguish  it  from  the  Pt-phosphine 
bonds. 


TABLE  I _ 

Alkyl  and  silyl  phosphinidene  derivatives.^ 


X 

R 

Pt=P(A) 

R— P— R  (°) 

Q_ 

II 

q1 

0 

CD 

BO(Pt— R) 

<7(R) 

qiP)'^ 

H 

H 

2.196 

102.7 

1.810 

0.898 

-0.645 

0.061 

H 

H  (linear) 

2.166 

180.0 

1.759 

0.937 

-0.442 

-0.327 

H 

Me 

2.198 

111.8 

1.781 

0.924 

-0.680 

0.129 

H 

Et 

2.197 

112.2 

1.811 

0.923 

-0.703 

0.153 

H 

‘Pr  (2a) 

2.197 

113.8 

1.778 

0.869 

-0.718 

0.157 

H 

'Pr  (2b) 

2.197 

112.4 

1.792 

0.875 

-0.710 

0.148 

H 

^Bu 

2.197 

114.6 

1.822 

0.847 

-0.745 

0.184 

Me 

H 

2.200 

102.9 

1.828 

0.891 

-0.787 

-0.040 

dpm 

H 

2.192 

102.4 

1.734 

0.899 

-0.443 

-0.053 

F 

H 

2.215 

101.3 

1.614 

0.903 

-0.665 

0.194 

H 

CH2SiH3 

2.196 

111.9 

1.854 

0.850 

-0.680 

0.139 

H 

SiHs 

2.196 

101.9 

1.817 

0.976 

-0.715 

-0.003 

®See  structure  1  for  definitions  of  X  and  R.  The  bond  order  (BO)  is  calculated  as  described  in  [14];  atomic  charges  (q)  are 
determined  from  a  Mulliken  population  analysis  [15]. 

‘^The  atomic  charge  on  the  phosphinidene  phosphorus  is  q{P). 
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TABLE  II 


Halide  phosphinidene  derivatives.^ 

X 

R 

Pt=P  (A) 

Pt— P— R  (°) 

BO(Pt=P) 

BO(P— R) 

Q(Pt) 

C7(P)" 

H 

H 

2.196 

102.7 

1.810 

0.898 

-0.645 

0.061 

F 

H 

2.215 

101 .3 

1.614 

0.903 

-0.665 

0.194 

H 

Me 

2.198 

111.8 

1.781 

0.924 

-0.680 

0.129 

H 

CH2F 

2.194 

109.2 

1.769 

0.939 

-0.751 

0.217 

H 

CHF2 

2.189 

107.8 

1.826 

0.904 

-0.618 

0.100 

H 

CF3 

2.187 

106.6 

1.820 

0.875 

-0.645 

0.187 

H 

CH2CI 

2.193 

108.4 

1.781 

0.891 

-0,710 

0.233 

H 

CH2Br 

2.193 

107.9 

1.790 

0.892 

-0.703 

0.227 

H 

CH2I 

2.193 

107.7 

1.793 

0.870 

-0.700 

0.238 

®See  footnote  a  to  Table  I, 
‘’See  footnote  b  to  Table  I. 


TABLE  III _ 

Aromatic  phosphinidene  derivatives.^ 


X 

R 

CL 

II 

s: 

Pt— P— R  (°) 

BO(Pt=P) 

BO(P— R) 

q(Pt) 

C?(P)'’ 

H 

H 

2.196 

102.7 

1.810 

0.898 

-0.645 

0.061 

H 

CgHs 

2,204 

115.1 

1.730 

1.066 

-0.697 

-0.012 

H 

CeHg  (3) 

2.194 

108.3 

1.750 

0.823 

-0.647 

0.055 

H 

4-C6H4F 

2.193 

108.3 

1.759 

0.810 

-0.649 

0.059 

H 

2,6-C0H3F  2 

2.188 

107.3 

1.726 

0.718 

-0.619 

0.183 

H 

3,5-CgH3F2 

2.191 

108.1 

1.756 

0.808 

-0.640 

0.065 

H 

2,4,6-C6H2F3 

2.188 

107.4 

1.735 

0.705 

-0.619 

0.181 

H 

CeFs 

2.186 

105.9 

1.737 

0.696 

-0.619 

0.183 

H 

Pyridine 

2.191 

107.9 

1.750 

0.819 

-0.644 

0.082 

®See  footnote  a  to  Table  1. 

‘^See  footnote  b  to  Table  1. 

TABLE  IV 


Cyano  phosphinidene  derivatives. 

X 

R 

Pt=P  (A) 

pt_p_R  (°) 

bo(r=p) 

BO(P— R) 

q(Pt) 

q(P)'’ 

H 

H 

2.196 

102.7 

1.810 

0.898 

-0.645 

0.061 

H 

z: 

0 

1 

c\ 

X 

0 

2.191 

109.2 

1.821 

0.873 

-0.687 

0.209 

H 

C(CN)3 

2.178 

104.5 

1.871 

0.717 

-0.633 

0.299 

H 

C(CH3)2CN 

2.191 

111.9 

1.829 

0.842 

-0.728 

0.241 

dpnn 

CH2— CN 

2.184 

109.3 

1.715 

0.879 

-0.456 

0.088 

®See  footnote  a  to  Table  1. 

‘’See  footnote  b  to  Table  1. 
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Pt=P  bond  length: 


When  the  phosphine  ligands  are  changed  from 
PH 3  to  PMe3,  the  Pt=P  bond  length  increases 
slightly  from  2.196  to  2.200  A.  The  Pt  =  P — H 
angles  are  near  identical:  102.7°  in  (PH3)2Pt=PH 
and  102.9°  in  (PMe3)2Pt=PH.  There  is  an  even 
larger  increase  (0.02  A)  in  the  Pt=P  bond  length 
when  the  parent  phosphines  are  replaced  by  the 
very  good  7r-acceptors  PF3.  Finally,  replacement  of 
the  two  phosphine  ligands  with  a  single  bidentate 
diphosphine  (dpm,  H2PCH2PH2,  a  model  of 
dppm)  shortens  the  Pt=P  bond  length  from  2.196 
to  2.192  A. 

Haloalkyl  Phosphinidenes 

The  haloalkyl-substituted  phosphinidenes  mod¬ 
eled  are  listed  in  Table  II.  The  Pt=P  bond  length 
remained  constant  at  2.193  A  for  the  halomethyl- 
substituted  derivatives  (H3P)2Pt=P — CH2X  (X 
=  Cl,  Br,  I).  The  bond  angles  changed  a  few  de¬ 
grees  over  the  fluoromethyl  series  (106,6°  [in 
(PH  3)2^1=  PCF3  ]  -  109.2°  [in 

(PH3)2Pt  =  PCH2F]).  Upon  replacing  R  =  Me 
with  R  =  CH2F,  the  bond  length  reduces  from 
2.198  to  2.194  A  and  further  decreases  to  2.189  A 
when  R  =  CHF2  and  to  2.187  A  when  R  =  CF3. 
This  suggests  that  the  Pt=P  bond  can  be  affected 
through  the  cr  framework,  assuming  the  primary 
influence  of  the  fluorine  is  by  way  of  inductive 
(electronegativity)  effects. 

Aryl  Phosphinidenes 

Table  III  lists  the  aromatic  substituents  studied. 
Conformer  3  was  assumed  for  all  complexes  stud¬ 
ied  based  on  experimental  models  such  as 
Cp2Mo=PMes*  [8a].  A  gradual  decrease  is  seen 


as  more  fluorines  are  added,  except  for  the  3,5-di- 
fluoro  derivative,  which  increases  slightly  from  the 
2,6-difluoro  derivative  (2.191  vs.  2.188  A  when 
R  =  2,6-C6H3F2).  The  Pt=P^  bond  length  de¬ 
creases  from  2.194  to  2.186  A  after  completely 
replacing  the  phenyl  hydrogens  with  fluorines 
(R  =  C^Hg  vs.  R  =  C5F5).  Also  note  that  a 
para-¥  substituent  has  little  effect  as  can  be  seen 
by  comparing  (PEt3)2Pt=P — Ph  with 
(PH  3)2Pt  =  P  —  p-C^H  4F  or 
(PH  3)2Pt  =  P-2,6-C  ^H  3F2  with 
(PH3)2Pt=P-2,4,6-C6H2F3.  The  shorter  Pt=P 
bond  of  (PH3)2Pt=P— 2,6-C6H3F2  (2.188  A)  vs. 
that  in  (PH3)2Pt=P— 3,5-C6H3F2  (2.191  A)  can 
be  interpreted  as  pointing  to  the  greater  impor¬ 
tance  of  inductive  (a)  over  conjugative  (tt)  effects 
in  terms  of  influencing  the  Pt-phosphinidene  link¬ 
age  (Table  III): 

O 
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Cyanoalkyl  Phosphinidenes 

An  interesting  series  of  complexes  are  the  cyano 
derivatives  listed  in  Table  IV.  The  bond  angles  are 
not  correlated  with  bond  length.  The  shortest  bond 
length  has  the  smallest  angle.  The  largest  angle 
[  R  =  C(CH3)2CN,  Pt=P— R  =  111.9°]  is  likely 
due  to  the  larger  substituent  size  rather  than  to 
any  electronic  effect.  All  cyano  derivatives  have  a 
shorter  Pt=P  bond  length  than  that  of  the  parent 
complex  (R  =  H,  Pt=P  =  2.196  A).  The  tricyano 
complex  [R  =  C(CN)3]  has  the  smallest  Pt — P  bond 
length  (2.178  A)  of  all  the  complexes  modeled, 
supporting  the  above  statement  (Table  IV). 


RUTHENIUM  ALKYLIDENES 

The  two  idealized  conformations  of  the  Ru(IV) 
alkylidenes  are  shown  in  4.  All  complexes  were 
optimized  under  €2^  symmetry.  Tables  V  and  VI 
list  the  substituents  along  with  the  corresponding 
calculated  data; 


Y’ 


PX, 

I 

Ru=C. 

'  I 

PX3 

4a 


I 

Ru'^c*; 

I 

PX3 

4b 


(  =  C  H  —  p-C  4C  1(PC  y3)2  and 
RuCl2(=CHCH=CPh2)(PCy3)2,  both  character¬ 
ized  by  Grubbs  et  al.  [9f],  and  cationic 
[  RuCl(=CHCH=CPh2)(CO)(PTr3)2  ]  charac¬ 
terized  by  Esteruelas  et  al.  [16].  All  three  com¬ 
plexes  correspond  to  conformer  4b. 
RuCl2(=  CH  — p-C6H4Cl)(PCy3)2  has 
Ru=C  =  1.839(3)  A,  Ru— Cl  =  2.401(1),  2.395(1) 
A,  and  Ru  — P  =  2.397(1),  2.435(1)  A; 
RuCl2(=  CHCH  =  CPh2)(PCy3)2  has 
Ru  =  C  =  1  .851(21)  A  ;  and 

[RuCl(  =  CHCH  =  CPh2)(CO)(PiPr3)2]+  has 
Ru=C=  1.8  74  (p  A  and 
Ru — P  =  2.4586(8),  2.4307(8)  A.  No  comparison  to 
Ru — Cl  is  made  in  the  latter  complex  since  CO  is 
trans  to  chloride.  The  best  model  for  these  systems 
is  X  =  Me,  Y  =  Cl,  Z  =  H  /Me,Cl,H),  4b.  The 
model  has  Ru=C  =  1.871  A,  Ru — Cl  =  2.488  A, 
and  Ru— P  =  2.481  A  (Table  V).  The  Ru-al- 
kylidene  bond  length  is  less  than  2%  different 
from  the  X-ray  structures,  while  Ru — P  and 
Ru — Cl  are  both  within  4%  of  experiment,  so  that 
all  three  metal-ligand  bond  lengths  are  in  excel¬ 
lent  agreement  with  experiment. 

Effect  of  Phosphine  Substituent  (X) 

Modification 


Several  five-coordinate  ruthenium  alkylidenes 
have  been  isolated  and  characterized.  The  most 
similar  to  the  model  complexes  are  RUCI2 


The  series  of  phosphine  substituents  studied 
(X  =  H,  F,  Cl,  Me)  were  modeled  for  Y  =  Cl.  Phos¬ 
phine  substituent  modifications  have  a  relatively 


TABLE  V 


Important  bond  lengths  in  ruthenium  aikylidenes.^ 


X 

Y 

z 

Ru— C 
(4a*) 

Ru— P 
(4a) 

Ru— Y 
(4a) 

AErhf 

Ru— C 
(4b) 

Ru— P 
(4b) 

Ru— Y 
(4b) 

H 

H 

H 

1.853 

2.436 

1.708 

14.3 

1.889 

2.408 

1.743 

H 

F 

H 

1.837 

2.509 

2.000 

0.2 

1.873 

2.481 

2.047 

H 

Cl 

H 

1.879 

2.498 

2.440 

-2.7 

1.886 

2.472 

2.468 

H 

Me 

H 

1.841 

2.456 

2.169 

15.4 

1.879 

2.431 

2.237 

H 

Cl 

H 

1.879 

2.498 

2.440 

-2.7 

1.886 

2.472 

2.468 

F 

Cl 

H 

1.908 

2.443 

2.418 

-1.0 

1.917 

2.421 

2.438 

C! 

Cl 

H 

1.893 

2.502 

2.417 

0.5 

1.914 

2.478 

2.441 

Me 

Cl 

H 

1.869 

2.503 

2.450 

-4.5 

1.871 

2.481 

2.488 

H 

Cl 

H 

1.879 

2.498 

2.440 

-2.7 

1.886 

2.472 

2.468 

H 

Cl 

F 

1.980 

2.500 

2.454 

1.5 

1.990 

2.486 

2.460 

H 

Cl 

Cl 

1.965 

2.516 

2.447 

-0.1 

1.984 

2.478 

2.470 

H 

Cl 

Me 

2.007 

2.528 

2.475 

-3.4 

1.983 

2.474 

2.495 

H 

F 

F 

1.954 

2.513 

2.034 

0.4 

1.976 

2.496 

2.042 

Me 

H 

H 

1.847 

2.433 

1.711 

12.2 

1.873 

2.415 

1.752 

®See  structures  4a  and  4b  for  alkylidene  conformations.  Bond  lengths  are  in  A,  and  AErhf  is  the  difference  between  4a  and  4b.  A 
negative  number  indicates  conformer  4b  is  lower  in  energy  than  is  conformer  4a. 
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small  effect  on  the  alkylidene  bond  length  ( <  0.04 
A)  (Table  V).  However,  from  a  structural  point  of 
view,  the  sensitivity  of  the  Ru-alkylidenes  to 
phosphine  modification  is  greater  than  seen  for 
Pt"phosphinidenes. 

When  replacing  phosphine  hydrogens  in  the 
parent  complex  (H,H,H)  with  methyl  groups 
(Me,  H,  H),  the  change  in  Ru  =  C,  Ru — P,  and  Ru 
-^Cl  bond  lengths  is  <  0.01  A  for  4a  and  <  0.02 
A  for  4b.  The  bond  lengths  in  (H,C1,H)„  are 
Ru=C  =  1.879  A,  Ru— P  -  2.498  A,  and  Ru— Cl 
=  2.440  A.  Upon  changing  X  to  fluorine 
(F,C1,H)3,  Ru=C  increases  by  0.029  to  1.908  A, 
Ru — P  decreases  by  0.055  to^  2.443  A,  and  Ru — Cl 
decreases  by  0.022  to  2.418  A.  The  decrease  in  Ru 
— P  bond  length  is  consistent  with  the  increased 
s-character  on  the  P  lone  pair  because  of  the  elec¬ 
tronegative  fluorines  [17].  When  X  is  changed  from 
F  to  Cl  (C1,C1,H\,  Ru=C  and  Ru — Cl  remain 
about  the  same  (1.893  and  2.417  A,  respectively). 
The  Ru — P  bond  length  changes  by  0.059  to  2.502 
A.  When  Cl  is  replaced  with  Me  (Me,Cl,H)^, 
Ru — P  is  nearly  unchanged  (2.503  A),  while 
Ru=C  decreases  by  0.024  to  1-869  A  and  Ru — Cl 
increases  by  0.033  to  2.450  A.  The  geometric 
changes  when  the  X  substituent  is  modified  in 
conformer  4b  are  nearly  identical  to  those  just 
discussed  for  conformer  4a. 

Analysis  of  the  atomic  charge  calculated  by  a 
Mulliken  population  analysis  (MPA)  reveals  that 


(Me,  H,  H)  (both  4a  and  4b)  has  the  most  negative 
charge  on  ruthenium.  This  is  in  support  of  Grubbs' 
finding  [9b]  that  metathesis  activity  is  increased  by 
replacing  the  triphenylphosphine  ligands  with  bet¬ 
ter  cr-donating  alkylphosphines,  if  the  rate-de¬ 
termining  step  is  breaking  the  C=C  double  bond. 
Increased  electron  density  on  the  metal  will  in¬ 
crease  backbonding  into  tt*  orbitals  of  the  olefin 
and  should  facilitate  subsequent  [2^  +  2^]  cy¬ 
cloaddition.  Further  investigation  of  this  point  will 
require  a  more  in-depth  computational  analysis  of 
the  mechanism  of  olefin  metathesis  for  the  Ru-al- 
kylidene  catalysts. 

In  all  cases  for  (X,  Cl,  H),  there  is  a  small  ( <  5 
kcal  mol~^)  energetic  difference  between  conform- 
ers  4a  and  4b  regardless  of  the  phosphine  sub¬ 
stituent.  In  three  cases,  conformer  4b  is  calculated 
to  be  slightly  more  stable  than  is  4a.  For  the 
(Cl,  Cl,  H)  complex,  the  situation  is  reversed  with 
(C1,C1,H)„  more  stable  than  is  (C1,C1,H)^  by  0.5 
kcal  mol“L  The  experimental  catalysts  have  chloro 
ligands  [9].  Hence,  the  computations  are  consistent 
with  experimental  observation  of  two  Ru-al- 
kylidene  conformers,  i.e.,  the  carbene  is  either  or¬ 
thogonal  or  parallel  to  the  P — Ru — P  axis  depend¬ 
ing  on  X  [9b].  This  point  is  addressed  more  below. 

For  a  given  set  of  ligands  and  substituents, 
conformer  4b  has  a  more  negative/less  positive 
charge  on  Ru  and  a  more  positive  charge  on  C 
than  have  the  corresponding  atoms  in  4a.  In  other 


TABLE  VI 


Important  charges  on  ruthenium  alkylidenes.^ 


X— P 
subst. 

Y— Ru 
subst. 

Z— C 
subst. 

q(Ru) 

(4a*) 

q(C) 

{4a) 

q{Ru) 

(4b) 

q(C) 

(4b) 

H 

H 

H 

-0.9262 

0.0227 

- 1 .0547 

0.0998 

H 

F 

H 

0.1691 

0.1463 

0.0335 

0.1831 

H 

Cl 

H 

-0.5921 

0.1888 

-0.7499 

0.2285 

H 

Me 

H 

-0.9382 

0.0194 

-1.1218 

0.1711 

H 

Cl 

H 

-0.5921 

0.1888 

-0.7499 

0.2285 

F 

Cl 

H 

-0.6751 

0.2063 

-0.8217 

0.2529 

Cl 

Cl 

H 

-0.5380 

0.1928 

-06856 

0.2534 

Me 

Cl 

H 

-0.7352 

0.1395 

-0.9179 

0.1916 

H 

Cl 

H 

-0.5921 

0.1888 

-0.7499 

0.2285 

H 

Cl 

F 

-0.5300 

0.6396 

-0.4893 

0.5369 

H 

Cl 

Cl 

-0.3974 

0.0496 

-1.0020 

-0.0680 

H 

Cl 

Me 

-0.5856 

0.0126 

-0.6912 

0.0181 

H 

F 

F 

0.2791 

0.5907 

0.2951 

0.4937 

Me 

H 

H 

-1.0698 

-0.0061 

-1.2218 

0.0743 

®See  structures  4a  and  4b  for  alkylidene  conformations.  The  atomic  charges  (q)  are  determined  from  a  Mulliken  population 
analysis. 
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words,  the  rotating  about  the  Ru=C  bond  leads 
to  increased  polarization  of  the  Ru=C  bond.  As 
the  polarity  of  a  multiple  bond  is  often  an  impor¬ 
tant  determinant  of  its  reactivity,  and  solution- 
phase  data  suggest  that  both  conformers  are  acces¬ 
sible  for  experimental  systems,  this  is  an  important 
result. 

Effect  of  Ligand  (Y)  Modification 

Inspection  of  the  data  in  Table  V  makes  it  evi¬ 
dent  that  from  a  geometric  point  of  view  modifica¬ 
tion  of  the  Ru  ligands  has  a  small  effect.  This  is 
true  for  both  conformers  4a  and  4b.  For  example, 
the  parent  complex  is  (H,  H,  H)^,  with 
Ru=C  =  1,853  A,  Ru— P  =  2.436  A,  and  Ru— H 
==  1.708  A.  When  Ru  ligands  (Y)  in  the  parent 
complex  are  replaced  with  fluorine  (H,  F,  H)^, 
Ru=C  decreases  by  only  0.01  to  1.84  A  and 
Ru — P  increases  by  0.07  to  2.51  A,  Replacing  Y 
with  Cl  (H,C1,H)^  has  an  opposite  effect  to  F  on 
the  Ru=C  bond  length,  i.e.,  it  increases  by  0.03  to 
1.88  A.  The  Ru — P  bond  length  is  about  the  same 
in  (H,  F,  H)g  and  (H,  C1,H)^  at  2.50  A.  Replacing 
the  hydride  ligand  with  a  methyl  ligand  does  not 
result  in  a  large  change  in  geometry.  However,  as 
before,  changes  are  more  substantial  than  seen  for 
Pt-phosphinidenes. 

Despite  the  structural  similarity  of  conformers 
4a  and  4b  with  ligand  modification,  there  are  no¬ 
ticeable  energetic  differences,  particularly  for  non¬ 
halide  ligands.  For  halide  ligands,  conformers  4b 
and  4a  are  close  in  energy  (AE^^p^p  <  3  kcal  mol“\ 
Table  VI).  As  experimental  systems  have  Y  =  Cl, 
theoretical  results  are  consistent  with  experimental 
observation  of  two  rotamers  differing  in  the  orien¬ 
tation  of  the  alkylidene  moiety  [9b].  On  the  other 
hand,  there  is  a  distinct  preference  (>  14  kcal 
mol“^)  for  4a  when  strong  cr-donors  (H,  Me)  are 
coordinated  to  Ru.  Halides  are  7r-donors  while 
phosphines  are  7r-acceptors  so  that  they  should 
work  to  stabilize  and  destabilize,  respectively,  the 
orbitals  involved  in  the  multiple  bond  to  the 
alkylidene  C.  Thus,  it  is  unexpected  that  small 
rotational  barriers  are  seen  for  Y  =  halide.  The 
difference  may  be  due,  in  part,  to  changes  in 
P — Ru — P  and  Y — Ru — Y  angles  which  accom¬ 
pany  rotation  about  the  Ru==C  bond,  as  compres¬ 
sion  of  these  angles  away  from  the  alkylidene 
(increasing  Y — Ru=C  and  P — Ru=C  angles) 
will  polarize  the  orbitals  toward  the  alkylidene 
C,  making  for  more  effective  77-bonding.  Simple 
molecular  mechanics  calculations  also  suggest  that 


the  conformational  energy  difference  about  the 
Ru=C  bond  between  halide  and  nonhalide  lig¬ 
ands  is  partly  steric  in  origin. 

Although  the  changes  in  geometry  of  the  Ru-al- 
kylidenes  as  a  function  of  Y  are  limited,  analysis  of 
calculated  properties  indicate  that  interesting 
changes  are  taking  place  in  the  Ru=C  active  site. 
First,  it  can  be  seen  from  the  atomic  charge  calcu¬ 
lated  by  an  MPA  that  replacement  of  the  hydride 
ligands  in  (H,  H,  H)^  with  either  chlorine  or  fluo¬ 
rine  makes  the  charge  on  the  Ru  more  positive/less 
negative.  Likewise,  these  same  modifications  make 
the  carbon  more  positively  charged.  As  the  magni¬ 
tude  of  the  shifts  in  atomic  charge  are  larger  on  the 
Ru  than  on  the  C,  replacement  of  electroneutral 
with  electronegative  Y  decreases  the  polarization 
of  the  RuC  double  bond.  Finally,  for  both  alkyli¬ 
dene  conformers,  the  MPA  suggests  that  the  elec- 
trophilicity  of  the  Ru  center  can  be  substantially 
enhanced  through  ligand  modification. 

Effect  of  Alkylidene  Substituent  {Z} 

Modification 

The  alkylidene  substituents  have  been  modified 
in  the  same  manner  as  above,  using  Z  =  H,  F,  Cl, 
and  Me,  with  X  =  H  and  Y  =  Cl.  When  the  alkyli¬ 
dene  substituent  in  (H,  Cl,  H)^  is  replaced  by  fluo¬ 
rine,  (H,C1,  F)^,  the  Ru — P  and  Ru — Cl  bond 
lengths  are  almost  unchanged,  but  Ru=C  is  in¬ 
creased  from  1.88  to  1.98  A,  a  difference  of  0.10  A 
(5%).  When  F  is  replaced  with  Cl,  (H,  C1,C1)^, 
again,  the  Ru — P  and  Ru — Cl  bond  lengths  are 
roughly  the  same.  The  alkylidene  bond  length  is 

o  o 

decreased  to  1.97  A,  a  difference  of  0.02  A,  al¬ 
though  the  Ru  =  C  bond  length  is  still  much 
longer  (0.09  A)  than  is  (H,C1,H)^.  The  longest 
alkylidene  bond  length  was  obtained  with  methyl 
substituents  on  the  carbon,  (H,  Cl,  Me)^;  the 
Ru  =  C  bond  length  is  increased  to  2.01  A,  a 
change  of  0.13  A  from  (H,  Cl,  H)^.  As  before,  Ru — P 
and  Ru — Cl  show  relatively  small  changes  with 
modification  of  the  alkylidene  substituents.  The 
Ru  =  C  bond  length  in  (H,  Cl,  Me)  is  the  only  case 
where  conformer  4a  is  longer  than  4b.  The  longer 
bond  length  is  due  to  sterics  between  the  phos¬ 
phine  ligands  and  methyl  substituents  on  the 
alkylidene  carbon.  Conformers  4a  and  4b  are  very 
similar  in  energy  for  the  small  phosphines  studied 
here.  The  bulky  phosphines  needed  to  prevent 
oligomerization  (e.g.,  ^Pr,Cy)  in  an  experimental 
system  will  lead  to  a  preference  for  an  alkylidene 
orientation  in  which  the  alkylidene  plane  is  or- 
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thogonal  to  P — Ru — P  (4b),  as  seen  in  the  solid- 
state  structures  of  Ru-alkylidenes  [9b]  and  in  the 
lower  energy  of  (H,Cl,Me)^  than  of  (H,Cl,Me)^ 
(Table  V). 

The  large  change  in  Ru=C  bond  length  is  in 
contrast  to  what  is  seen  in  early,  high-valent 
alkylidenes  [13,18],  particularly  for  second-  and 
third-row  TMs  that  have  more  covalent  bonding 
than  their  first-row  congeners.  In  those  cases,  it  is 
generally  observed  that  metal-carbon  double 
bonds  are  insensitive  to  modification  of  metal  lig¬ 
ands  and  carbon  substituents.  For  example,  the 
Zr  =  C  bond  in  H2Zr=CH2  is  1.99  A  [18],  while 
that  in  H2Zr=CH2  is  only  0.04  A  (2%)  longer. 
Likewise,  for  a  third-row  TM,  going  from  a  meth- 
ylidene  H3Ta  =  CH2  (Ta  =  C  =  1.90  A)  to 
H3Ta  =  Cl2  (Ta=^C  =  1.94  A)  also  results  in  a 
2%  increase  in  the  metal-ligand  multiple  bond 
length  [18].  In  such  cases,  substituents  such  as 
chlorine  generally  transmit  their  influence  through 
the  TT-system  [13,18].  As  indicated  by  atomic 
charges  from  an  MPA,  fluorine  yields  a  substan¬ 
tially  more  electrophilic  alkylidene  carbon,  sug¬ 
gesting  that  its  influence  is  transmitted  primarily 
through  cr-withdrawing  rather  than  7r-donating 
ability.  Interestingly,  chlorine  substituents  led  to  a 
larger  change  in  the  charge  on  ruthenium  than 
carbon:  =  -  0.59  and  q^  =  0.19  (H,  Cl,  H)^;  and 

^Ru  =  -0.40  and  q^  =  0.05  (H,C1,C1)^.  The  less 
positive  charge  on  the  alkylidine  carbon  in 
(H,  Cl,  Cl)^  indicates  a  dominance  for  the  chlorine 
substituent  of  tt  over  a  effects. 


Discussion 

Analyses  of  computational  results  for  Pt-phos- 
phinidenes  and  Ru-alkylidenes  point  to  some  in¬ 
teresting  differences  between  these  late  TM,  multi¬ 
ply  bonded  complexes  and  their  more  heavily 
studied,  computationally  and  experimentally,  early 
TM  analogs. 

For  Pt-phosphinidenes,  the  steric  bulk  of  R  has 
an  effect  on  bond  angle,  as  expected.  The  sub¬ 
stituent  size,  however,  was  kept  small  for  compu¬ 
tational  feasibility,  so  this  was  not  vigorously  pur¬ 
sued.  However,  the  calculation  on  the  (H3P)2 
Pt=PH  with  a  constrained  linear  Pt=P — H  an¬ 
gle  suggests  that  substituents  that  engender  large 
Pt=P — R  angles  should  serve  to  strengthen  the 
Pt=P  bond.  The  bidentate  phosphine  dpm 
(H2PCH2CH2)  also  led  to  a  shortening  of  the 


Pt=P  bond  length  vs.  bis(monodentate-phos- 
phine)  analogs. 

Halogenated  substituents  (both  aliphatic  and 
aromatic)  caused  small  decreases  in  the  Pt=P 
bond  length.  Somewhat  surprisingly,  the  bonding 
and  structure  of  the  Pt-phosphinidene  moiety  was 
more  sensitive  to  substituent  changes  in  alkylphos- 
phinidenes  than  in  arylphosphinidenes.  Analysis 
of  the  molecular  orbitals  for  (H3P)2Pt=P — CH3 
and  (H3P)2Pt=P — suggests  that  this  may 
be  due  to  lack  of  delocalization  of  phenyl-based 
MOs  to  the  Pt— P  "'active  site,"  while  in  the 
(H3P)2Pt=P — CH3,  there  is  some  mixing  of 
C — H  (T  orbitals  with  Pt=P  orbitals.  These 
points,  taken  together,  suggest  that  the  Pt=P 
bond  is  primarily  affected  through  the  cr  frame¬ 
work,  a  conclusion  that  is  consistent  with  a  polar¬ 
ized  description  of  the  Pt-phosphinidene  bond,  A. 

The  largest  decreases  in  Pt=P  bond  length 
were  caused  by  cyano  substituents.  Hence,  calcula¬ 
tions  suggest  that  bulky  phosphinidene  sub¬ 
stituents  (coupled  with  bulky  bidentate  phosphine 
ancillary  ligands),  combined  with  cyano  phos¬ 
phinidene  substituents,  could  be  a  means  to  ob¬ 
taining  a  terminal  platinum  phosphinidene  with 
sufficient  stability  for  structural  characterization. 

Strong  o“-donor  ligands  (e.g.,  hydride)  led  to 
large  rotational  barriers  about  the  Ru=C  bond. 
On  the  other  hand,  halide  ligands  show  small 
differences  between  Ru-alkylidene  conformers 
(4a,  4b).  The  latter  observation  is  consistent  with 
experiments  [9b]  that  have  been  interpreted  as 
pointing  to  two  conformers  in  solution  that  differ 
in  the  orientation  of  the  alkylidene  moiety  with 
respect  to  the  other  ligands. 

The  changes  in  molecular  and  electronic  struc¬ 
ture  of  Ru-alkylidenes  as  a  function  of  metal  lig¬ 
and  and  alkylidene  substituent  are  larger  than 
observed  for  Pt-phosphinidenes  studied  in  this 
work  and  also  those  observed  in  research  on 
alkylidenes  involving  high- valent,  early  TMs  [13, 
18].  These  observations  are  important  in  two  re¬ 
spects:  First,  for  early  TM  =  MG  complexes,  there 
is  greater  chemical  diversity  with  changes  in 
chemical  environment  when  MG  is  a  heavy  ele¬ 
ment  (e.g.,  phosphorus)  [19].  Second,  for  late  TMs, 
the  situation  is  reversed.  Based  on  observations  to 
date,  our  current  working  hypothesis  is  as  follows: 
For  TMg3riy=MGiight  complexes,  the  strength  of 
the  metal-ligand  tt  bond  engenders  a  degree  of 
insensitivity  to  chemical  modification.  For 
TMgaj.iy~^Gheavy  complexes,  the  situation  is  re¬ 
versed  due  primarily  to  weak  metal-ligand  tt 
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bonding.  The  TMi3tg=MGijgi^t  complexes  must 
have  weaker  tt  bonding  than  observed  for  early 
metal  congeners,  making  them  akin,  in  a  concep- 
tual  sense,  to  TM^^riy  ^  ^^heavy  systems  with  re¬ 
spect  to  their  response  to  ligand  and  substituent 
modification.  Finally,  for  com¬ 

plexes,  the  inherently  weaker  77-bonding  of  the 
heavier  MG  elements,  coupled  with  the  low-valent 
nature  of  the  metals,  conspire  to  yield  a  bonding 
situation  where  77-bonding  is  virtually  nonexistent 
so  that  any  effects  on  the  metal-ligand  multiple 
bond  must  be  transmitted  through  the  cr  frame¬ 
work  which  generally  results  in  small  changes. 

The  foregoing  hypotheses  suggest  that 
TM=MG  complexes  involving  late  TMs,  as  they 
are  increasingly  explored  through  theory  and  ex¬ 
periment,  will  yield  new  chemistry  not  observed 
in  the  more  well-studied  early,  TM  complexes. 
How  these  changes  can  be  exploited  for  catalytic 
or  synthetic  benefit  will  require  a  more  in-depth 
analysis  of  the  reactivity  of  these  systems.  Such 
studies  are  now  under  way  in  our  lab. 
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ABSTRACT:  Values  of  several  parameters  appearing  in  the  modified  Bethe-Bloch 
theory  must  be  known  if  one  is  to  calculate  the  stopping  power  for  a  given 
projectile-target  combination  within  the  energy  interval  of  applicability  of  the  theory.  In 
the  course  of  the  past  several  years,  the  author  has  established  values  for  kapton  and 
mylar  target  materials  through  fits  of  experimental  data  with  the  modified  Bethe-Bloch 
theory.  During  the  past  year,  several  sets  of  measurements  of  polystyrene  stopping 
powers  for  light  projectiles  have  been  studied  so  as  to  extract  values  of  Bethe-Bloch 
parameters  also  for  this  target  material.  Results  of  these  studies,  including  uncertainties 
in  the  parameter  values  that  reflect  the  number  and  accuracy  of  the  measurements 
analyzed,  are  summarized  in  this  presentation.  ©  1997  John  Wiley  &  Sons,  Inc.  Int  J  Quant 
Chem  65:  997-1003,  1997 


Introduction 

The  stopping  power  of  matter  for  charged  par¬ 
ticles  has  been  studied  for  nearly  a  century 
now.  Not  only  is  this  topic  one  of  theoretical  im¬ 
portance  in  several  areas  of  physics,  but  it  holds 
great  significance  in  numerous  areas  of  experimen¬ 
tal  physics,  i.e.,  it  is  often  necessary  to  know  with 
considerable  accuracy  the  energy  loss  of  a  charged 
subatomic  projectile  during  traversal  of  a  known 
thickness  of  target  material.  If  one  is  to  avoid 
energy-loss  measurements  for  every  projectile- 
target  combination  encountered,  a  reliable  means 


of  calculation  must  be  available.  In  this  connection, 
the  modified  Bethe-Bloch  theory  of  stopping 
power  has  proved  very  helpful  over  a  broad  range 
of  projectile  velocities,  provided  that  values  of 
several  parameters  of  the  theory  are  available  [1-4]. 
In  the  case  of  a  target  that  manifests  aggregation 
effects,  such  as  alloys  or  compounds,  other  compli¬ 
cations  arise.  In  these  cases,  one  must  invoke 
Bragg's  rule  of  (linear)  additivity  of  stopping  ef¬ 
fects  [5].  A  technique  for  calculating  energy  loss  in 
this  sort  of  target  has  recently  been  explained 
[6-9]. 

In  the  past  few  years,  three  measurements  of 
the  stopping  powers  of  polystyrene  for  light  pro¬ 
jectiles  have  been  reported  [10-12],  two  of  them 
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[10,12]  covering  lower  energy  intervals  than,  and 
one  of  them  [11]  overlapping  considerably,  that 
explored  previously  by  the  author  [13,14].  The 
current  study  was  undertaken  with  the  goal  of 
analyzing  for  the  first  time  the  recent  measure¬ 
ments  [10-12]  in  terms  of  the  modified  Bethe- 
Bloch  theory  and  of  comparing  extracted  values  of 
parameters  with  those  established  earlier  through 
experiment  [13,14]  and  by  means  of  theoretical 
calculations.  In  the  case  of  the  dominant 
Bethe-Bloch  parameter,  the  mean  excitation  en¬ 
ergy  (I),  the  previously  determined  experimental 
value  was  71.1  +  2.0  eV  [14],  whereas  a  very  reli¬ 
able  calculated  value  was  68.7  eV  [15].  Nearly  a 
decade  ago,  the  author  similarly  analyzed  several 
extant  sets  of  stopping  power  measurements  for 
two  other  plastics,  mylar  and  kapton,  both  of  which 
are  used  widely  in  numerous  areas  of  physics  and 
the  first  of  which  has  been  extensively  utilized  in 
the  U.S.  space  program  [6,7].  The  results  of  these 
investigations  [6, 7]  are  compared  with  those  of  the 
current  study  of  polystyrene.  Moreover,  uncertain¬ 
ties  in  all  three  sets  of  extracted  parameter  values 
were  calculated  in  the  current  study  for  the  sake  of 
both  clarity  and  comparison. 


Theory 

The  modified  Bethe-Bloch  formalism  and  the 
method  employed  in  analyziitg  stopping  power 
measurements  for  the  purpose  of  extracting  values 
of  the  various  parameters  of  the  formalism  were 
described  in  previous  studies  [1-4, 6-9, 13, 14].  The 
stopping  power,  S,  of  an  elemental  target  of  atomic 
weight  A  and  atomic  number  Z  for  a  projectile  of 
atomic  number  z  and  velocity  u  =  /3  c  is  given  by 

0.30708  Z 

S  =  — ^z^-L,  (1) 

P  ^ 

in  units  of  keV  cm^/mg.  Here,  L,  which  represents 
the  (dimensionless)  stopping  number  per  target 
electron,  contains  three  terms: 

L  =  Lq  +  ^zLj  +  L2,  (2) 

where  the  basic  stopping  number,  Lq,  is  given  by 

/  2mc^l3^ 

^  \  1  - 


The  symbol  I  in  this  expression  signifies  the  target 
mean  excitation  energy,  mc^  represents  the  rest 
mass  energy  of  the  electron,  C  is  the  sum  of  target 
shell  corrections,  and  S  stands  for  the  density 
effect  correction  needed  for  highly  relativistic  pro¬ 
jectiles  [16].  The  mean  excitation  energy  can  be 
determined  from  a  fit  to  accurate  stopping  power 
measurements.  Shell  corrections  can  be  obtained 
with  the  method  of  Bichsel  [17, 18],  where  Walske 
K-  and  L-shell  corrections  [19,20]  are  utilized  with 
appropriate  scaling  factors  employed  in  the  L-shell 
correction  so  as  to  obtain  the  M-  and  N-shell 
corrections: 

C  =  C,(/3^)  +  +  V^Q(H^/3^) 

+  (4) 

In  this  expression,  and  represent  the  K-  and 
L-shell  corrections,  respectively,  and  the  and 
(with  i  ==  L,  M,  N )  denote  the  scaling  factors. 

The  Li  and  L2  terms  in  Eq.  (2)  represent  two 
terms  of  higher  order  in  the  projectile  atomic  num¬ 
ber  (z).  The  (zLj)  term  is  known  as  the  Barkas- 
effect  correction  term,  where  L^  can  be  calculated 
from  one  of  at  least  three  existing  formalisms 
[21-25].  A  comparison  of  the  methods  [2]  has 
shown  that  the  first  [21-23]  clearly  provides  a 
superior  fit  to  measurements  over  the  broadest 
projectile  energy  interval.  Thus,  this  formalism  has 
been  used  in  the  stopping  number  calculated  for 
analyses  in  the  current  study.  In  this  case, 

F(b/x^/^) 

“  2V2^3/2  ' 

where  F  represents  a  function  graphed  in  [21], 
X  =  (18  787)j3^/Z,  and  b  denotes  the  sole  free 
parameter  of  the  formalism.  ^  appears  in  Eq.  (2)  as 
an  adjustable  amplitude  of  the  Barkas-effect  term 
in  order  to  reflect  a  previous  controversy  about 
iinclusion  of  close-collision  contributions  to  the  term 
[1,9].  The  L2  term,  called  the  Bloch  term  [26],  is 
given  by 

L^iy)  =  ~  Re[il^(l  F  iy)],  (6) 

where  il/  represents  the  digamma  function  [27] 
and  y  =  za/jS,  with  a  signifying  the  fine  struc¬ 
ture  constant. 

The  several  parameters  of  the  three  stopping 
number  terms  in  modified  Bethe-Bloch  theory, 
consisting  of  the  mean  excitation  energy  I,  the 


-  -In  I  -  C/Z  -  8/2,  (3) 
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Barkas-effect  parameters  b  and  and  the  scaling 
parameters  for  the  shell  corrections  Vf  and  are 
all  independent  of  projectile  velocity.  Whereas  the 
Bloch  term  clearly  depends  on  the  projectile  en¬ 
ergy  and  charge,  there  are  no  free  parameters. 
Thus,  a  fit  to  stopping  power  measurements  to 
establish  values  of  some  subset  of  the  above 
parameters  should  yield  parameter  values  that  are 
valid  over  the  projectile  energy  interval  in  which 
the  modified  Bethe-Bloch  formula  is  valid. 

The  original  Bohr  and  Bethe  theories  applied 
strictly  to  pure  monatomic  targets  in  the  gaseous 
state  [28].  A  target  as  simple  as  an  element  in  the 
condensed  state,  or  as  a  homonuclear  diatomic 
molecule  in  the  gaseous  state,  manifests  bonding 
effects.  Extension  of  Bethe  theory  to  molecules  and 
condensed  matter  [28]  was  outlined  in  a  recent 
report  [9].  Both  physical-state  effects  and 
chemical-bonding  effects  are  often  grouped  as  ag¬ 
gregation  effects.  A  target  subject  to  aggregation 
effects  will  henceforth  be  referred  to  as  a  compos¬ 
ite  target,  obvious  examples  being  compounds, 
alloys,  and  mixtures.  A  method  of  managing  ag¬ 
gregation  effects,  initially  advanced  by  Bragg  [5], 
is  to  assume  linear  additivity  of  stopping  effects  of 
constituents  in  the  composite  target.  This  assump¬ 
tion,  called  "Bragg's  rule"  or  the  "additivity  rule," 
is  used  as  a  first  approximation  in  the  investiga¬ 
tion  of  aggregation  effects.  In  this  vein,  stopping 
cross  sections,  stopping  powers,  and  various  pa¬ 
rameters  in  the  Bethe-Bloch  formula  are  character¬ 
ized  on  the  basis  of  departures  from  the  additivity 
rule.  The  current  study  is  based  in  part  on  continu¬ 
ing  interest  in  the  additivity  problem.  Calculation 
of  average  parameter  values  has  been  described 
previously  [22,29,30].  The  Bloch  term  [26]  con¬ 
tains  no  target-dependent  parameters,  but  the 
Barkas-effect  correction  term  [21-23]  requires  av¬ 
eraging.  Moreover,  Lq  contains  the  mean  excita¬ 
tion  energy,  J,  and  the  shell  corrections,  C — both 
of  which  are  target-dependent — that  must  be  as¬ 
signed  average  values  [29,30].  The  general  dearth 
of  knowledge  of  the  correct  parameter  values  for 
target  constituents  poses  an  obstacle  to  implemen¬ 
tation  of  averaging  procedures  [9].  Consequently, 
the  parameter  selected  for  testing  the  additivity 
rule  is  the  mean  excitation  energy,  for  which 
the  calculated  average  (Bragg)  value,  Jg,  can  be 
obtained  [28, 29]  as 


In  Jg 


Ey  rijZj  In  ly 


(7) 


where  ny,  Zy,  and  Jy,  respectively,  represent  the 
atomic  concentration,  atomic  number,  and  mean 
excitation  energy  of  the  ;th  component  of  the  com¬ 
posite  material.  I  values  extracted  from  measure¬ 
ments  are  expected  to  exceed  the  respective  Bragg 
values  in  general,  with  the  largest  deviations  oc¬ 
curring  for  strong  chemical  bonds  in  low-Z  com¬ 
pounds,  since  in  such  cases,  a  large  fraction  of  the 
total  number  of  electrons  participate  in  the  bond¬ 
ing.  For  a  given  compound,  the  chemical  binding 
effect  should  be  greatest  at  low  projectile  veloci¬ 
ties,  since  the  tightly  bound  inner-shell  electrons 
contribute  less  to  the  stopping  than  do  the  outer, 
valence  electrons  [31]. 


Method  of  Analysis 

The  basic  procedure  that  is  utilized  in  assigning 
values  to  some  parameters  and  searching  for  val¬ 
ues  of  others  through  fits  to  experimental  data 
[1-4,6-9,13,14,30]  is  based  on  the  premise  that 
the  assigned  values,  which  have  been  established 
through  prior  analyses,  are  essentially  correct. 
Then,  the  number  and  accuracy  of  the  remaining 
parameter  values  that  can  be  ascertained  are 
determined  by  the  number  and  accuracy  of  the 
measurements. 

In  the  ^se  of  polystyrene,  the  average  atomic 
number  (Z)  is  3.500,  the  average  atomic  weight 
(A)  is  6.509,  and  the  Bragg  value  [5]  of  mean 
excitation  energy  (Jg)  based  on  current  values  of 
constituent  mean  excitation  energies  [18]  and  cal¬ 
culated  in  the  usual  manner  [1, 6-9]  is  69.5  eV.  The 
shell  correction  scaling  parameters  for  the  L  shell, 
calculated  according  to  the  Bichsel  rubric  [17, 18], 
are  =  l-OO-  (The  parameter  val¬ 

ues  utilized  for  polystyrene,  mylar,  and  kapton  are 
displayed  in  Table  I.  Constituent  mean  excitation 
energies  of  19.2  eV  for  hydrogen,  81  eV  for  carbon, 

TABLE  I _ _ _ 

Average  atomic  number  (Z),  average  atomic 
weight  (A),  logarithmic  average  mean  excitation 
energy  (Iq),  and  shell  correction  scaling 
parameters  assigned  to  polystyrene,  mylar, 
and  kapton. 


Target 

Z 

A 

Ib  (eV) 

Vl 

Hl 

Polystyrene 

3.50 

6.51 

69.5 

0.188 

1.00 

Mylar 

4.54 

8.74 

78.7 

0.318 

1.00 

Kapton 

5.03 

9.80 

79.6 

0.379 

1.00 
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82  eV  for  nitrogen,  and  106  eV  for  oxygen  were 
taken  from  [18].)  A  typical  method  of  analysis  is  to 
fix  the  values  of  shell  correction  scaling  parame¬ 
ters  and  to  search  for  best-fit  values  of  the  mean 
excitation  energy  (/),  the  Barkas-effect  parameter 
(fo),  and  the  amplitude  of  the  Barkas-effect  term 
( ^  )  in  a  three-parameter  fit  or  to  set  the  value  of  ^ 
at  unity  and  proceed  to  conduct  a  two-parameter 
fit.  The  best  fit  is  established  by  minimizing  the 
root-mean-square  relative  deviation  of  calculated 
from  measured  stopping  powers,  cr.  This  quantity 
is  defined  as 


a  = 


1  ^  /  S  - 

-  E  — 

N  ,.^1 1  AS 


m  )  i 


1/2 


(8) 


for  measurements  at  N  energies,  where  repre¬ 
sents  the  measured  stopping  power;  the  calcu¬ 
lated  stopping  power;  and  AS^,  the  statistical 
uncertainty  in  the  measured  value.  Thus,  a  value 
of  cr  near  unity  indicates  an  acceptable  level  of 
agreement  between  theory  and  experiment. 


Analyses  and  Results  (Protons) 

The  point  of  departure  in  the  present  study  was 
a  reappraisal  of  the  results  of  a  previous  analysis 
[14]  by  the  author  of  measurements  reported  by 
the  author  and  his  collaborators  [13].  The  earlier 
analysis  [14]  was  conducted  prior  to  introducing 
an  automated  fitting  routine  into  the  computer 
code  utilized  [32,33],  i.e.,  manual  searches  were 
carried  out  to  obtain  the  values  (I  =  71,1  ±  1.8  eV 
and  =  1.90  ±  0.05  with  a  =  0.70)  in  the  previous 
study  [14].  The  central  values  were  checked  with 
the  automated  search  routine,  with  the  resulting 
new  values,  I  =  71.0  eV,  b  =  2.14,  and  a  =  0.69. 
Thus,  it  would  appear  that  the  lower  b  value  first 
obtained  was  a  consequence  of  finding  a  local 
minimum  close  to  that  located  in  the  automated 
search.  (The  above  fits  were  made  with  at  a 
higher  value  [0.625]  that  had  been  selected  for  the 
first  analysis  [14].)  When  was  reset  at  0.188  for 
the  current  series  of  analyses,  the  results  were  that 
J  =  71.0  eV,  b  ^  2.26,  and  a  =  0.69.  (A  three- 
parameter  search  for  J,  b,  and  |  proved  fruitless, 
since  the  value  of  b  became  negative  [  -0.312]  and 
that  of  ^  very  small  [0.09],  with  a  =  0.69;  remain¬ 
ing  searches  were  made  in  the  two-parameter 
mode.)  Clearly,  the  effect  of  selecting  a  smaller 
value  of  Vi  in  the  present  study  had  little  effect  on 


the  derived  values  of  1  and  b:  1  remained  the 
same,  whereas  b  rose  by  0.12  to  2.26  and  a  re¬ 
mained  constant. 

Analyses  of  more  recent  measurements  [10, 12] 
followed  in  an  attempt  to  compare  extracted 
parameter  values  with  those  obtained  from  the 
measurements  by  the  author  and  collaborators  with 
2.2-5.9  MeV  protons  [13].  The  first  set  of  new 
measurements  analyzed  was  one  with  0.02-0.4 
MeV  protons  [10].  A  vital  question  to  be  answered 
at  the  outset  was  that  of  the  lowest-energy  data 
point  that  could  be  included  without  utilizing  a 
charge-state  correction.  In  other  recent  studies  by 
the  author,  it  was  learned  that  proton  energies 
extending  down  to  300  keV  could  be  included 
without  appreciable  distortion  of  extracted  param¬ 
eter  values  [4,34,35]  and  that  a  corresponding 
lower  bound  for  alpha  particles  was  1.0  MeV 
[36,37].  In  the  present  study,  proton  data  points 
below  300  keV  were  initially  excluded,  then  only 
those  below  250  keV,  and  finally  only  those  below 
200  keV,  in  an  attempt  to  stretch  the  energy  inter¬ 
val  downward.  The  test  was  successful.  Thus,  the 
data  for  0,2-0.4  MeV  were  included  in  the  analy¬ 
sis,  with  resulting  values  of  J  =  71.9  eV,  b  =  1.20, 
and  a  =  0.04.  Another  set  of  similar  measure¬ 
ments  [12],  from  which  proton  data  points  from 
0.2-0,4  MeV  were  selected,  yielded  /  =  69.8  eV, 
b  =  1.15,  and  a  =  0.28.  The  data  from  [12]  are 
truly  composite  in  nature.  Measurements  with 
and  projectiles  at  40-3000  keV/u  were  re¬ 
ported.  These  data  were  combined  with  previously 
reported  measurements  [10,38]  and  supplemented 
by  means  of  analyses  with  a  fitting  function  [39]  in 
order  to  extend  the  covered  energy  interval  up¬ 
ward,  The  fitting  function  proposed  earlier  [39]  is 
an  expansion  in  energy  per  mass  unit  for  the 
(reciprocal)  stopping  power: 

=  E  (9) 

/=0 

where  units  of  keV/u  for  £  will  yield  units  of 
10~i5  eV  cm^/atom  for  £.  The  procedure  used  was 
first  to  establish  values  by  obtaining  the  best 
overall  fit  to  proton  stopping  powers  from  10  to 
1000  keV  in  the  Janni  tables  [40]  and  then  to 
determine  values  of  the  through  fits  of  the 
experimental  data  from  [10,12,39],  The  resulting 
values  of  coefficients  are  purportedly  valid  for 
hydrogen  projectile  energies  of  40-400  keV /u  [12]. 
Actual  measured  stopping  powers  were  obtained 
from  one  of  the  authors  [41]  in  a  third  set  of  recent 
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TABLE  II _ 

Extracted  values  of  mean  excitation  energy  (/)  and 
Barkas-effect  parameter  (b)  and  associated  error 
function  (a)  for  each  of  six  measurements  of 


polystyrene  stopping  power  for  protons,  including 
the  projectile  energy  Interval  explored  in  each  case. 


Ref. 

AEp  (MeV) 

/(eV) 

b 

cr 

[13] 

2.2 -5.9 

71.0 

2.26 

0.69 

[10] 

o 

1 

CVJ 

o 

71.9 

1.20 

0.04 

[12] 

0.2 -0.4 

69.8 

1.15 

0.28 

[11] 

0.9 -3.0 

62.2 

0.84 

0.43 

[42] 

0.20-0.34 

68.3 

1.12 

0.24 

[38] 

0.20-0.35 

64.8 

1.25 

0.49 

Measurements  from  [12]  with  and  from  40  to  300 
keV/u  were  supplemented  by  the  authors  with  results  of 
other  measurements  [10, 38]  and  of  a  fitting  function  [39]  to 
characterize  the  data  of  an  extensive  compilation  [40]  so  as 
to  extend  the  energy  Interval  upward. 


measurements  [11],  utilizing  0.9-3.0  MeV  protons; 
analysis  yielded  I  =  62.2  eV,  b  =  0.84,  and  cr  = 
0.43.  The  results  of  these  fits  of  proton  measure¬ 
ments  are  displayed  in  Table  II,  along  with  results 
from  current  fits  of  two  older  data  sets  [38,42]. 
These  results  will  be  discussed  extensively  below. 


Analyses  and  Results 
(Alpha  Particles) 

Two  of  the  references  contained  also  stopping 
power  measurements  for  alpha  particles  [11,12]. 
The  data  for  0.8-2.9  MeV  projectiles  [11]  yielded 
1  =  60.7  eV,  b  =  1.66,  and  cr  =  1.77.  (Only  one  of 
the  nine  data  points  featured  a  projectile  energy 
below  1.0  MeV.)  The  measurements  for  1. 0-3.0 
MeV  projectiles  [12]  yielded  I  =  62.5  eV,  b  =  1.48, 
and  a  =  0.75.  Once  again,  it  must  be  noted  that 
the  data  from  [12]  are  composite  in  nature. 
Measurements  with  ^He  and  ^He  projectiles  at 
25-90  keV /u  were  reported.  These  data  were  sup¬ 
plemented  by  previously  reported  measurements 
[11]  in  order  to  extend  the  covered  energy  interval 
upward.  These  data  were  fit  with  the  function 
given  by  Eq.  (9)  to  obtain  new  (He  projectile) 
values  of  the  coefficients,  purportedly  valid  for 
the  energy  interval  from  20  to  750  keV/u  [12].  All 
these  results,  summarized  in  Table  III,  will  be 
discussed  further  below. 


TABLE  III _ 

Extracted  values  of  mean  excitation  energy  (/)  and 
Barkas-effect  parameter  (b)  and  associated  error 
function  (a)  for  each  of  two  measurements  of 
polystyrene  stopping  power  for  alpha  particles, 
including  the  projectile  energy  interval  explored 
in  each  case. 


Ref. 

(MeV) 

/(eV) 

b 

cr 

[11] 

0.8 -2.9 

60.7 

1.66 

1.77 

[12] 

o 

CO 

1 

q 

62.5 

1.48 

0.75 

Measurements  from  [12]  with  ^He  and  "^He  from  25  to  90 
keV/u  were  supplemented  by  results  of  other  measure¬ 
ments  [11]  to  extend  the  energy  interval  upward. 


Discussion  of  Results 

In  the  course  of  developing  the  Barkas-effect 
formalism  utilized  herein  [21-23,43],  it  was  rec¬ 
ommended  by  two  of  the  authors  [43]  that,  in 
general,  b  =  lA  ±  0.1.  Moreover,  according  to  the 
"'13%  rule"  [18],  one  might  expect  that  the  mean 
excitation  energy  for  a  solid  compound  of  low 
atomic  number  will  lie  some  13%  higher  than  the 
Bragg  value;  the  Bragg  value  of  I  for  polystyrene 
is  Ig  =  69.5  eV,  and  1.13Ig  =  78.5  eV.  (Since  con¬ 
stituent  mean  excitation  energies  have  evolved 
over  time,  the  4  values  given  herein  may  differ 
from  those  provided  in  cited  earlier  references.) 
All  values  of  I  extracted  from  experimental  data 
lie  well  below  the  "13%  rule"  prediction,  and  the 
proton-based  values,  instead,  provide  68.0  eV  as  a 
simple  arithmetic  average.  A  reliable  calculated 
value  of  I  is  68.7  eV  [15].  Two  values  of  1  ex¬ 
tracted  from  stopping  power  measurements  with 
70  MeV  protons  and  two  different  samples  of 
polystyrene,  67.55  and  69.0  eV,  were  reported  re¬ 
cently  [44].  Nearly  all  extracted  values  of  the 
Barkas-effect  parameter  (b)  are  eminently  reason¬ 
able,  the  exceptions  being  those  of  proton  data 
from  [11, 13].  I  values  obtained  from  the  proton 
measurements  of  [11,13]  appear  to  be  somewhat 
low  and  high,  respectively.  Only  the  alpha  particle 
data  of  [11]  provided  a  fit  characterized  by  a 
<j-value  greater  than  unity. 

The  experiments  were  conducted  with  thin-film 
transmission  techniques  in  [10,13,38,42]  and  with 
the  Rutherford  back-scattering  method  in  [11,  12]. 
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TABLE  IV _ 

Recommended  values  of  mean  excitation  energy 
(/)  and  Barkas-effect  parameter  ib)  for 
polystyrene,  mylar,  and  kapton,  with 
calculated  uncertainties  (A  /  and  Ab). 


Target 

/(eV) 

A/  (eV) 

b 

Ah 

Polystyrene 

71.9 

6.5 

1.20 

0.36 

Mylar 

80.5 

7.0 

1.33 

0.60 

Kapton 

78.2 

8.1 

1.08 

0.87 

However,  the  complete  data  from  [12]  that  were 
analyzed  herein  were  composite  in  nature,  as  noted 
above.  Hence,  no  systematic  trends  in  extracted 
parameter  values  are  discernible  solely  on  the 
basis  of  the  experimental  method  employed. 

Results  of  searches  for  parameter  values  for 
kapton  and  mylar  were  reported  previously  [6,7]. 
However,  no  uncertainties  in  the  values  were  pro¬ 
vided  in  those  reports.  An  important  feature  of  the 
current  study  is  the  calculation  of  uncertainties  in 
recommended  values  of  I  and  b  for  polystyrene, 
mylar,  and  kapton.  These  uncertainties,  calculated 
with  the  method  described  in  detail  previously 
[1,45],  reflect  the  number  and  accuracy  of  the 
measurements  of  the  stopping  power  for  a 
given  projectile-target  combination.  Part  of  the 
procedure  for  establishing  uncertainties  in  param¬ 
eters  1  and  b  for  mylar  and  kapton  was  to 
calculate  best-fit  central  values  with  the  current 
version  of  the  fitting  program.  Hence,  the  recom¬ 
mended  values  of  these  parameters  differ  slightly 
from  those  previously  published  [6, 7].  The  partic¬ 
ular  measurements  chosen  for  analysis  in  this  por¬ 
tion  of  the  study,  were,  for  each  target  material, 
those  which  reflected  the  greatest  consistency  with 
expectations  based  on  modified  Bethe-Bloch 
theory.  All  three  sets  of  measurements  were 
made  with  protons  traversing  thin-film  targets, 
those  utilized  being  from  [10]  for  polystyrene 
and  from  [46]  for  mylar  and  kapton.  Results  of  this 
portion  of  the  analyses  are  displayed  in  Table  IV. 
Each  uncertainty  shown  was  derived  solely  from 
the  particular  set  of  measurements  analyzed,  thus 
reflecting  the  number  and  accuracy  of  only  those 
measurements. 


Summary 

Eight  experimental  studies  of  the  stopping 
power  of  polystyrene,  six  for  protons  and  two  for 


alpha  particles,  have  been  reviewed  in  order  to 
ascertain  modified  Bethe-Bloch  stopping  power 
parameters  by  consistent  data  analysis  in  each 
case.  Four  of  the  six  proton  studies  were  con¬ 
ducted  with  the  thin-film  transmission  method 
[10,13,38,42],  whereas  the  remaining  four  meas¬ 
urements  were  conducted  with  the  Rutherford 
back-scattering  (RBS)  technique  [11,12].  The  ma¬ 
jority  of  results  for  mean  excitation  energy  (J) 
were  reasonably  consistent  with  the  Bragg  value  of 
69.5  eV  or  a  reliable  theoretical  value  of  68.7  eV 
[15],  although  considerable  dispersion  among  de¬ 
rived  values  was  found.  All  but  two  extracted 
values  of  the  Barkas-effect  parameter  (b)  were  quite 
consistent  with  the  expected  range  of  values  [43] 
of  1.4  ±  0.1,  one  exception  occurring  in  the  case  of 
a  proton  transmission  experiment  [13]  and  the 
other  occurring  in  the  case  of  a  proton  RBS  experi¬ 
ment  [11].  Dispersion  in  extracted  parameter  val¬ 
ues  has  been  encountered  in  previous  reviews  of 
stopping  power  data  [1,8,36,47],  but  no  convinc¬ 
ing  explanation  therefor  has  yet  been  advanced. 
Generally,  this  dispersion  merely  reflects  obvious 
dispersion  among  sets  of  stopping  power  measure¬ 
ments — a  concern  within  the  stopping  power  com¬ 
munity  for  several  decades.  An  additional  factor 
contributing  to  dispersion  among  reported  sets  of 
stopping  power  measurements  is  undoubtedly  the 
use  of  various  smoothing  functions  before  listing 
the  actual  measurements — a  practice  openly  de¬ 
plored  in  a  previous  study  [47].  (An  excellent  ex¬ 
ample  of  this  smoothing  procedure  is  the  afore¬ 
mentioned  report  of  measurements  in  [12].)  The 
practice  can  distort  the  values  of  parameters  ex¬ 
tracted  from  the  data  in  an  analysis  based  on  firm 
theory. 

The  situation  for  kapton  targets  also  required 
careful  assessment  [6,7].  In  one  study  [7],  a  set  of 
proton  measurements  was  rejected  on  the  basis  of 
quite  implausible  values  of  parameters  extracted 
therefrom.  Similarly,  in  the  case  of  mylar  targets, 
another  study  [7]  resulted  in  doubt  being  cast  on 
the  validity  of  a  set  of  alpha  particle  measure¬ 
ments,  These  selection  processes  will  hopefully  be 
justified  by  future  sets  of  definitive  measurements. 
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